SlideShare a Scribd company logo
1
Lecture-7Lecture-7
De-normalizationDe-normalization
Mamuna Fatima

2
Striking a balance between “good” & “evil”
Flat Table
Data Lists
Data Cubes 1st
Normal Form
2nd
Normal Form
3rd
Normal Form
4+ Normal Forms
NormalizationDe-normalization
One big flat file
Too many tables

3
What is De-normalization?
the aim is to enhance performance without
loss of information.
 Normalization is a rule of thumb in DBMS,
but in DSS ease of use is achieved by way of
denormalization.
 De-normalization comes in many flavors,
such as combining tables, splitting tables,
adding data etc., but all done very carefully.

 Bringing “close” dispersed but related data items.
 Very early studies showed performance difference in orders of
magnitude for different number de-normalized tables and rows
per table.
 The level of de-normalization should be carefully considered.
4
Why De-normalization In DSS?

5
How De-normalization improves performance?
De-normalization specifically improves
performance by either:
 Reducing the number of tables and hence the
reliance on joins, which consequently speeds up
performance.
 Reducing the number of joins required during
query execution, or

6
4 Guidelines for De-normalization
1. Carefully do a cost-benefit analysis
(frequency of use, additional storage,
join time).
2. Do a data requirement and storage
analysis.
3. When in doubt, don’t denormalize.

7
Areas for Applying De-Normalization Techniques
 Dealing with the abundance of star schemas.
 Fast access of time series data for analysis.
 Fast aggregate (sum, average etc.) results and
complicated calculations.
 Multidimensional analysis (e.g. geography) in a complex
hierarchy.
 Dealing with few updates but many join queries.
De-normalization will ultimately affect the database size and
query performance.

8
Five principal De-normalization techniques
1. Collapsing Tables.
- Two entities with a One-to-One relationship.
- Two entities with a Many-to-Many relationship.
2. Splitting Tables (Horizontal/Vertical Splitting).
3. Pre-Joining.
4. Adding Redundant Columns (Reference Data).
5. Derived Attributes (Summary, Total, Balance etc).

9
Collapsing Tables
ColA ColB
ColA ColC
normalized
ColA ColB ColC
denormalized
 Reduced storage space.
 Reduced update time.
 Does not changes business view.
 Reduced foreign keys.

9
Collapsing Tables
ColA ColB
ColA ColC
normalized
ColA ColB ColC
denormalized
 Reduced storage space.
 Reduced update time.
 Does not changes business view.
 Reduced foreign keys.

More Related Content

PPTX
introduction to NOSQL Database
nehabsairam
 
PPT
Data warehouse
shachibattar
 
PPT
Deadlock management
Ahmed kasim
 
PPTX
Etl elt simplified
Ramchandra Koty
 
PPTX
Map Reduce
Prashant Gupta
 
PPTX
multi dimensional data model
moni sindhu
 
PPT
Dwh lecture 08-denormalization tech
Sulman Ahmed
 
PPTX
Distributed dbms architectures
Pooja Dixit
 
introduction to NOSQL Database
nehabsairam
 
Data warehouse
shachibattar
 
Deadlock management
Ahmed kasim
 
Etl elt simplified
Ramchandra Koty
 
Map Reduce
Prashant Gupta
 
multi dimensional data model
moni sindhu
 
Dwh lecture 08-denormalization tech
Sulman Ahmed
 
Distributed dbms architectures
Pooja Dixit
 

What's hot (20)

PPTX
Lec 7 query processing
Md. Mashiur Rahman
 
PPTX
Ppt
bullsrockr666
 
PDF
Spark Streaming | Twitter Sentiment Analysis Example | Apache Spark Training ...
Edureka!
 
PDF
management of distributed transactions
Nilu Desai
 
PPT
Data Warehouse Modeling
vivekjv
 
PPT
Data warehouse
Medma Infomatix (P) Ltd.
 
PPT
Data warehouse architecture
uncleRhyme
 
PPTX
Data Warehouse
MadhuriNigam1
 
PPTX
Concurrency control
Subhasish Pati
 
PPTX
Introduction to distributed database
Sonia Panesar
 
PPTX
Denormalization
Amna Magzoub
 
PPT
Sql Server Performance Tuning
Bala Subra
 
PPTX
SQL for ETL Testing
Garuda Trainings
 
PPTX
Database architecture
VENNILAV6
 
POTX
database
Shwetanshu Gupta
 
PPTX
ETL big data with apache hadoop
Maulik Thaker
 
PPTX
Data warehouse
Yogendra Uikey
 
PPT
Database management system presentation
sameerraaj
 
PPTX
Spark introduction and architecture
Sohil Jain
 
Lec 7 query processing
Md. Mashiur Rahman
 
Spark Streaming | Twitter Sentiment Analysis Example | Apache Spark Training ...
Edureka!
 
management of distributed transactions
Nilu Desai
 
Data Warehouse Modeling
vivekjv
 
Data warehouse
Medma Infomatix (P) Ltd.
 
Data warehouse architecture
uncleRhyme
 
Data Warehouse
MadhuriNigam1
 
Concurrency control
Subhasish Pati
 
Introduction to distributed database
Sonia Panesar
 
Denormalization
Amna Magzoub
 
Sql Server Performance Tuning
Bala Subra
 
SQL for ETL Testing
Garuda Trainings
 
Database architecture
VENNILAV6
 
ETL big data with apache hadoop
Maulik Thaker
 
Data warehouse
Yogendra Uikey
 
Database management system presentation
sameerraaj
 
Spark introduction and architecture
Sohil Jain
 
Ad

Similar to Dwh lecture 07-denormalization (20)

PPT
Dwh lecture-07-denormalization
Sulman Ahmed
 
PPT
Lecture 7
Shani729
 
PPT
denormalization.ppt
ABUSUFYAN55
 
PDF
De normalozation
Khuram Shahzad
 
PPT
When & Why\'s of Denormalization
Aliya Saldanha
 
PPT
Intro to Data warehousing Lecture 04
AnwarrChaudary
 
PDF
RDBMS Denormalization - Benefits & Pitfalls
Shyam Anand
 
PDF
Normalization
PratibhaRashmiSingh
 
PDF
Normalisation [Slides].pdf introduction language
AndrewSilungwe2
 
PPTX
Importance of Normalization
Shwe Yee
 
PDF
Cs437 lecture 7-8
Aneeb_Khawar
 
PPT
Lecture 9
Shani729
 
PPTX
What is Normalization?
Ducat
 
PPT
Normalization.ppt What is Normalizations
SHAKIR325211
 
PPTX
Database Normalisation
sheetalverma38
 
PPTX
Sql server ___________session3-normailzation
Ehtisham Ali
 
PPT
MODULE 3 -Normalization bwdhwbifnweipfnewknfqekndd_1.ppt
HemaSenthil5
 
PPT
MODULE 3 -Normalization_1.ppt moduled in design
HemaSenthil5
 
PDF
Ijsws14 423 (1)-paper-17-normalization of data in (1)
Raghavendra Pokuri
 
Dwh lecture-07-denormalization
Sulman Ahmed
 
Lecture 7
Shani729
 
denormalization.ppt
ABUSUFYAN55
 
De normalozation
Khuram Shahzad
 
When & Why\'s of Denormalization
Aliya Saldanha
 
Intro to Data warehousing Lecture 04
AnwarrChaudary
 
RDBMS Denormalization - Benefits & Pitfalls
Shyam Anand
 
Normalization
PratibhaRashmiSingh
 
Normalisation [Slides].pdf introduction language
AndrewSilungwe2
 
Importance of Normalization
Shwe Yee
 
Cs437 lecture 7-8
Aneeb_Khawar
 
Lecture 9
Shani729
 
What is Normalization?
Ducat
 
Normalization.ppt What is Normalizations
SHAKIR325211
 
Database Normalisation
sheetalverma38
 
Sql server ___________session3-normailzation
Ehtisham Ali
 
MODULE 3 -Normalization bwdhwbifnweipfnewknfqekndd_1.ppt
HemaSenthil5
 
MODULE 3 -Normalization_1.ppt moduled in design
HemaSenthil5
 
Ijsws14 423 (1)-paper-17-normalization of data in (1)
Raghavendra Pokuri
 
Ad

More from Sulman Ahmed (20)

PPT
Entrepreneurial Strategy Generating and Exploiting new entries
Sulman Ahmed
 
PPT
Entrepreneurial Intentions and corporate entrepreneurship
Sulman Ahmed
 
PPT
Entrepreneurship main concepts and description
Sulman Ahmed
 
PPTX
Run time Verification using formal methods
Sulman Ahmed
 
PPTX
Use of Formal Methods at Amazon Web Services
Sulman Ahmed
 
PPTX
student learning App
Sulman Ahmed
 
PPTX
Software Engineering Economics Life Cycle.
Sulman Ahmed
 
PPTX
Data mining Techniques
Sulman Ahmed
 
PPTX
Rules of data mining
Sulman Ahmed
 
PPTX
Rules of data mining
Sulman Ahmed
 
PPTX
Classification in data mining
Sulman Ahmed
 
PPTX
Data mining Basics and complete description
Sulman Ahmed
 
PPTX
Data mining Basics and complete description onword
Sulman Ahmed
 
PPT
Dwh lecture-06-normalization
Sulman Ahmed
 
PPT
Dwh lecture 12-dm
Sulman Ahmed
 
PPT
Dwh lecture 13-process dm
Sulman Ahmed
 
PPT
Dwh lecture 11-molap
Sulman Ahmed
 
PPT
Dwh lecture 10-olap
Sulman Ahmed
 
PPT
Wbs
Sulman Ahmed
 
PPT
Wbs, estimation and scheduling
Sulman Ahmed
 
Entrepreneurial Strategy Generating and Exploiting new entries
Sulman Ahmed
 
Entrepreneurial Intentions and corporate entrepreneurship
Sulman Ahmed
 
Entrepreneurship main concepts and description
Sulman Ahmed
 
Run time Verification using formal methods
Sulman Ahmed
 
Use of Formal Methods at Amazon Web Services
Sulman Ahmed
 
student learning App
Sulman Ahmed
 
Software Engineering Economics Life Cycle.
Sulman Ahmed
 
Data mining Techniques
Sulman Ahmed
 
Rules of data mining
Sulman Ahmed
 
Rules of data mining
Sulman Ahmed
 
Classification in data mining
Sulman Ahmed
 
Data mining Basics and complete description
Sulman Ahmed
 
Data mining Basics and complete description onword
Sulman Ahmed
 
Dwh lecture-06-normalization
Sulman Ahmed
 
Dwh lecture 12-dm
Sulman Ahmed
 
Dwh lecture 13-process dm
Sulman Ahmed
 
Dwh lecture 11-molap
Sulman Ahmed
 
Dwh lecture 10-olap
Sulman Ahmed
 
Wbs, estimation and scheduling
Sulman Ahmed
 

Recently uploaded (20)

PPTX
Measures_of_location_-_Averages_and__percentiles_by_DR SURYA K.pptx
Surya Ganesh
 
PPTX
family health care settings home visit - unit 6 - chn 1 - gnm 1st year.pptx
Priyanshu Anand
 
PDF
What is CFA?? Complete Guide to the Chartered Financial Analyst Program
sp4989653
 
DOCX
pgdei-UNIT -V Neurological Disorders & developmental disabilities
JELLA VISHNU DURGA PRASAD
 
PPTX
BASICS IN COMPUTER APPLICATIONS - UNIT I
suganthim28
 
DOCX
Unit 5: Speech-language and swallowing disorders
JELLA VISHNU DURGA PRASAD
 
PDF
Health-The-Ultimate-Treasure (1).pdf/8th class science curiosity /samyans edu...
Sandeep Swamy
 
DOCX
SAROCES Action-Plan FOR ARAL PROGRAM IN DEPED
Levenmartlacuna1
 
PPTX
Dakar Framework Education For All- 2000(Act)
santoshmohalik1
 
PPTX
How to Track Skills & Contracts Using Odoo 18 Employee
Celine George
 
PDF
Module 2: Public Health History [Tutorial Slides]
JonathanHallett4
 
PDF
RA 12028_ARAL_Orientation_Day-2-Sessions_v2.pdf
Seven De Los Reyes
 
PDF
Biological Classification Class 11th NCERT CBSE NEET.pdf
NehaRohtagi1
 
PDF
Review of Related Literature & Studies.pdf
Thelma Villaflores
 
PPTX
Python-Application-in-Drug-Design by R D Jawarkar.pptx
Rahul Jawarkar
 
PPTX
How to Close Subscription in Odoo 18 - Odoo Slides
Celine George
 
PPTX
CONCEPT OF CHILD CARE. pptx
AneetaSharma15
 
PDF
2.Reshaping-Indias-Political-Map.ppt/pdf/8th class social science Exploring S...
Sandeep Swamy
 
PPTX
INTESTINALPARASITES OR WORM INFESTATIONS.pptx
PRADEEP ABOTHU
 
PPTX
Basics and rules of probability with real-life uses
ravatkaran694
 
Measures_of_location_-_Averages_and__percentiles_by_DR SURYA K.pptx
Surya Ganesh
 
family health care settings home visit - unit 6 - chn 1 - gnm 1st year.pptx
Priyanshu Anand
 
What is CFA?? Complete Guide to the Chartered Financial Analyst Program
sp4989653
 
pgdei-UNIT -V Neurological Disorders & developmental disabilities
JELLA VISHNU DURGA PRASAD
 
BASICS IN COMPUTER APPLICATIONS - UNIT I
suganthim28
 
Unit 5: Speech-language and swallowing disorders
JELLA VISHNU DURGA PRASAD
 
Health-The-Ultimate-Treasure (1).pdf/8th class science curiosity /samyans edu...
Sandeep Swamy
 
SAROCES Action-Plan FOR ARAL PROGRAM IN DEPED
Levenmartlacuna1
 
Dakar Framework Education For All- 2000(Act)
santoshmohalik1
 
How to Track Skills & Contracts Using Odoo 18 Employee
Celine George
 
Module 2: Public Health History [Tutorial Slides]
JonathanHallett4
 
RA 12028_ARAL_Orientation_Day-2-Sessions_v2.pdf
Seven De Los Reyes
 
Biological Classification Class 11th NCERT CBSE NEET.pdf
NehaRohtagi1
 
Review of Related Literature & Studies.pdf
Thelma Villaflores
 
Python-Application-in-Drug-Design by R D Jawarkar.pptx
Rahul Jawarkar
 
How to Close Subscription in Odoo 18 - Odoo Slides
Celine George
 
CONCEPT OF CHILD CARE. pptx
AneetaSharma15
 
2.Reshaping-Indias-Political-Map.ppt/pdf/8th class social science Exploring S...
Sandeep Swamy
 
INTESTINALPARASITES OR WORM INFESTATIONS.pptx
PRADEEP ABOTHU
 
Basics and rules of probability with real-life uses
ravatkaran694
 

Dwh lecture 07-denormalization

  • 2.  2 Striking a balance between “good” & “evil” Flat Table Data Lists Data Cubes 1st Normal Form 2nd Normal Form 3rd Normal Form 4+ Normal Forms NormalizationDe-normalization One big flat file Too many tables
  • 3.  3 What is De-normalization? the aim is to enhance performance without loss of information.  Normalization is a rule of thumb in DBMS, but in DSS ease of use is achieved by way of denormalization.  De-normalization comes in many flavors, such as combining tables, splitting tables, adding data etc., but all done very carefully.
  • 4.   Bringing “close” dispersed but related data items.  Very early studies showed performance difference in orders of magnitude for different number de-normalized tables and rows per table.  The level of de-normalization should be carefully considered. 4 Why De-normalization In DSS?
  • 5.  5 How De-normalization improves performance? De-normalization specifically improves performance by either:  Reducing the number of tables and hence the reliance on joins, which consequently speeds up performance.  Reducing the number of joins required during query execution, or
  • 6.  6 4 Guidelines for De-normalization 1. Carefully do a cost-benefit analysis (frequency of use, additional storage, join time). 2. Do a data requirement and storage analysis. 3. When in doubt, don’t denormalize.
  • 7.  7 Areas for Applying De-Normalization Techniques  Dealing with the abundance of star schemas.  Fast access of time series data for analysis.  Fast aggregate (sum, average etc.) results and complicated calculations.  Multidimensional analysis (e.g. geography) in a complex hierarchy.  Dealing with few updates but many join queries. De-normalization will ultimately affect the database size and query performance.
  • 8.  8 Five principal De-normalization techniques 1. Collapsing Tables. - Two entities with a One-to-One relationship. - Two entities with a Many-to-Many relationship. 2. Splitting Tables (Horizontal/Vertical Splitting). 3. Pre-Joining. 4. Adding Redundant Columns (Reference Data). 5. Derived Attributes (Summary, Total, Balance etc).
  • 9.  9 Collapsing Tables ColA ColB ColA ColC normalized ColA ColB ColC denormalized  Reduced storage space.  Reduced update time.  Does not changes business view.  Reduced foreign keys.
  • 10.  9 Collapsing Tables ColA ColB ColA ColC normalized ColA ColB ColC denormalized  Reduced storage space.  Reduced update time.  Does not changes business view.  Reduced foreign keys.

Editor's Notes

  • #3: In the reality of the “real world”, the enhancement in performance delivered by some selective de-normalization technique can be a very valuable tool.
  • #4: De-normalization does not mean chaos or disorder or indiscipline. De-normalization is the process of selectively transforming normalized relations into un-normalized physical record specifications, with the aim ofreducing query processing time.
  • #8: De-normalization will ultimately affect the database size and query performance. De-normalization is especially useful while dealing with the abundance of star schemas that are found in many data warehouse installations. For such cases, de-normalization provides better performance and a more natural data structure for supporting decision making. The goal of most analytical processes in a typical data warehouse environment is to access aggregates such as averages, sums, complicated formula calculations, top 10 customers etc. Typical OLTP systems contain only the raw transaction data, whiledecision makers expect to find aggregated and time-series data in their data warehouse to get the big picture through immediate query and display.
  • #10: One of the most common and safe de-normalization techniques is combining of One-to One relationships. This situation occurs when for each row of entity A, there is only one related row in entity B. For example, if users frequently need to see COLA, COLB, and COLC together and the data from the two tables are in a One-to-One relationship, the solution is to collapse the two tables into one. For example, SID and gender in one table, and SID and degree in the other table. In general, collapsing tables in One-to-One relationship has fewer drawbacks than others. There are several advantages of this technique, some of the obvious ones being reduced storage space, reduced amount of time for data update, some of the other not so apparent advantages are reduced number of foreign keys on tables, reduced number of indexes since most indexes are created based on primary/foreign keys). Furthermore, combiningthe columns does not change the business view, but does decrease access time by having fewer physical objects and reducing overhead.