2
Most read
3
Most read
18
Most read
Data Cube:
A Multidimensional
Data Model
Example
• Although we usually think of cubes as 3-D geometric structures, in data
warehousing the data cube is n-dimensional.
• To gain a better understanding of data cubes and the multidimensional
data model, let’s start by looking at a simple 2-D data cube that is, in fact, a
table or spreadsheet for sales data from AllElectronics.
• In particular, we will look at the AllElectronics sales data for items sold per
quarter in the city of Vancouver.
• These data are shown in Table 4.2. In this 2-D representation, the sales for
Vancouver are shown with respect to the time dimension (organized in
quarters) and the item dimension (organized according to the types of
items sold).
• The fact or measure displayed is dollars sold (in thousands).
2-D
Example
• Now, suppose that we would like to view the sales data with a third
dimension.
• For instance, suppose we would like to view the data according to
time and item, as well as location, for the cities Chicago, New York,
Toronto, and Vancouver.
• These 3-D data are shown in Table 4.3. The 3-D data in the table are
represented as a series of 2-D tables.
• Conceptually, we may also represent the same data in the form of a 3-
D data cube, as in Figure 4.3.
3-D
In this way, we may display any n-dimensional data as a series of (n-1)-dimensional “cubes.” The data
cube is a metaphor for multidimensional data storage. The actual physical storage of such data may
differ from its logical representation. The important thing to remember is that data cubes are n-
dimensional and do not confine data to 3-D.
3-D data cube representation
4.4 is often referred to as a cuboid
Lattice
Stars, Snowflakes, and Fact Constellations:
Schemas for Multidimensional Data Models
The most popular data model for a data warehouse is a multidimensional
model, which can exist in the form of a star schema, a snowflake schema, or
a fact constellation schema. Let’s look at each of these.
• Star schema: The most common modeling paradigm is the star schema, in
which the data warehouse contains (1) a large central table (fact table)
containing the bulk of the data, with no redundancy, and (2) a set of
smaller attendant tables (dimension tables), one for each dimension. The
schema graph resembles a starburst, with the dimension tables displayed
in a radial pattern around the central fact table.
Star schema of sales data warehouse
Snowflake schema: The snowflake schema is a variant of the star
schema model, where some dimension tables are normalized, thereby
further splitting the data into additional tables. The resulting schema
graph forms a shape similar to a snowflake.
Snowflake schema of a sales data warehouse
• Fact constellation: Sophisticated applications may require multiple
fact tables to share dimension tables. This kind of schema can be
viewed as a collection of stars, and hence is called a galaxy schema or
a fact constellation.
Fact constellation schema of a sales and shipping
data warehouse
Dimensions: The Role of Concept Hierarchies
• A concept hierarchy defines a sequence of mappings from a set of
low-level concepts to higher-level, more general concepts.
• Consider a concept hierarchy for the dimension location. City values
for location include Vancouver, Toronto, New York, and Chicago.
• Each city, however, can be mapped to the province or state to which it
belongs.
• For example, Vancouver can be mapped to British Columbia, and
Chicago to Illinois.
Example
Hierarchical and lattice structures
A concept hierarchy for price.
Examples of typical OLAP operations on
multidimensional data
A Starnet Query Model for Querying
Multidimensional Databases
Source

More Related Content

PPTX
DevOps introduction
PPTX
Histogram
PPTX
DevOps seminar ppt
PPT
3.2 partitioning methods
PPTX
PPTX
Feature selection concepts and methods
PDF
IBM Watson Overview
PPTX
Artificial intelligence in industry
DevOps introduction
Histogram
DevOps seminar ppt
3.2 partitioning methods
Feature selection concepts and methods
IBM Watson Overview
Artificial intelligence in industry

What's hot (20)

PPTX
Data warehouse
PPT
Data Warehouse Modeling
PPTX
Big Data Analytics with Hadoop
PPT
Mining Frequent Patterns, Association and Correlations
PPTX
Introduction to Data Warehousing
PPTX
Data Integration and Transformation in Data mining
PDF
The Data Science Process
PPTX
Data warehouse architecture
DOCX
PPT
Star schema PPT
PPT
Data Warehouse Basic Guide
PPTX
Data Modeling PPT
PPTX
Data mining concepts and work
PPTX
Data cubes
PDF
Data mining (lecture 1 & 2) conecpts and techniques
PPTX
Mobile dbms
PPTX
Data Warehouse
PPT
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olap
PPTX
WEB BASED INFORMATION RETRIEVAL SYSTEM
PPTX
Information retrieval 13 alternative set theoretic models
Data warehouse
Data Warehouse Modeling
Big Data Analytics with Hadoop
Mining Frequent Patterns, Association and Correlations
Introduction to Data Warehousing
Data Integration and Transformation in Data mining
The Data Science Process
Data warehouse architecture
Star schema PPT
Data Warehouse Basic Guide
Data Modeling PPT
Data mining concepts and work
Data cubes
Data mining (lecture 1 & 2) conecpts and techniques
Mobile dbms
Data Warehouse
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olap
WEB BASED INFORMATION RETRIEVAL SYSTEM
Information retrieval 13 alternative set theoretic models
Ad

Similar to Data cube (20)

PPTX
Data minng and warehousing lecture notes 1PowerPoint.pptx
PDF
Data mining 3 - Data Models and Data Warehouse Design (cheat sheet - printable)
PPTX
Data Warehouse_Architecture.pptx
PPTX
Data Warehousing for students educationpptx
PPT
My2dw
DOCX
Star ,Snow and Fact-Constullation Schemas??
PPTX
multi dimensional data model
PDF
dataminingpres-150821063129-lva1-app6891 (3).pdf
PPTX
CSC612 THIRD LECTURE ON DATA WAREHOUSE.pptx
PPT
Data Warehousing and Data Mining
PDF
Olap fundamentals
PPTX
19CS3052R-CO1-7-S7 ECE
PPTX
CHAPTER 2 - Datawarehouse Architecture.pptx
PPTX
Unit 2- Data Warehouse Logical Design.pptx
PDF
Cs1011 dw-dm-1
PDF
Data Warehouse Implementation
PPTX
From Tables and Spreadsheets to Data Cubes.pptx
PPTX
Lecture 3:Introduction to Dimensional Modelling.pptx
DOCX
Dimensional data model
PPTX
1-Data Warehousing-Multi Dim Data Model.pptx
Data minng and warehousing lecture notes 1PowerPoint.pptx
Data mining 3 - Data Models and Data Warehouse Design (cheat sheet - printable)
Data Warehouse_Architecture.pptx
Data Warehousing for students educationpptx
My2dw
Star ,Snow and Fact-Constullation Schemas??
multi dimensional data model
dataminingpres-150821063129-lva1-app6891 (3).pdf
CSC612 THIRD LECTURE ON DATA WAREHOUSE.pptx
Data Warehousing and Data Mining
Olap fundamentals
19CS3052R-CO1-7-S7 ECE
CHAPTER 2 - Datawarehouse Architecture.pptx
Unit 2- Data Warehouse Logical Design.pptx
Cs1011 dw-dm-1
Data Warehouse Implementation
From Tables and Spreadsheets to Data Cubes.pptx
Lecture 3:Introduction to Dimensional Modelling.pptx
Dimensional data model
1-Data Warehousing-Multi Dim Data Model.pptx
Ad

More from Hitesh Mohapatra (20)

PDF
Introduction to Edge and Fog Computing.pdf
PDF
Amazon Web Services (AWS) : Fundamentals
PDF
Resource Cluster and Multi-Device Broker.pdf
PDF
Failover System in Cloud Computing System
PDF
Resource Replication & Automated Scaling Listener
PDF
Storage Device & Usage Monitor in Cloud Computing.pdf
PDF
Networking in Cloud Computing Environment
PDF
Uniform-Cost Search Algorithm in the AI Environment
PDF
Logical Network Perimeter in Cloud Computing
PPT
Software Product Quality - Part 1 Presentation
PDF
Multitenancy in cloud computing architecture
PDF
Server Consolidation in Cloud Computing Environment
PDF
Web Services / Technology in Cloud Computing
PDF
Resource replication in cloud computing.
PDF
Software Measurement and Metrics (Quantified Attribute)
PDF
Software project management is an art and discipline of planning and supervis...
PDF
Software project management is an art and discipline of planning and supervis...
PDF
The life cycle of a virtual machine (VM) provisioning process
PDF
BUSINESS CONSIDERATIONS FOR CLOUD COMPUTING
PDF
Traditional Data Center vs. Virtualization – Differences and Benefits
Introduction to Edge and Fog Computing.pdf
Amazon Web Services (AWS) : Fundamentals
Resource Cluster and Multi-Device Broker.pdf
Failover System in Cloud Computing System
Resource Replication & Automated Scaling Listener
Storage Device & Usage Monitor in Cloud Computing.pdf
Networking in Cloud Computing Environment
Uniform-Cost Search Algorithm in the AI Environment
Logical Network Perimeter in Cloud Computing
Software Product Quality - Part 1 Presentation
Multitenancy in cloud computing architecture
Server Consolidation in Cloud Computing Environment
Web Services / Technology in Cloud Computing
Resource replication in cloud computing.
Software Measurement and Metrics (Quantified Attribute)
Software project management is an art and discipline of planning and supervis...
Software project management is an art and discipline of planning and supervis...
The life cycle of a virtual machine (VM) provisioning process
BUSINESS CONSIDERATIONS FOR CLOUD COMPUTING
Traditional Data Center vs. Virtualization – Differences and Benefits

Recently uploaded (20)

PPTX
CS6006 - CLOUD COMPUTING - Module - 1.pptx
PDF
Lesson 3 .pdf
PDF
electrical machines course file-anna university
PPTX
Cisco Network Behaviour dibuywvdsvdtdstydsdsa
PPTX
Environmental studies, Moudle 3-Environmental Pollution.pptx
PPTX
Micro1New.ppt.pptx the mai themes of micfrobiology
PDF
August -2025_Top10 Read_Articles_ijait.pdf
PDF
[jvmmeetup] next-gen integration with apache camel and quarkus.pdf
PDF
MACCAFERRY GUIA GAVIONES TERRAPLENES EN ESPAÑOL
PDF
Unit I -OPERATING SYSTEMS_SRM_KATTANKULATHUR.pptx.pdf
PPTX
Design ,Art Across Digital Realities and eXtended Reality
PDF
Research on ultrasonic sensor for TTU.pdf
DOCX
ENVIRONMENTAL PROTECTION AND MANAGEMENT (18CVL756)
PPT
UNIT-I Machine Learning Essentials for 2nd years
PDF
Micro 4 New.ppt.pdf a servay of cells and microorganism
PDF
SEH5E Unveiled: Enhancements and Key Takeaways for Certification Success
PDF
AIGA 012_04 Cleaning of equipment for oxygen service_reformat Jan 12.pdf
PDF
UEFA_Carbon_Footprint_Calculator_Methology_2.0.pdf
PPTX
CNS - Unit 1 (Introduction To Computer Networks) - PPT (2).pptx
PPTX
Module1.pptxrjkeieuekwkwoowkemehehehrjrjrj
CS6006 - CLOUD COMPUTING - Module - 1.pptx
Lesson 3 .pdf
electrical machines course file-anna university
Cisco Network Behaviour dibuywvdsvdtdstydsdsa
Environmental studies, Moudle 3-Environmental Pollution.pptx
Micro1New.ppt.pptx the mai themes of micfrobiology
August -2025_Top10 Read_Articles_ijait.pdf
[jvmmeetup] next-gen integration with apache camel and quarkus.pdf
MACCAFERRY GUIA GAVIONES TERRAPLENES EN ESPAÑOL
Unit I -OPERATING SYSTEMS_SRM_KATTANKULATHUR.pptx.pdf
Design ,Art Across Digital Realities and eXtended Reality
Research on ultrasonic sensor for TTU.pdf
ENVIRONMENTAL PROTECTION AND MANAGEMENT (18CVL756)
UNIT-I Machine Learning Essentials for 2nd years
Micro 4 New.ppt.pdf a servay of cells and microorganism
SEH5E Unveiled: Enhancements and Key Takeaways for Certification Success
AIGA 012_04 Cleaning of equipment for oxygen service_reformat Jan 12.pdf
UEFA_Carbon_Footprint_Calculator_Methology_2.0.pdf
CNS - Unit 1 (Introduction To Computer Networks) - PPT (2).pptx
Module1.pptxrjkeieuekwkwoowkemehehehrjrjrj

Data cube

  • 2. Example • Although we usually think of cubes as 3-D geometric structures, in data warehousing the data cube is n-dimensional. • To gain a better understanding of data cubes and the multidimensional data model, let’s start by looking at a simple 2-D data cube that is, in fact, a table or spreadsheet for sales data from AllElectronics. • In particular, we will look at the AllElectronics sales data for items sold per quarter in the city of Vancouver. • These data are shown in Table 4.2. In this 2-D representation, the sales for Vancouver are shown with respect to the time dimension (organized in quarters) and the item dimension (organized according to the types of items sold). • The fact or measure displayed is dollars sold (in thousands).
  • 3. 2-D
  • 4. Example • Now, suppose that we would like to view the sales data with a third dimension. • For instance, suppose we would like to view the data according to time and item, as well as location, for the cities Chicago, New York, Toronto, and Vancouver. • These 3-D data are shown in Table 4.3. The 3-D data in the table are represented as a series of 2-D tables. • Conceptually, we may also represent the same data in the form of a 3- D data cube, as in Figure 4.3.
  • 5. 3-D In this way, we may display any n-dimensional data as a series of (n-1)-dimensional “cubes.” The data cube is a metaphor for multidimensional data storage. The actual physical storage of such data may differ from its logical representation. The important thing to remember is that data cubes are n- dimensional and do not confine data to 3-D.
  • 6. 3-D data cube representation
  • 7. 4.4 is often referred to as a cuboid
  • 9. Stars, Snowflakes, and Fact Constellations: Schemas for Multidimensional Data Models The most popular data model for a data warehouse is a multidimensional model, which can exist in the form of a star schema, a snowflake schema, or a fact constellation schema. Let’s look at each of these. • Star schema: The most common modeling paradigm is the star schema, in which the data warehouse contains (1) a large central table (fact table) containing the bulk of the data, with no redundancy, and (2) a set of smaller attendant tables (dimension tables), one for each dimension. The schema graph resembles a starburst, with the dimension tables displayed in a radial pattern around the central fact table.
  • 10. Star schema of sales data warehouse
  • 11. Snowflake schema: The snowflake schema is a variant of the star schema model, where some dimension tables are normalized, thereby further splitting the data into additional tables. The resulting schema graph forms a shape similar to a snowflake.
  • 12. Snowflake schema of a sales data warehouse
  • 13. • Fact constellation: Sophisticated applications may require multiple fact tables to share dimension tables. This kind of schema can be viewed as a collection of stars, and hence is called a galaxy schema or a fact constellation.
  • 14. Fact constellation schema of a sales and shipping data warehouse
  • 15. Dimensions: The Role of Concept Hierarchies • A concept hierarchy defines a sequence of mappings from a set of low-level concepts to higher-level, more general concepts. • Consider a concept hierarchy for the dimension location. City values for location include Vancouver, Toronto, New York, and Chicago. • Each city, however, can be mapped to the province or state to which it belongs. • For example, Vancouver can be mapped to British Columbia, and Chicago to Illinois.
  • 18. A concept hierarchy for price.
  • 19. Examples of typical OLAP operations on multidimensional data
  • 20. A Starnet Query Model for Querying Multidimensional Databases