SlideShare a Scribd company logo
Seminar
On
Process Management In Data Warehouse
Table of Contents:
 What is Process management?
 Data Warehouse Process Architecture:
Data warehouse architecture involves the following components:
 Load management
 Warehouse management
 Query management.
 The 3 Perspectives for the Process Model.
 Conceptual
 Logical
 physical
Typical Data Warehouse Environment
What is Process management?
 Process managers are responsible for maintaining the flow of
data both into and out of the data warehouse.
Three different types of process managers:
o Load manager
o Warehouse manager
o Query manager
Source data
Data Sources:
 The data is extracted from the operational databases or the
external information providers.
 Internal, external, production, archived.
 Gateway are the application programs that are used to extract
data. It is supported by
underlying DBMS and allows the client program to generate
SQL to be executed at a server.
 Open Database Connection (ODBC) and Java Database
Connection (JDBC) are examples of
gateway.
Data Warehouse Process Management Architecture
Load Manager
Load Manager:
Includes all of the software and utilities
required to:
 Extract source system data and move it
to the warehouse Environment.
Complete basic Transformation to
ensure that nonessential data is eliminated
and other data is converted to appropriate
data types.
Fast load data into a staging area where
it can be subsequently manipulated.
EXTRACT
Some of the data elements in the operational database can be reasonably be expected to
be useful in the decision making, but others are of less value for that purpose.
 For this reason, it is necessary to extract the relevant data from the operational
database before bringing into the data warehouse. Many commercial tools are available
to help with the extraction process.
Data Junction is one of the commercial products.
EXTRACT(Cont..)
The user of one of these tools typically has an easy-to-use windowed
interface by which to specify the following:
oWhich files and tables are to be accessed in the source database?
oWhich fields are to be extracted from them? This is often done
internally by SQL Select statement.
oWhat are those to be called in the resulting database?
oWhat is the target machine and database format of the output?
oOn what schedule should the extraction process be repeated?
TRANSFORM
The operational databases developed can be based on any set of priorities, which
keeps changing with the requirements.
Deals with rectifying any inconsistency.
One of the most common transformation issues is ‘Attribute Naming Inconsistency’.
Once all the data elements have right names, they must be converted to common
formats.
TRANSFORM(Cont..)
The conversion may encompass the following:
Characters must be converted ASCII to EBCDIC or vise versa.
Mixed Text may be converted to all uppercase for consistency.
Numerical data must be converted in to a common format.
Data Format has to be standardized.
Measurement may have to convert. (Rs/ $)
Coded data (Male/ Female, M/F) must be converted into a
common format.
LOADING
Loading often implies physical movement of the data from the computer(s)
storing the source database(s) to that which will store the data warehouse
database, assuming it is different.
This takes place immediately after the extraction phase.
The most common channel for data movement is a high-speed
communication link.
Ex: Oracle Warehouse Builder is the API from Oracle, which provides the
features to perform the ETL task on Oracle Data Warehouse.
Warehouse Manager
Warehouse Manager
 The warehouse manager performs all the operations associated with
the management of data in the warehouse.
 Constructed using vendor data management tools and custom-built
programs.
Process management seminar
Detailed Data
 Stores all the detailed data in the database schema.
 In most cases, the detailed data is not stored online but aggregated to
the next level of detail.
 On a regular basis, detailed data is added to the warehouse to
supplement the aggregated data.
Lightly and Highly Summarized Data
 Stores all the pre-defined lightly and highly aggregated data generated by the warehouse
manager.
 Transient as it will be subject to change on an on-going basis in order to respond to
changing query profiles.
 The purpose of summary information is to speed up the performance of queries.
 Removes the requirement to continually perform summary operations (such as
sort or group by) in answering user queries.
 The summary data is updated continuously as new data is loaded into the
warehouse.
Archive / Backup Data
 Stores detailed and summarized data for the purposes of archiving
and backup.
 May be necessary to backup online summary data if this data is kept
beyond the retention period for detailed data.
 The data is transferred to storage archives such as magnetic tape or
optical disk.
Warehouse Manager Architecture
Functions of Warehouse Manager
 Analysis the data to perform consistency and referential integrity checks.
 Creates indexes, business views, partition views against the base data.
 Generates new aggregations and updates the existing aggregations.
 Generates normalizations.
 Transforms and merges the source data into the temporary store of the
published data warehouse.
Cont..
 Backs up the data in the data warehouse.
 Archives the data that has reached the end of its captured life.
Note:
A warehouse manager analyses query profiles to determine whether
the index and aggregations are appropriate.
Query Manager
 Responsible for directing the queries to the suitable
tables.
 Speed of querying and response generation can be
increased.
 Also responsible for scheduling the execution of the
queries posed by the user.
Query Manager
Query Manager Architecture
 Query redirection via C tool or RDBMS
 Stored procedures
 Query management tool
 Query scheduling via C tool or RDBMS
 Query scheduling via third-party software
Query Manager Architecture
 Performs all operations associated with management of user queries.
 Component is usually constructed using
 vendor end-user access tools,
 data warehousing monitoring tools,
 database facilities
 custom built programs.
 The complexity of a query manager is determined by facilities provided
by the end-user access tools and database.
Query Manager Functionality
Detailed Information
 Not kept online, rather it is aggregated to the next level
of detail and then archived to tape.
 Part of data warehouse keeps the detailed information in
the starflake schema.
 loaded into the data warehouse to supplement the
aggregated data.
Detailed Information
 In this area of data warehouse the predefined aggregations are kept.
 These aggregations are generated by warehouse manager.
 This area changes on ongoing basis in order to respond to the changing query profiles.
 Speed up the performance of common queries.
 Increases the operational cost.
 It needs to be updated whenever new data is loaded into the data warehouse.
 It may not have been backed up, since it can be generated fresh from the detailed information.
Summary Information
 It presents the data to the user in a form they understand.
 It schedules the execution of the queries posted by the
end-user.
 It stores query profiles to allow the warehouse manager to
determine which indexes and aggregations are
appropriate.
Functions of Query Manager
 Logical perspective: what steps it consists of
 Physical perspective: how they are to be performed
 Conceptual perspective: why these steps exist
3 Perspectives for the Process Model
 conceptual perspective which abstractly represents the basic
interrelationships between data warehouse stakeholders and processes
in a formal way
 A central logical perspective part of the model, which captures the basic
structure and data characteristics of a process.
 physical perspective counterpart which provides specific details over the
actual components that execute the process.
3 Perspectives for the Process Model
3 Perspectives for the Process Model
 Major purpose –
 to help stakeholders
 understand the reasoning behind decisions on the
architecture
 physical characteristics of data
 warehouse processes
Conceptual Perspective
 Each Type in the logical perspective is the counterpart of a Concept in
the conceptual perspective.
 Concept represents a class of real-world objects, in terms of a
conceptual metamodel
 the Entity-Relationship
 UML notation
 Both Types and Concepts are constructed from Fields , through the
attribute fields
 Consider Field to be a subtype both of LogicalObject and
ConceptualObject.
Concept
 Central conceptual entity
 Generalizes the conceptual counterparts of activities, stakeholders
and data stores
 The class Role is used to express the interdependencies of these
entities, through the attribute RelatesTo. Activity Role, Stakeholder
 Concept are specializations of Roles for processes, persons and
concepts in the conceptual perspective.
 Each Role represents a person, program or data store participating in
the environment of a process,
ROLE
LOGICAL PERSPECTIVE
 Captures the basic structure and data characteristics of a
process.
 In the logical perspective, the modeling is concerned with the
functionality of an activity, describing what this particular
activity is about in terms of consumption and production of
information.
Process management seminar
Physical Perspective
While the logical perspective covers the structure (what?) of a process, the
physical perspective covers the details of its execution (how?).
physical perspective counterpart which provides specific details over the
actual components that execute the process.
The information of the physical perspective can be used to trace and monitor
the execution of data warehouse processes
Summary
 Process managers are responsible for maintaining the flow of data.
 Load manager performs the operations required to extract and load
the data into the database.
 The warehouse manager is responsible for the warehouse
management process.
 The query manager is responsible for directing the queries to suitable
tables.
 3 Perspectives for the Process Model
 Logical perspective
 Physical perspective
 Conceptual perspective
Refrences
 Panos Vassiliadis, Christoph Quix, Yannis Vassiliou, Matthias Jarke, DATA WAREHOUSE
PROCESS MANAGEMENT National Technical University of Athens, Dept. of Electrical and
Computer Eng., Computer Science Division, Iroon Polytechniou 9, 157 73, Athens, Greece
{pvassil,yv}@dbnet.ece.ntua.gr
 www.tutorialspoint.com/dwh
Process management seminar

More Related Content

PPTX
Data warehouse physical design
Er. Nawaraj Bhandari
 
PPT
Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]
Usman Tariq
 
PPTX
Transaction
Er. Nawaraj Bhandari
 
PPTX
Methodology logical database design for the relational
university of the punjab
 
PPTX
Lecture 02 - The Data Warehouse Environment
phanleson
 
PPTX
Database design (conceptual, logical and physical design) unit 2 part 2
Ram Paliwal
 
PDF
Elimination of data redundancy before persisting into dbms using svm classifi...
nalini manogaran
 
PPS
Etl Overview (Extract, Transform, And Load)
LizLavaveshkul
 
Data warehouse physical design
Er. Nawaraj Bhandari
 
Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]
Usman Tariq
 
Methodology logical database design for the relational
university of the punjab
 
Lecture 02 - The Data Warehouse Environment
phanleson
 
Database design (conceptual, logical and physical design) unit 2 part 2
Ram Paliwal
 
Elimination of data redundancy before persisting into dbms using svm classifi...
nalini manogaran
 
Etl Overview (Extract, Transform, And Load)
LizLavaveshkul
 

What's hot (20)

PDF
Informatica push down optimization implementation
divjeev
 
PPT
D01 etl
Prince Jain
 
PPT
Database Systems
Usman Tariq
 
PPT
Lecture 03 - The Data Warehouse and Design
phanleson
 
PPTX
Physical database design(database)
welcometofacebook
 
PDF
Databse management system
Chittagong University
 
PDF
Physical Database Design & Performance
Abdullah Khosa
 
PPT
Week 2 Characteristics & Benefits of a Database & Types of Data Models
oudesign
 
PPT
Database
guest0a6e77
 
PDF
An Overview on Data Quality Issues at Data Staging ETL
idescitation
 
PDF
Cts informatica interview question answers
Sweta Singh
 
PPTX
ETL Process
Rohin Rangnekar
 
PDF
H1803014347
IOSR Journals
 
PDF
CBSE XII Database Concepts And MySQL Presentation
Guru Ji
 
PDF
Role of Data Cleaning in Data Warehouse
Ramakant Soni
 
DOC
To Study E T L ( Extract, Transform, Load) Tools Specially S Q L Server I...
Shahzad
 
DOC
Informatica and datawarehouse Material
obieefans
 
PPT
Ch 2 D B Dvlpt Process
guest8fdbdd
 
PPT
Sap business intelligence 4.0 report basic
tovetrivel
 
PPT
Informatica Power Center - Workflow Manager
ZaranTech LLC
 
Informatica push down optimization implementation
divjeev
 
D01 etl
Prince Jain
 
Database Systems
Usman Tariq
 
Lecture 03 - The Data Warehouse and Design
phanleson
 
Physical database design(database)
welcometofacebook
 
Databse management system
Chittagong University
 
Physical Database Design & Performance
Abdullah Khosa
 
Week 2 Characteristics & Benefits of a Database & Types of Data Models
oudesign
 
Database
guest0a6e77
 
An Overview on Data Quality Issues at Data Staging ETL
idescitation
 
Cts informatica interview question answers
Sweta Singh
 
ETL Process
Rohin Rangnekar
 
H1803014347
IOSR Journals
 
CBSE XII Database Concepts And MySQL Presentation
Guru Ji
 
Role of Data Cleaning in Data Warehouse
Ramakant Soni
 
To Study E T L ( Extract, Transform, Load) Tools Specially S Q L Server I...
Shahzad
 
Informatica and datawarehouse Material
obieefans
 
Ch 2 D B Dvlpt Process
guest8fdbdd
 
Sap business intelligence 4.0 report basic
tovetrivel
 
Informatica Power Center - Workflow Manager
ZaranTech LLC
 
Ad

Similar to Process management seminar (20)

PDF
Data warehousing interview_questionsandanswers
Sourav Singh
 
PPT
Datawarehousing
sumit621
 
PPT
Data Warehouse
nayakslideshare
 
PPTX
Data Warehouse for data analytics presentation
21132067
 
DOCX
UNIT-5 DATA WAREHOUSING.docx
DURGADEVIL
 
DOC
Data warehouse concepts
obieefans
 
PPTX
UNIT - 1 Part 2: Data Warehousing and Data Mining
Nandakumar P
 
PPTX
Module 1_Overview of Database Management System
prajwalr3501
 
PPTX
Lecture 1 to 3intro to normalization in database
maqsoodahmedbscsfkhp
 
PDF
Business Analytics System
Mahesh Patwardhan
 
PPTX
Notes of DBMS Introduction to Database Design
AthiraNair143542
 
PPS
Data Warehouse 101
PanaEk Warawit
 
PPTX
1-SDLC - Development Models – Waterfall, Rapid Application Development, Agile...
JOHNLEAK1
 
PPT
3._DWH_Architecture__Components.ppt
BsMath3rdsem
 
PDF
Query Evaluation Techniques for Large Databases.pdf
RayWill4
 
PPTX
Warehouse Planning and Implementation
SHIKHA GAUTAM
 
PPTX
Chapter 4 Chapter Relational DB - Copy.pptx
OmarOmar731335
 
PDF
An Integrated ERP With Web Portal
Tracy Morgan
 
PPTX
DATA--BASE--MANAGEMENT---Discussion.pptx
angellibres20
 
PDF
M.sc. engg (ict) admission guide database management system 4
Syed Ariful Islam Emon
 
Data warehousing interview_questionsandanswers
Sourav Singh
 
Datawarehousing
sumit621
 
Data Warehouse
nayakslideshare
 
Data Warehouse for data analytics presentation
21132067
 
UNIT-5 DATA WAREHOUSING.docx
DURGADEVIL
 
Data warehouse concepts
obieefans
 
UNIT - 1 Part 2: Data Warehousing and Data Mining
Nandakumar P
 
Module 1_Overview of Database Management System
prajwalr3501
 
Lecture 1 to 3intro to normalization in database
maqsoodahmedbscsfkhp
 
Business Analytics System
Mahesh Patwardhan
 
Notes of DBMS Introduction to Database Design
AthiraNair143542
 
Data Warehouse 101
PanaEk Warawit
 
1-SDLC - Development Models – Waterfall, Rapid Application Development, Agile...
JOHNLEAK1
 
3._DWH_Architecture__Components.ppt
BsMath3rdsem
 
Query Evaluation Techniques for Large Databases.pdf
RayWill4
 
Warehouse Planning and Implementation
SHIKHA GAUTAM
 
Chapter 4 Chapter Relational DB - Copy.pptx
OmarOmar731335
 
An Integrated ERP With Web Portal
Tracy Morgan
 
DATA--BASE--MANAGEMENT---Discussion.pptx
angellibres20
 
M.sc. engg (ict) admission guide database management system 4
Syed Ariful Islam Emon
 
Ad

Recently uploaded (20)

PDF
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
DOCX
SAR - EEEfdfdsdasdsdasdasdasdasdasdasdasda.docx
Kanimozhi676285
 
PDF
Cryptography and Information :Security Fundamentals
Dr. Madhuri Jawale
 
PDF
Principles of Food Science and Nutritions
Dr. Yogesh Kumar Kosariya
 
PDF
top-5-use-cases-for-splunk-security-analytics.pdf
yaghutialireza
 
PPTX
Civil Engineering Practices_BY Sh.JP Mishra 23.09.pptx
bineetmishra1990
 
PDF
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
PDF
EVS+PRESENTATIONS EVS+PRESENTATIONS like
saiyedaqib429
 
PDF
Zero Carbon Building Performance standard
BassemOsman1
 
PPTX
Victory Precisions_Supplier Profile.pptx
victoryprecisions199
 
PPTX
Module2 Data Base Design- ER and NF.pptx
gomathisankariv2
 
PPTX
easa module 3 funtamental electronics.pptx
tryanothert7
 
PPTX
database slide on modern techniques for optimizing database queries.pptx
aky52024
 
PDF
The Effect of Artifact Removal from EEG Signals on the Detection of Epileptic...
Partho Prosad
 
PPT
Ppt for engineering students application on field effect
lakshmi.ec
 
PDF
LEAP-1B presedntation xxxxxxxxxxxxxxxxxxxxxxxxxxxxx
hatem173148
 
PDF
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
PDF
Chad Ayach - A Versatile Aerospace Professional
Chad Ayach
 
PPTX
Chapter_Seven_Construction_Reliability_Elective_III_Msc CM
SubashKumarBhattarai
 
PDF
Advanced LangChain & RAG: Building a Financial AI Assistant with Real-Time Data
Soufiane Sejjari
 
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
SAR - EEEfdfdsdasdsdasdasdasdasdasdasdasda.docx
Kanimozhi676285
 
Cryptography and Information :Security Fundamentals
Dr. Madhuri Jawale
 
Principles of Food Science and Nutritions
Dr. Yogesh Kumar Kosariya
 
top-5-use-cases-for-splunk-security-analytics.pdf
yaghutialireza
 
Civil Engineering Practices_BY Sh.JP Mishra 23.09.pptx
bineetmishra1990
 
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
EVS+PRESENTATIONS EVS+PRESENTATIONS like
saiyedaqib429
 
Zero Carbon Building Performance standard
BassemOsman1
 
Victory Precisions_Supplier Profile.pptx
victoryprecisions199
 
Module2 Data Base Design- ER and NF.pptx
gomathisankariv2
 
easa module 3 funtamental electronics.pptx
tryanothert7
 
database slide on modern techniques for optimizing database queries.pptx
aky52024
 
The Effect of Artifact Removal from EEG Signals on the Detection of Epileptic...
Partho Prosad
 
Ppt for engineering students application on field effect
lakshmi.ec
 
LEAP-1B presedntation xxxxxxxxxxxxxxxxxxxxxxxxxxxxx
hatem173148
 
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
Chad Ayach - A Versatile Aerospace Professional
Chad Ayach
 
Chapter_Seven_Construction_Reliability_Elective_III_Msc CM
SubashKumarBhattarai
 
Advanced LangChain & RAG: Building a Financial AI Assistant with Real-Time Data
Soufiane Sejjari
 

Process management seminar

  • 2. Table of Contents:  What is Process management?  Data Warehouse Process Architecture: Data warehouse architecture involves the following components:  Load management  Warehouse management  Query management.  The 3 Perspectives for the Process Model.  Conceptual  Logical  physical
  • 4. What is Process management?  Process managers are responsible for maintaining the flow of data both into and out of the data warehouse. Three different types of process managers: o Load manager o Warehouse manager o Query manager Source data
  • 5. Data Sources:  The data is extracted from the operational databases or the external information providers.  Internal, external, production, archived.  Gateway are the application programs that are used to extract data. It is supported by underlying DBMS and allows the client program to generate SQL to be executed at a server.  Open Database Connection (ODBC) and Java Database Connection (JDBC) are examples of gateway.
  • 6. Data Warehouse Process Management Architecture
  • 8. Load Manager: Includes all of the software and utilities required to:  Extract source system data and move it to the warehouse Environment. Complete basic Transformation to ensure that nonessential data is eliminated and other data is converted to appropriate data types. Fast load data into a staging area where it can be subsequently manipulated.
  • 9. EXTRACT Some of the data elements in the operational database can be reasonably be expected to be useful in the decision making, but others are of less value for that purpose.  For this reason, it is necessary to extract the relevant data from the operational database before bringing into the data warehouse. Many commercial tools are available to help with the extraction process. Data Junction is one of the commercial products.
  • 10. EXTRACT(Cont..) The user of one of these tools typically has an easy-to-use windowed interface by which to specify the following: oWhich files and tables are to be accessed in the source database? oWhich fields are to be extracted from them? This is often done internally by SQL Select statement. oWhat are those to be called in the resulting database? oWhat is the target machine and database format of the output? oOn what schedule should the extraction process be repeated?
  • 11. TRANSFORM The operational databases developed can be based on any set of priorities, which keeps changing with the requirements. Deals with rectifying any inconsistency. One of the most common transformation issues is ‘Attribute Naming Inconsistency’. Once all the data elements have right names, they must be converted to common formats.
  • 12. TRANSFORM(Cont..) The conversion may encompass the following: Characters must be converted ASCII to EBCDIC or vise versa. Mixed Text may be converted to all uppercase for consistency. Numerical data must be converted in to a common format. Data Format has to be standardized. Measurement may have to convert. (Rs/ $) Coded data (Male/ Female, M/F) must be converted into a common format.
  • 13. LOADING Loading often implies physical movement of the data from the computer(s) storing the source database(s) to that which will store the data warehouse database, assuming it is different. This takes place immediately after the extraction phase. The most common channel for data movement is a high-speed communication link. Ex: Oracle Warehouse Builder is the API from Oracle, which provides the features to perform the ETL task on Oracle Data Warehouse.
  • 15. Warehouse Manager  The warehouse manager performs all the operations associated with the management of data in the warehouse.  Constructed using vendor data management tools and custom-built programs.
  • 17. Detailed Data  Stores all the detailed data in the database schema.  In most cases, the detailed data is not stored online but aggregated to the next level of detail.  On a regular basis, detailed data is added to the warehouse to supplement the aggregated data.
  • 18. Lightly and Highly Summarized Data  Stores all the pre-defined lightly and highly aggregated data generated by the warehouse manager.  Transient as it will be subject to change on an on-going basis in order to respond to changing query profiles.  The purpose of summary information is to speed up the performance of queries.  Removes the requirement to continually perform summary operations (such as sort or group by) in answering user queries.  The summary data is updated continuously as new data is loaded into the warehouse.
  • 19. Archive / Backup Data  Stores detailed and summarized data for the purposes of archiving and backup.  May be necessary to backup online summary data if this data is kept beyond the retention period for detailed data.  The data is transferred to storage archives such as magnetic tape or optical disk.
  • 21. Functions of Warehouse Manager  Analysis the data to perform consistency and referential integrity checks.  Creates indexes, business views, partition views against the base data.  Generates new aggregations and updates the existing aggregations.  Generates normalizations.  Transforms and merges the source data into the temporary store of the published data warehouse.
  • 22. Cont..  Backs up the data in the data warehouse.  Archives the data that has reached the end of its captured life. Note: A warehouse manager analyses query profiles to determine whether the index and aggregations are appropriate.
  • 24.  Responsible for directing the queries to the suitable tables.  Speed of querying and response generation can be increased.  Also responsible for scheduling the execution of the queries posed by the user. Query Manager
  • 26.  Query redirection via C tool or RDBMS  Stored procedures  Query management tool  Query scheduling via C tool or RDBMS  Query scheduling via third-party software Query Manager Architecture
  • 27.  Performs all operations associated with management of user queries.  Component is usually constructed using  vendor end-user access tools,  data warehousing monitoring tools,  database facilities  custom built programs.  The complexity of a query manager is determined by facilities provided by the end-user access tools and database. Query Manager Functionality
  • 28. Detailed Information  Not kept online, rather it is aggregated to the next level of detail and then archived to tape.  Part of data warehouse keeps the detailed information in the starflake schema.  loaded into the data warehouse to supplement the aggregated data.
  • 30.  In this area of data warehouse the predefined aggregations are kept.  These aggregations are generated by warehouse manager.  This area changes on ongoing basis in order to respond to the changing query profiles.  Speed up the performance of common queries.  Increases the operational cost.  It needs to be updated whenever new data is loaded into the data warehouse.  It may not have been backed up, since it can be generated fresh from the detailed information. Summary Information
  • 31.  It presents the data to the user in a form they understand.  It schedules the execution of the queries posted by the end-user.  It stores query profiles to allow the warehouse manager to determine which indexes and aggregations are appropriate. Functions of Query Manager
  • 32.  Logical perspective: what steps it consists of  Physical perspective: how they are to be performed  Conceptual perspective: why these steps exist 3 Perspectives for the Process Model
  • 33.  conceptual perspective which abstractly represents the basic interrelationships between data warehouse stakeholders and processes in a formal way  A central logical perspective part of the model, which captures the basic structure and data characteristics of a process.  physical perspective counterpart which provides specific details over the actual components that execute the process. 3 Perspectives for the Process Model
  • 34. 3 Perspectives for the Process Model
  • 35.  Major purpose –  to help stakeholders  understand the reasoning behind decisions on the architecture  physical characteristics of data  warehouse processes Conceptual Perspective
  • 36.  Each Type in the logical perspective is the counterpart of a Concept in the conceptual perspective.  Concept represents a class of real-world objects, in terms of a conceptual metamodel  the Entity-Relationship  UML notation  Both Types and Concepts are constructed from Fields , through the attribute fields  Consider Field to be a subtype both of LogicalObject and ConceptualObject. Concept
  • 37.  Central conceptual entity  Generalizes the conceptual counterparts of activities, stakeholders and data stores  The class Role is used to express the interdependencies of these entities, through the attribute RelatesTo. Activity Role, Stakeholder  Concept are specializations of Roles for processes, persons and concepts in the conceptual perspective.  Each Role represents a person, program or data store participating in the environment of a process, ROLE
  • 38. LOGICAL PERSPECTIVE  Captures the basic structure and data characteristics of a process.  In the logical perspective, the modeling is concerned with the functionality of an activity, describing what this particular activity is about in terms of consumption and production of information.
  • 40. Physical Perspective While the logical perspective covers the structure (what?) of a process, the physical perspective covers the details of its execution (how?). physical perspective counterpart which provides specific details over the actual components that execute the process. The information of the physical perspective can be used to trace and monitor the execution of data warehouse processes
  • 41. Summary  Process managers are responsible for maintaining the flow of data.  Load manager performs the operations required to extract and load the data into the database.  The warehouse manager is responsible for the warehouse management process.  The query manager is responsible for directing the queries to suitable tables.  3 Perspectives for the Process Model  Logical perspective  Physical perspective  Conceptual perspective
  • 42. Refrences  Panos Vassiliadis, Christoph Quix, Yannis Vassiliou, Matthias Jarke, DATA WAREHOUSE PROCESS MANAGEMENT National Technical University of Athens, Dept. of Electrical and Computer Eng., Computer Science Division, Iroon Polytechniou 9, 157 73, Athens, Greece {pvassil,yv}@dbnet.ece.ntua.gr  www.tutorialspoint.com/dwh