SlideShare a Scribd company logo
ETL and
its impact on
Business
Intelligence
Presentedby- Isha Pande(23)
Index
Introduction to Business Intelligence
Extract-Transform-Load Process
Business Intelligence Architecture
Five-layered BI Architecture
Summary
Bibilography
Introduction to
Business
Intelligence
Business Intelligence
The term Business Intelligence (BI) refers to technologies, applications and
practices for the collection, integration, analysis, and presentation of
business information. The purpose of Business Intelligence is to support
better business decision making.
RAW DATA BI MEANINGFULDATA
Business Intelligence
Business Intelligence means better decision-making:
Decision making is the important factor for the success of the
organization.
BI helps in the accurate and timely decision making by providing all the
business information.
Many large organization making faster, better and accurate decision
making with the help of Business Intelligence.
Extract-Transform-
Load Process
ETL (Extract-Transform-Load) Process
> Extract, Transform and Load (ETL) process is important component of BI.
Summary
EXTRACT TRANSFORM LOAD ANALYZE
ETL (Extract-Transform-Load) Process
> Extract describes the gathering of data from various sources.
> Transformation is modification to match a desired state.
> Load is its import into a database or data warehouse.
> Data Warehouse is predominantly used to store detailed summary data and
metadata(data about data).
> On the other hand, metadata include information on data themselves. They
facilitate a process of extracting, transforming and loading data through
presenting sources of data in the layout of data warehouses. Metadata are
also used to automate summary data creation and queries management.
> ETL processes take up to 80% of the effort in BI projects. A high
performance is thereby vital to be able to process large amounts of data and
to have a up-to-date database.
Business Intelligence
Architecture
Business Intelligence Architecture
Existing BI architectures typically feature a unidirectional communication
flow between different components:
• The architectures only features a one-way data flow.
• The limitation of unidirectional data flow is that no adjustment or correction
is allowed on data source even if an error is found.
• If organizations want to correct the error, they have to repeat the entire BI
process especially that of the cleansing procedures again. To overcome
these problems, a two-way data integration flow [14] is suggested whereby
the cleansed data can be sent back to data sources to improve accuracy
and reduce cleansing work.
Business Intelligence Architecture
Existing BI architectures lack support on metadata management:
• A good BI architecture should include the layer of metadata.
• A metadata repository is essential for business users to store and
standardize metadata across different systems.
• By having a well-structured metadata, organizations will be able to track
and monitor data flows within their BI environment.
• In addition, they will be able to ensure the consistency of definitions and
descriptions of data that support BI components and thus avoid
misunderstanding and misinterpretation of data
Business Intelligence Architecture
Some of the architectures do not include operational data store (ODS)
within the BI environment:
• It is essential to implement ODS to provide current or near current
integrated information that can be accessed or updated directly by users.
• Through this way, decision makers will be able to react faster to changing
business environment and requirements.
• It is necessary to consider data staging area in the ETL (Extract Transform-
Load) process. As most of the data from data source require cleansing and
transformation, it is important to create a temporary storage for data to
reside prior to loading into ODS or data warehouse.
Five-layered BI
Architecture
Five-layered BI
Architecture
Five-layered BI Architecture
Data Source Layer:
• The purpose of this layer is to identify the data source. It is very important
for an or-ganization to know where all required data are from.
• This layer can help not only to increase the reliability of data but also to
improve data organizing.
• There are two types of data sources used by BI applications:
(a) Internal data source (b) External data source
Five-layered BI Architecture
Data Source Layer:
(a) Internal data source:
The data which is obtained and maintained by the system inside the
organization such as Customer Relationship Management system (CRM) and
Enterprise Resource Planning system (ERP). Internal data source also
includes all business operation related data such as sales data, customers
data etc.
(b) External data source:
All data which are organized outside the organization are called external
source for example, data from partners, data from suppliers, information
from governmentsand so on.
Five-layered BI Architecture
ETL Layer:
• This layer focuses on three main processes:
(a) Extraction
(b) Transformation
(c) Loading
Five-layered BI Architecture
ETL Layer:
(a) Extraction:
Extraction is used to read data from different sources both internally or
externally.
(b) Transformation:
Transformation is the process that converts the extracted data from its
original form into the form which is needed.
(c) Loading:
Loading process is defined to put data into data warehouse.
• The purpose of ETL layer is to guarantee that all extracted data are
cleansed and are converted into right form and the data loads into target
warehouse.
Five-layered BI Architecture
Data Warehouse Layer:
• This layer provides the storage for data which are in desired and reliable
formats for further use whenever and wherever needed.
• There are three components in data warehouse layer:
(a) ODS (b) Data Warehouse (c) Data Marts
Five-layered BI Architecture
Data Warehouse Layer:
(a) ODS: Operational data store (ODS) is used to integrate data and load
them into data ware-house. ODS operatesas a short-term memory since the
data in ODS is updated frequently.
(b) Data Warehouse: Data warehouse is a central storage of structured
data. And all data in data warehouse are ready for analytical use. Unlike
ODS, data warehouse is a long-term memory as the data are permanent.
Both current and history data are stored in data warehouses. It is a subject-
oriented, integrated, time-variant and non-volatile collection of data to
support the managements’ decision making process.
(c) Data Marts: Data mart can be considered as a sub-set of data
warehouse. Usually, data mart consists of data from one single subject area
and it is created directly from the data warehouse.
Five-layered BI Architecture
End User Layer:
• The end user layer consists of tools that display information in different
formats to different users. These tools can be grouped hierarchically in a
pyramid shape. As one moves from the bottom to the top of the pyramid,
the degree of comprehensiveness at which data are being processed
increases.
Five-layered BI Architecture
Metadata Layer:
• Metadata refers to data about data.
• It describes where data are being used and stored,
the source of data, what changes have been made
to the data, and how one piece of data relates to
other information.
• In this architecture, Metadata management is
applied to all other four layers.
• The purpose of this layer is that by using the well-
structured metadata, organizations is able to track
and monitor the data flow in a BI solution.
• Furthermore, it can help to avoid misun-
derstanding of data.
Summary
• BI supports decision making.
• Extract, Transform and Load (ETL) process is important component of BI.
• The Five-layered BI architecture draws a big picture showing how BI
solution works.
• Data are gathered from both internal and external sources. Then these
sources will be extracted, transformed and loaded into data warehouse
for further analytical use.
• End users use different tools to analyze the data to meet the business
requirements or needs.
• All the data are managed by the metadata layer in the architecture.
Bibilography
1) Nitin Anand. ETL and its impact on Business Intelligence.
New Delhi, February 2014
2) In Lih Ong, Pei Hwa Siew and Siew Fan Wong. A Five-Layered Business
Intelligence Architecture. Selangor, Malaysia, 2011
3) Mengwei Lu. DiscoveringMicrosoft Self-serviceBI solution: Power BI.
Helsinki, Finland, May 2014
Thank You!

More Related Content

PPTX
Business Intelligence - Conceptual Introduction
Ahmed Rami Elsherif, PMP, ITBMC
 
PPT
Date warehousing concepts
pcherukumalla
 
PPS
Data Warehouse 101
PanaEk Warawit
 
PPTX
Presentation on Business Intelligence (BI)
AkashBorse2
 
PPTX
Business intelligence- Components, Tools, Need and Applications
raj
 
PDF
DataOps - The Foundation for Your Agile Data Architecture
DATAVERSITY
 
PDF
Data Catalog as the Platform for Data Intelligence
Alation
 
Business Intelligence - Conceptual Introduction
Ahmed Rami Elsherif, PMP, ITBMC
 
Date warehousing concepts
pcherukumalla
 
Data Warehouse 101
PanaEk Warawit
 
Presentation on Business Intelligence (BI)
AkashBorse2
 
Business intelligence- Components, Tools, Need and Applications
raj
 
DataOps - The Foundation for Your Agile Data Architecture
DATAVERSITY
 
Data Catalog as the Platform for Data Intelligence
Alation
 

What's hot (20)

PDF
Modern Data architecture Design
Kujambu Murugesan
 
PDF
Building a Winning Roadmap for Analytics
Ironside
 
PPTX
Business intelligence
Randy L. Archambault
 
PPT
Introduction to Business Intelligence
Ronan Soares
 
PPTX
Data Warehouse Fundamentals
Rashmi Bhat
 
PPTX
Presentation on Big Data
Md. Salman Ahmed
 
PDF
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
DATAVERSITY
 
PDF
BIG DATA.pdf
naveenlingala2
 
PPTX
Data Architecture Brief Overview
Hal Kalechofsky
 
PPTX
Big data architecture
Dr. Jasmine Beulah Gnanadurai
 
PPTX
What exactly is Business Intelligence?
James Serra
 
PPTX
Etl - Extract Transform Load
ABDUL KHALIQ
 
PDF
Data Virtualization: An Essential Component of a Cloud Data Lake
Denodo
 
PDF
Master Data Management's Place in the Data Governance Landscape
CCG
 
PPTX
Introduction to Big Data
Srinath Perera
 
PPTX
Introduction to Power BI and Data Visualization
Swapnil Jadhav
 
PPTX
What is big data?
David Wellman
 
PPT
Business intelligence
Mustafa Ali Hassan, MBA
 
PDF
Business Analytics Overview
Dr Susan Entwisle
 
PPTX
Business intelligence and big data
Shäîl Rûlès
 
Modern Data architecture Design
Kujambu Murugesan
 
Building a Winning Roadmap for Analytics
Ironside
 
Business intelligence
Randy L. Archambault
 
Introduction to Business Intelligence
Ronan Soares
 
Data Warehouse Fundamentals
Rashmi Bhat
 
Presentation on Big Data
Md. Salman Ahmed
 
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
DATAVERSITY
 
BIG DATA.pdf
naveenlingala2
 
Data Architecture Brief Overview
Hal Kalechofsky
 
Big data architecture
Dr. Jasmine Beulah Gnanadurai
 
What exactly is Business Intelligence?
James Serra
 
Etl - Extract Transform Load
ABDUL KHALIQ
 
Data Virtualization: An Essential Component of a Cloud Data Lake
Denodo
 
Master Data Management's Place in the Data Governance Landscape
CCG
 
Introduction to Big Data
Srinath Perera
 
Introduction to Power BI and Data Visualization
Swapnil Jadhav
 
What is big data?
David Wellman
 
Business intelligence
Mustafa Ali Hassan, MBA
 
Business Analytics Overview
Dr Susan Entwisle
 
Business intelligence and big data
Shäîl Rûlès
 
Ad

Similar to ETL and its impact on Business Intelligence (20)

PPTX
Business Intelligence for Data SCience Students
smartguykrish11
 
PDF
Master data management and data warehousing
Zahra Mansoori
 
PPSX
Navigating the BI Stack _
Michael Phipps
 
PPT
The Concepts Of Business Intelligence By Power BI
HendraLesmana74
 
PPT
Business Intelligence Details and Descriptions
syedas1mal1
 
PPTX
An Introduction To BI
MoniqueO Opris
 
PDF
Performance management capability
designer DATA
 
PPT
Bi concepts
anushaccc
 
PDF
Information & Data Architecture
Sammer Qader
 
DOCX
4Emerging Trends in Business IntelligenceITS 531.docx
blondellchancy
 
PPTX
Bi Architecture And Conceptual Framework
Slava Kokaev
 
PPTX
SAP BW Introduction.
Deloitte India (Offices of the US)
 
PPTX
Business Intelligence Module 2
Home
 
PPTX
Smart comp in 21st centry
Muhammad Shoaib
 
PDF
Business Intelligence
Amulya Lohani
 
PPT
Dw & etl concepts
jeshocarme
 
PPT
Datawarehousing & DSS
Deepali Raut
 
PPTX
Manish tripathi-ea-dw-bi
A P
 
PPTX
Presentasi 1 - Business Intelligence
DEDE IRYAWAN
 
PPS
Technologies
guest6cdabe
 
Business Intelligence for Data SCience Students
smartguykrish11
 
Master data management and data warehousing
Zahra Mansoori
 
Navigating the BI Stack _
Michael Phipps
 
The Concepts Of Business Intelligence By Power BI
HendraLesmana74
 
Business Intelligence Details and Descriptions
syedas1mal1
 
An Introduction To BI
MoniqueO Opris
 
Performance management capability
designer DATA
 
Bi concepts
anushaccc
 
Information & Data Architecture
Sammer Qader
 
4Emerging Trends in Business IntelligenceITS 531.docx
blondellchancy
 
Bi Architecture And Conceptual Framework
Slava Kokaev
 
Business Intelligence Module 2
Home
 
Smart comp in 21st centry
Muhammad Shoaib
 
Business Intelligence
Amulya Lohani
 
Dw & etl concepts
jeshocarme
 
Datawarehousing & DSS
Deepali Raut
 
Manish tripathi-ea-dw-bi
A P
 
Presentasi 1 - Business Intelligence
DEDE IRYAWAN
 
Technologies
guest6cdabe
 
Ad

Recently uploaded (20)

PDF
An Uncut Conversation With Grok | PDF Document
Mike Hydes
 
PPTX
Databricks-DE-Associate Certification Questions-june-2024.pptx
pedelli41
 
PPTX
Introduction to computer chapter one 2017.pptx
mensunmarley
 
PPTX
Fluvial_Civilizations_Presentation (1).pptx
alisslovemendoza7
 
PPTX
MR and reffffffvvvvvvvfversal_083605.pptx
manjeshjain
 
PDF
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
PPTX
Introduction to Biostatistics Presentation.pptx
AtemJoshua
 
PPTX
Blue and Dark Blue Modern Technology Presentation.pptx
ap177979
 
PPTX
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
PDF
Classifcation using Machine Learning and deep learning
bhaveshagrawal35
 
PDF
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
PDF
blockchain123456789012345678901234567890
tanvikhunt1003
 
PPTX
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
PPTX
INFO8116 - Week 10 - Slides.pptx data analutics
guddipatel10
 
PDF
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
PPT
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
PDF
D9110.pdfdsfvsdfvsdfvsdfvfvfsvfsvffsdfvsdfvsd
minhn6673
 
PPTX
Data-Driven Machine Learning for Rail Infrastructure Health Monitoring
Sione Palu
 
PPTX
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
PPTX
The whitetiger novel review for collegeassignment.pptx
DhruvPatel754154
 
An Uncut Conversation With Grok | PDF Document
Mike Hydes
 
Databricks-DE-Associate Certification Questions-june-2024.pptx
pedelli41
 
Introduction to computer chapter one 2017.pptx
mensunmarley
 
Fluvial_Civilizations_Presentation (1).pptx
alisslovemendoza7
 
MR and reffffffvvvvvvvfversal_083605.pptx
manjeshjain
 
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
Introduction to Biostatistics Presentation.pptx
AtemJoshua
 
Blue and Dark Blue Modern Technology Presentation.pptx
ap177979
 
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
Classifcation using Machine Learning and deep learning
bhaveshagrawal35
 
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
blockchain123456789012345678901234567890
tanvikhunt1003
 
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
INFO8116 - Week 10 - Slides.pptx data analutics
guddipatel10
 
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
D9110.pdfdsfvsdfvsdfvsdfvfvfsvfsvffsdfvsdfvsd
minhn6673
 
Data-Driven Machine Learning for Rail Infrastructure Health Monitoring
Sione Palu
 
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
The whitetiger novel review for collegeassignment.pptx
DhruvPatel754154
 

ETL and its impact on Business Intelligence

  • 1. ETL and its impact on Business Intelligence Presentedby- Isha Pande(23)
  • 2. Index Introduction to Business Intelligence Extract-Transform-Load Process Business Intelligence Architecture Five-layered BI Architecture Summary Bibilography
  • 4. Business Intelligence The term Business Intelligence (BI) refers to technologies, applications and practices for the collection, integration, analysis, and presentation of business information. The purpose of Business Intelligence is to support better business decision making. RAW DATA BI MEANINGFULDATA
  • 5. Business Intelligence Business Intelligence means better decision-making: Decision making is the important factor for the success of the organization. BI helps in the accurate and timely decision making by providing all the business information. Many large organization making faster, better and accurate decision making with the help of Business Intelligence.
  • 7. ETL (Extract-Transform-Load) Process > Extract, Transform and Load (ETL) process is important component of BI. Summary EXTRACT TRANSFORM LOAD ANALYZE
  • 8. ETL (Extract-Transform-Load) Process > Extract describes the gathering of data from various sources. > Transformation is modification to match a desired state. > Load is its import into a database or data warehouse. > Data Warehouse is predominantly used to store detailed summary data and metadata(data about data). > On the other hand, metadata include information on data themselves. They facilitate a process of extracting, transforming and loading data through presenting sources of data in the layout of data warehouses. Metadata are also used to automate summary data creation and queries management. > ETL processes take up to 80% of the effort in BI projects. A high performance is thereby vital to be able to process large amounts of data and to have a up-to-date database.
  • 10. Business Intelligence Architecture Existing BI architectures typically feature a unidirectional communication flow between different components: • The architectures only features a one-way data flow. • The limitation of unidirectional data flow is that no adjustment or correction is allowed on data source even if an error is found. • If organizations want to correct the error, they have to repeat the entire BI process especially that of the cleansing procedures again. To overcome these problems, a two-way data integration flow [14] is suggested whereby the cleansed data can be sent back to data sources to improve accuracy and reduce cleansing work.
  • 11. Business Intelligence Architecture Existing BI architectures lack support on metadata management: • A good BI architecture should include the layer of metadata. • A metadata repository is essential for business users to store and standardize metadata across different systems. • By having a well-structured metadata, organizations will be able to track and monitor data flows within their BI environment. • In addition, they will be able to ensure the consistency of definitions and descriptions of data that support BI components and thus avoid misunderstanding and misinterpretation of data
  • 12. Business Intelligence Architecture Some of the architectures do not include operational data store (ODS) within the BI environment: • It is essential to implement ODS to provide current or near current integrated information that can be accessed or updated directly by users. • Through this way, decision makers will be able to react faster to changing business environment and requirements. • It is necessary to consider data staging area in the ETL (Extract Transform- Load) process. As most of the data from data source require cleansing and transformation, it is important to create a temporary storage for data to reside prior to loading into ODS or data warehouse.
  • 15. Five-layered BI Architecture Data Source Layer: • The purpose of this layer is to identify the data source. It is very important for an or-ganization to know where all required data are from. • This layer can help not only to increase the reliability of data but also to improve data organizing. • There are two types of data sources used by BI applications: (a) Internal data source (b) External data source
  • 16. Five-layered BI Architecture Data Source Layer: (a) Internal data source: The data which is obtained and maintained by the system inside the organization such as Customer Relationship Management system (CRM) and Enterprise Resource Planning system (ERP). Internal data source also includes all business operation related data such as sales data, customers data etc. (b) External data source: All data which are organized outside the organization are called external source for example, data from partners, data from suppliers, information from governmentsand so on.
  • 17. Five-layered BI Architecture ETL Layer: • This layer focuses on three main processes: (a) Extraction (b) Transformation (c) Loading
  • 18. Five-layered BI Architecture ETL Layer: (a) Extraction: Extraction is used to read data from different sources both internally or externally. (b) Transformation: Transformation is the process that converts the extracted data from its original form into the form which is needed. (c) Loading: Loading process is defined to put data into data warehouse. • The purpose of ETL layer is to guarantee that all extracted data are cleansed and are converted into right form and the data loads into target warehouse.
  • 19. Five-layered BI Architecture Data Warehouse Layer: • This layer provides the storage for data which are in desired and reliable formats for further use whenever and wherever needed. • There are three components in data warehouse layer: (a) ODS (b) Data Warehouse (c) Data Marts
  • 20. Five-layered BI Architecture Data Warehouse Layer: (a) ODS: Operational data store (ODS) is used to integrate data and load them into data ware-house. ODS operatesas a short-term memory since the data in ODS is updated frequently. (b) Data Warehouse: Data warehouse is a central storage of structured data. And all data in data warehouse are ready for analytical use. Unlike ODS, data warehouse is a long-term memory as the data are permanent. Both current and history data are stored in data warehouses. It is a subject- oriented, integrated, time-variant and non-volatile collection of data to support the managements’ decision making process. (c) Data Marts: Data mart can be considered as a sub-set of data warehouse. Usually, data mart consists of data from one single subject area and it is created directly from the data warehouse.
  • 21. Five-layered BI Architecture End User Layer: • The end user layer consists of tools that display information in different formats to different users. These tools can be grouped hierarchically in a pyramid shape. As one moves from the bottom to the top of the pyramid, the degree of comprehensiveness at which data are being processed increases.
  • 22. Five-layered BI Architecture Metadata Layer: • Metadata refers to data about data. • It describes where data are being used and stored, the source of data, what changes have been made to the data, and how one piece of data relates to other information. • In this architecture, Metadata management is applied to all other four layers. • The purpose of this layer is that by using the well- structured metadata, organizations is able to track and monitor the data flow in a BI solution. • Furthermore, it can help to avoid misun- derstanding of data.
  • 23. Summary • BI supports decision making. • Extract, Transform and Load (ETL) process is important component of BI. • The Five-layered BI architecture draws a big picture showing how BI solution works. • Data are gathered from both internal and external sources. Then these sources will be extracted, transformed and loaded into data warehouse for further analytical use. • End users use different tools to analyze the data to meet the business requirements or needs. • All the data are managed by the metadata layer in the architecture.
  • 24. Bibilography 1) Nitin Anand. ETL and its impact on Business Intelligence. New Delhi, February 2014 2) In Lih Ong, Pei Hwa Siew and Siew Fan Wong. A Five-Layered Business Intelligence Architecture. Selangor, Malaysia, 2011 3) Mengwei Lu. DiscoveringMicrosoft Self-serviceBI solution: Power BI. Helsinki, Finland, May 2014