SlideShare a Scribd company logo
9
Most read
11
Most read
20
Most read
© 2019 IDERA, Inc. All rights reserved.
STRAIGHT TALK TO
DEMYSTIFY DATA LINEAGE
© 2019 IDERA, Inc. All rights reserved. 2
DRIVING ENTERPRISE DATA GOVERNANCE
▪ Key drivers for instituting data governance:
• Improved information utilization
• Better data quality
• Improved interoperability
• Improved technical operationalization
• Reduced operational costs
• Streamlined design and development
• Improved business accountability
• Compliance with data use agreements
• Compliance with regulatory demands
• Improved business results
• Trustworthy analytics
• Trustworthy reporting
© 2019 IDERA, Inc. All rights reserved.
OBJECTIVES OF A DATA GOVERNANCE PROGRAM
Understand and interpret
business data
dependencies
Define and
approve data
policies
Develop procedures for
operationalization
Continuously
monitor
compliance
© 2019 IDERA, Inc. All rights reserved.
DATA LINEAGE POWERS DATA GOVERNANCE
▪ Data lineage methods help to
develop a map of the enterprise
data landscape
▪ Data lineage provides a holistic
description of each data object’s
• Sources
• Information pipelines
• Transformations
• Methods of access
• Controls
• All other fundamental aspects of
information utility
© 2019 IDERA, Inc. All rights reserved.
ASPECTS OF DATA LINEAGE
Business
lineage
Technical
lineage
Procedural
lineage
The semantic aspects
of tracing data meaning
and usage semantics
The structural aspects
of data element
concepts and their use
across the enterprise
A trace of data's
journey through
different systems and
data stores, providing
an audit trail of the
changes along the way
Data lineage combines
three different aspects of
corporate metadata:
© 2019 IDERA, Inc. All rights reserved.
TECHNIQUES SUPPORTING LINEAGE
Policy management
Glossary
Business Process Model
© 2019 IDERA, Inc. All rights reserved.
BUSINESS LINEAGE
▪ Inventory and description of
business characteristics of data
assets captured within a data
catalog, accumulating
information such as:
• Data asset description
• Business glossary
• Data asset location
• Data sensitivity
• Access rights
© 2019 IDERA, Inc. All rights reserved.
TECHNICAL/STRUCTURAL LINEAGE
▪ Catalogs which data element concepts are used
▪ Notes how data element concepts are
manifested as data elements within specific
data assets
▪ Not limited to static data sets
• Data in motion
• Manifestation of data element concepts in dynamic
contexts such as reports and feature sets for analysis
© 2019 IDERA, Inc. All rights reserved.
PROCEDURAL LINEAGE
▪ Identify the original
introduction of data elements
▪ Establish the process flow
for data elements that are
central to data policy
compliance
▪ Draft a mapping of data
element use to the business
application touch points
▪ Determine where data
instances are created,
updated, or just read
▪ Document transformations
applied
© 2019 IDERA, Inc. All rights reserved.
BENEFITS OF DATA LINEAGE
▪ Analyzing data dependencies
▪ Validating semantic consistency
▪ Impact analysis
▪ Data quality root cause analysis
▪ Integrating data controls
▪ Enforcing regulatory compliance
▪ Protecting sensitive data
Resulting in:
▪ Better data quality
▪ Better business decisions
© 2019 IDERA, Inc. All rights reserved.
ANALYZING DATA DEPENDENCIES
▪ Unexposed data dependencies
introduce risks in ensuring high-
quality usable data
• Reports, dashboards, and
analyses may appear to be
derived from data sets from
isolated systems, but in many
cases there is a chain of
processing that ultimately
originates with data taken from a
shared data source
• Multiple data sets may be
populated using data from distinct
yet structurally and semantically
equivalent sources
?
=
© 2019 IDERA, Inc. All rights reserved.
VALIDATING SEMANTIC CONSISTENCY
Social Security
Number
Identifier
Unique number assigned by
Social Security Administration
Authentication
Last four digits of number
assigned by the Social Security
Administration
Identifier
Unique number assigned by the
company
Customer ID
© 2019 IDERA, Inc. All rights reserved.
IMPACT ANALYSIS
▪ External drivers and directives may demand
changes to organizational information systems
▪ Data lineage allows forward-dependency tracing
to identify downstream systems impacted by
changes to
• Business term definitions
• Data element specifications
• Augmentation of data element semantics
• Changes in business process flow
© 2019 IDERA, Inc. All rights reserved.
ISSUE ROOT CAUSE ANALYSIS
▪ Data lineage maps the
information production
flow
▪ A data steward can
use the lineage maps
to reverse-trace back
through the data
production flow
▪ Enables identification
of the point of
introduction of a data
error
© 2019 IDERA, Inc. All rights reserved.
INTEGRATED DATA CONTROLS
▪ Identification of “problem
spots” and key phases in
business information flows
highlight opportunities for
integrated data controls
▪ Data controls validate data
flowing through selected
processing phases
▪ Alerts are generated when
invalid data values are
identified
© 2019 IDERA, Inc. All rights reserved.
ENFORCING DATA POLICIES
▪ Data policies can be formulated to reflect
externally-imposed data compliance requirements
▪ Business lineage is used to
• Capture external policy definitions
• Standardize semantics across different application usage
of shared data element concepts
▪ Technical lineage allows for
• Standardized specifications for data element validation
• Institution of audit controls for demonstrating compliance
▪ Examples:
• GDPR
• CCPA
• 12 CFR Part 11
• HIPAA Privacy Rule
© 2019 IDERA, Inc. All rights reserved.
PROTECTING SENSITIVE DATA
▪ Business lineage traces origin and levels of data
sensitivity
▪ Coupled with procedural lineage allows for
insertion of data protection techniques
• Encryption at rest
• Encryption in motion
• Data masking
• Access controls
© 2019 IDERA, Inc. All rights reserved.
SOME QUESTIONS DATA LINEAGE CAN ANSWER
▪ To understand organizational data
• What’s important?
• Where is it? (can be may places)
• Where did it come from?
• How is it used (business processes)?
• What is the chain of custody?
• What are the business rules?
▪ To support governance
• How do I identify private information?
• How long should I keep the information?
• Master Data Management classification
• Data quality
• Is it fit for purpose?
• What changed and why?
© 2019 IDERA, Inc. All rights reserved.
CONSIDERATIONS
▪ Data lineage augments the corporate toolkit for deploying data
governance
▪ Look for products that simplify the data steward’s consumption
of data lineage mappings, and have:
• The ability to enable users to see the flow of data through the data
production lifecycle
• A mechanism for enumerating the data sources for the different data
pipelines
• The ability to identify data elements and link them to data models and
to metadata for data element concepts and business glossaries
• A method of documenting data transformations and allowing data
professionals to review those transformations across a variety of data
pipelines
• The capability of interoperating with existing ETL/data integration tools
to import data pipelines along with their collected transformations.
• A means for collaboration around data pipelines and associated
metadata
• The ability to display a visual presentation allowing data stewards to
review the data lineage
© 2019 IDERA, Inc. All rights reserved.
THANKS!
Any questions?
Learn more at:
www.idera.com
20

More Related Content

What's hot (20)

PDF
Activate Data Governance Using the Data Catalog
DATAVERSITY
 
PDF
Five Things to Consider About Data Mesh and Data Governance
DATAVERSITY
 
PDF
Data Quality Best Practices
DATAVERSITY
 
PDF
Modern Data architecture Design
Kujambu Murugesan
 
PPTX
Data Governance Intro.pptx
BHARATH KUNAMNENI
 
PDF
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
DATAVERSITY
 
PDF
DAS Slides: Master Data Management – Aligning Data, Process, and Governance
DATAVERSITY
 
PDF
Data modelling 101
Christopher Bradley
 
PPTX
Data Quality & Data Governance
Tuba Yaman Him
 
PPTX
Data Governance Best Practices
Boris Otto
 
PDF
Snowflake Company Presentation
AndrewJiang18
 
PDF
MDM for product data with Talend
Jean-Michel Franco
 
PPT
Data Architecture for Data Governance
DATAVERSITY
 
PDF
Considerations for Data Access in the Lakehouse
Databricks
 
PDF
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
DATAVERSITY
 
PDF
Webinar Data Mesh - Part 3
Jeffrey T. Pollock
 
PDF
Collibra - Forrester Presentation : Data Governance 2.0
Guillaume LE GALIARD
 
PDF
Data Architecture - The Foundation for Enterprise Architecture and Governance
DATAVERSITY
 
PDF
Business Intelligence (BI) and Data Management Basics
amorshed
 
PPTX
Data Lakehouse Symposium | Day 4
Databricks
 
Activate Data Governance Using the Data Catalog
DATAVERSITY
 
Five Things to Consider About Data Mesh and Data Governance
DATAVERSITY
 
Data Quality Best Practices
DATAVERSITY
 
Modern Data architecture Design
Kujambu Murugesan
 
Data Governance Intro.pptx
BHARATH KUNAMNENI
 
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
DATAVERSITY
 
DAS Slides: Master Data Management – Aligning Data, Process, and Governance
DATAVERSITY
 
Data modelling 101
Christopher Bradley
 
Data Quality & Data Governance
Tuba Yaman Him
 
Data Governance Best Practices
Boris Otto
 
Snowflake Company Presentation
AndrewJiang18
 
MDM for product data with Talend
Jean-Michel Franco
 
Data Architecture for Data Governance
DATAVERSITY
 
Considerations for Data Access in the Lakehouse
Databricks
 
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
DATAVERSITY
 
Webinar Data Mesh - Part 3
Jeffrey T. Pollock
 
Collibra - Forrester Presentation : Data Governance 2.0
Guillaume LE GALIARD
 
Data Architecture - The Foundation for Enterprise Architecture and Governance
DATAVERSITY
 
Business Intelligence (BI) and Data Management Basics
amorshed
 
Data Lakehouse Symposium | Day 4
Databricks
 

Similar to Straight Talk to Demystify Data Lineage (20)

PDF
Understanding data lineage: Enabling Security Investigations | The Enterprise...
Enterprise world
 
PDF
Foundational Strategies for Trust in Big Data Part 3: Data Lineage
Precisely
 
PDF
Subscribing to Your Critical Data Supply Chain - Getting Value from True Data...
DATAVERSITY
 
PPTX
The art of implementing data lineage
Leigh Hill
 
PDF
Strategic imperative the enterprise data model
DATAVERSITY
 
PPTX
An Agile & Adaptive Approach to Addressing Financial Services Regulations and...
Neo4j
 
PDF
Getting Started with Data Governance? Use Process Models!
DATAVERSITY
 
PDF
Data Lineage: Using Knowledge Graphs for Deeper Insights into Your Data Pipel...
Neo4j
 
PDF
How to get data lineage right
Leigh Hill
 
PDF
Top Data Provenance Tools: Enhance Data Governance and Transparency
knowledgenile
 
PPTX
How to establish a sustainable solution for data lineage
Leigh Hill
 
PDF
Data lineage to drive compliance and as a business imperative
Leigh Hill
 
PPTX
Who changed my data? Need for data governance and provenance in a streaming w...
DataWorks Summit
 
PDF
Top 10 Artifacts Needed For Data Governance
First San Francisco Partners
 
PDF
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
CCG
 
PPTX
Foundational Strategies for Trusted Data: Getting Your Data to the Cloud
Precisely
 
DOCX
Project Documentation
Rohan Reddy
 
PDF
Modernizing Data Management Through Metadata
MANTA
 
PPTX
Foundational Strategies for Trusted Data: Getting Your Data to the Cloud
Precisely
 
DOCX
Global Data Management: Governance, Security and Usefulness in a Hybrid World
Neil Raden
 
Understanding data lineage: Enabling Security Investigations | The Enterprise...
Enterprise world
 
Foundational Strategies for Trust in Big Data Part 3: Data Lineage
Precisely
 
Subscribing to Your Critical Data Supply Chain - Getting Value from True Data...
DATAVERSITY
 
The art of implementing data lineage
Leigh Hill
 
Strategic imperative the enterprise data model
DATAVERSITY
 
An Agile & Adaptive Approach to Addressing Financial Services Regulations and...
Neo4j
 
Getting Started with Data Governance? Use Process Models!
DATAVERSITY
 
Data Lineage: Using Knowledge Graphs for Deeper Insights into Your Data Pipel...
Neo4j
 
How to get data lineage right
Leigh Hill
 
Top Data Provenance Tools: Enhance Data Governance and Transparency
knowledgenile
 
How to establish a sustainable solution for data lineage
Leigh Hill
 
Data lineage to drive compliance and as a business imperative
Leigh Hill
 
Who changed my data? Need for data governance and provenance in a streaming w...
DataWorks Summit
 
Top 10 Artifacts Needed For Data Governance
First San Francisco Partners
 
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
CCG
 
Foundational Strategies for Trusted Data: Getting Your Data to the Cloud
Precisely
 
Project Documentation
Rohan Reddy
 
Modernizing Data Management Through Metadata
MANTA
 
Foundational Strategies for Trusted Data: Getting Your Data to the Cloud
Precisely
 
Global Data Management: Governance, Security and Usefulness in a Hybrid World
Neil Raden
 
Ad

More from DATAVERSITY (20)

PDF
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
DATAVERSITY
 
PDF
Data at the Speed of Business with Data Mastering and Governance
DATAVERSITY
 
PDF
Exploring Levels of Data Literacy
DATAVERSITY
 
PDF
Building a Data Strategy – Practical Steps for Aligning with Business Goals
DATAVERSITY
 
PDF
Make Data Work for You
DATAVERSITY
 
PDF
Data Catalogs Are the Answer – What is the Question?
DATAVERSITY
 
PDF
Data Catalogs Are the Answer – What Is the Question?
DATAVERSITY
 
PDF
Data Modeling Fundamentals
DATAVERSITY
 
PDF
Showing ROI for Your Analytic Project
DATAVERSITY
 
PDF
How a Semantic Layer Makes Data Mesh Work at Scale
DATAVERSITY
 
PDF
Is Enterprise Data Literacy Possible?
DATAVERSITY
 
PDF
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 
PDF
Emerging Trends in Data Architecture – What’s the Next Big Thing?
DATAVERSITY
 
PDF
Data Governance Trends - A Look Backwards and Forwards
DATAVERSITY
 
PDF
Data Governance Trends and Best Practices To Implement Today
DATAVERSITY
 
PDF
2023 Trends in Enterprise Analytics
DATAVERSITY
 
PDF
Data Strategy Best Practices
DATAVERSITY
 
PDF
Who Should Own Data Governance – IT or Business?
DATAVERSITY
 
PDF
Data Management Best Practices
DATAVERSITY
 
PDF
MLOps – Applying DevOps to Competitive Advantage
DATAVERSITY
 
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
DATAVERSITY
 
Exploring Levels of Data Literacy
DATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
DATAVERSITY
 
Make Data Work for You
DATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
DATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
DATAVERSITY
 
Data Modeling Fundamentals
DATAVERSITY
 
Showing ROI for Your Analytic Project
DATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
DATAVERSITY
 
Is Enterprise Data Literacy Possible?
DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
DATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
DATAVERSITY
 
2023 Trends in Enterprise Analytics
DATAVERSITY
 
Data Strategy Best Practices
DATAVERSITY
 
Who Should Own Data Governance – IT or Business?
DATAVERSITY
 
Data Management Best Practices
DATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
DATAVERSITY
 
Ad

Recently uploaded (20)

PDF
How to Connect Your On-Premises Site to AWS Using Site-to-Site VPN.pdf
Tamanna
 
PPTX
ER_Model_Relationship_in_DBMS_Presentation.pptx
dharaadhvaryu1992
 
PPTX
apidays Helsinki & North 2025 - Running a Successful API Program: Best Practi...
apidays
 
PPTX
ER_Model_with_Diagrams_Presentation.pptx
dharaadhvaryu1992
 
PPTX
apidays Helsinki & North 2025 - APIs at Scale: Designing for Alignment, Trust...
apidays
 
PDF
Web Scraping with Google Gemini 2.0 .pdf
Tamanna
 
PPTX
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
PPTX
apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)
apidays
 
PPTX
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
PDF
JavaScript - Good or Bad? Tips for Google Tag Manager
📊 Markus Baersch
 
PDF
What does good look like - CRAP Brighton 8 July 2025
Jan Kierzyk
 
PDF
Context Engineering for AI Agents, approaches, memories.pdf
Tamanna
 
PPTX
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
PDF
Early_Diabetes_Detection_using_Machine_L.pdf
maria879693
 
PPTX
The _Operations_on_Functions_Addition subtruction Multiplication and Division...
mdregaspi24
 
PPTX
Advanced_NLP_with_Transformers_PPT_final 50.pptx
Shiwani Gupta
 
PPTX
Numbers of a nation: how we estimate population statistics | Accessible slides
Office for National Statistics
 
PDF
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
PDF
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
PPTX
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
How to Connect Your On-Premises Site to AWS Using Site-to-Site VPN.pdf
Tamanna
 
ER_Model_Relationship_in_DBMS_Presentation.pptx
dharaadhvaryu1992
 
apidays Helsinki & North 2025 - Running a Successful API Program: Best Practi...
apidays
 
ER_Model_with_Diagrams_Presentation.pptx
dharaadhvaryu1992
 
apidays Helsinki & North 2025 - APIs at Scale: Designing for Alignment, Trust...
apidays
 
Web Scraping with Google Gemini 2.0 .pdf
Tamanna
 
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)
apidays
 
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
JavaScript - Good or Bad? Tips for Google Tag Manager
📊 Markus Baersch
 
What does good look like - CRAP Brighton 8 July 2025
Jan Kierzyk
 
Context Engineering for AI Agents, approaches, memories.pdf
Tamanna
 
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
Early_Diabetes_Detection_using_Machine_L.pdf
maria879693
 
The _Operations_on_Functions_Addition subtruction Multiplication and Division...
mdregaspi24
 
Advanced_NLP_with_Transformers_PPT_final 50.pptx
Shiwani Gupta
 
Numbers of a nation: how we estimate population statistics | Accessible slides
Office for National Statistics
 
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 

Straight Talk to Demystify Data Lineage

  • 1. © 2019 IDERA, Inc. All rights reserved. STRAIGHT TALK TO DEMYSTIFY DATA LINEAGE
  • 2. © 2019 IDERA, Inc. All rights reserved. 2 DRIVING ENTERPRISE DATA GOVERNANCE ▪ Key drivers for instituting data governance: • Improved information utilization • Better data quality • Improved interoperability • Improved technical operationalization • Reduced operational costs • Streamlined design and development • Improved business accountability • Compliance with data use agreements • Compliance with regulatory demands • Improved business results • Trustworthy analytics • Trustworthy reporting
  • 3. © 2019 IDERA, Inc. All rights reserved. OBJECTIVES OF A DATA GOVERNANCE PROGRAM Understand and interpret business data dependencies Define and approve data policies Develop procedures for operationalization Continuously monitor compliance
  • 4. © 2019 IDERA, Inc. All rights reserved. DATA LINEAGE POWERS DATA GOVERNANCE ▪ Data lineage methods help to develop a map of the enterprise data landscape ▪ Data lineage provides a holistic description of each data object’s • Sources • Information pipelines • Transformations • Methods of access • Controls • All other fundamental aspects of information utility
  • 5. © 2019 IDERA, Inc. All rights reserved. ASPECTS OF DATA LINEAGE Business lineage Technical lineage Procedural lineage The semantic aspects of tracing data meaning and usage semantics The structural aspects of data element concepts and their use across the enterprise A trace of data's journey through different systems and data stores, providing an audit trail of the changes along the way Data lineage combines three different aspects of corporate metadata:
  • 6. © 2019 IDERA, Inc. All rights reserved. TECHNIQUES SUPPORTING LINEAGE Policy management Glossary Business Process Model
  • 7. © 2019 IDERA, Inc. All rights reserved. BUSINESS LINEAGE ▪ Inventory and description of business characteristics of data assets captured within a data catalog, accumulating information such as: • Data asset description • Business glossary • Data asset location • Data sensitivity • Access rights
  • 8. © 2019 IDERA, Inc. All rights reserved. TECHNICAL/STRUCTURAL LINEAGE ▪ Catalogs which data element concepts are used ▪ Notes how data element concepts are manifested as data elements within specific data assets ▪ Not limited to static data sets • Data in motion • Manifestation of data element concepts in dynamic contexts such as reports and feature sets for analysis
  • 9. © 2019 IDERA, Inc. All rights reserved. PROCEDURAL LINEAGE ▪ Identify the original introduction of data elements ▪ Establish the process flow for data elements that are central to data policy compliance ▪ Draft a mapping of data element use to the business application touch points ▪ Determine where data instances are created, updated, or just read ▪ Document transformations applied
  • 10. © 2019 IDERA, Inc. All rights reserved. BENEFITS OF DATA LINEAGE ▪ Analyzing data dependencies ▪ Validating semantic consistency ▪ Impact analysis ▪ Data quality root cause analysis ▪ Integrating data controls ▪ Enforcing regulatory compliance ▪ Protecting sensitive data Resulting in: ▪ Better data quality ▪ Better business decisions
  • 11. © 2019 IDERA, Inc. All rights reserved. ANALYZING DATA DEPENDENCIES ▪ Unexposed data dependencies introduce risks in ensuring high- quality usable data • Reports, dashboards, and analyses may appear to be derived from data sets from isolated systems, but in many cases there is a chain of processing that ultimately originates with data taken from a shared data source • Multiple data sets may be populated using data from distinct yet structurally and semantically equivalent sources ? =
  • 12. © 2019 IDERA, Inc. All rights reserved. VALIDATING SEMANTIC CONSISTENCY Social Security Number Identifier Unique number assigned by Social Security Administration Authentication Last four digits of number assigned by the Social Security Administration Identifier Unique number assigned by the company Customer ID
  • 13. © 2019 IDERA, Inc. All rights reserved. IMPACT ANALYSIS ▪ External drivers and directives may demand changes to organizational information systems ▪ Data lineage allows forward-dependency tracing to identify downstream systems impacted by changes to • Business term definitions • Data element specifications • Augmentation of data element semantics • Changes in business process flow
  • 14. © 2019 IDERA, Inc. All rights reserved. ISSUE ROOT CAUSE ANALYSIS ▪ Data lineage maps the information production flow ▪ A data steward can use the lineage maps to reverse-trace back through the data production flow ▪ Enables identification of the point of introduction of a data error
  • 15. © 2019 IDERA, Inc. All rights reserved. INTEGRATED DATA CONTROLS ▪ Identification of “problem spots” and key phases in business information flows highlight opportunities for integrated data controls ▪ Data controls validate data flowing through selected processing phases ▪ Alerts are generated when invalid data values are identified
  • 16. © 2019 IDERA, Inc. All rights reserved. ENFORCING DATA POLICIES ▪ Data policies can be formulated to reflect externally-imposed data compliance requirements ▪ Business lineage is used to • Capture external policy definitions • Standardize semantics across different application usage of shared data element concepts ▪ Technical lineage allows for • Standardized specifications for data element validation • Institution of audit controls for demonstrating compliance ▪ Examples: • GDPR • CCPA • 12 CFR Part 11 • HIPAA Privacy Rule
  • 17. © 2019 IDERA, Inc. All rights reserved. PROTECTING SENSITIVE DATA ▪ Business lineage traces origin and levels of data sensitivity ▪ Coupled with procedural lineage allows for insertion of data protection techniques • Encryption at rest • Encryption in motion • Data masking • Access controls
  • 18. © 2019 IDERA, Inc. All rights reserved. SOME QUESTIONS DATA LINEAGE CAN ANSWER ▪ To understand organizational data • What’s important? • Where is it? (can be may places) • Where did it come from? • How is it used (business processes)? • What is the chain of custody? • What are the business rules? ▪ To support governance • How do I identify private information? • How long should I keep the information? • Master Data Management classification • Data quality • Is it fit for purpose? • What changed and why?
  • 19. © 2019 IDERA, Inc. All rights reserved. CONSIDERATIONS ▪ Data lineage augments the corporate toolkit for deploying data governance ▪ Look for products that simplify the data steward’s consumption of data lineage mappings, and have: • The ability to enable users to see the flow of data through the data production lifecycle • A mechanism for enumerating the data sources for the different data pipelines • The ability to identify data elements and link them to data models and to metadata for data element concepts and business glossaries • A method of documenting data transformations and allowing data professionals to review those transformations across a variety of data pipelines • The capability of interoperating with existing ETL/data integration tools to import data pipelines along with their collected transformations. • A means for collaboration around data pipelines and associated metadata • The ability to display a visual presentation allowing data stewards to review the data lineage
  • 20. © 2019 IDERA, Inc. All rights reserved. THANKS! Any questions? Learn more at: www.idera.com 20