T24
Special Topics
5/5/16 15:00
Uncover Untold Stories in Your Data: A Deep Dive on Data Profiling
Presented by:
Catherine Cruz Agosto
Shauna Ayers
Availity, LLC
Brought to you by:
350 Corporate Way, Suite 400, Orange Park, FL 32073
888-268-8770 · 904-278-0524 - info@techwell.com - http://www.stareast.techwell.com/
Catherine Cruz Agosto
Availity, LLC
Catherine Cruz Agosto found that her software engineering experience at
Baxter Healthcare and Boeing-subsidiary Insitu provided an excellent foundation
for finding more effective and user-friendly approaches to complex technical
problems. Catherine has developed more efficient and innovative data quality
testing solutions at healthcare intermediary Availity, expanding their automated
data quality testing processes to accommodate diverse and dissimilar data
sources, facilitating analysis, testing, and controls for data integration, analytics,
and healthcare data reporting.
Shauna Ayers
Availity, LLC
Shauna Ayers has been untangling the Gordian knots of IT systems for more than
seventeen years, analyzing data systems and testing both software and data
quality in the manufacturing, medical device, and healthcare industries. Shauna
found her passion in developing creative solutions for the analysis and testing of
sensitive and highly-regulated data sets at industry leaders such as Blue Cross
Blue Shield of Florida (now Florida Blue), Vistakon (a subsidiary of Johnson &
Johnson), and Availity.
Uncovering the Untold Stories of Your Data
A Deep Dive on Data Profiling
By Catherine Cruz Agosto
and Shauna Ayers
Overview
• Define Data Profiling
• Importance of Profiling
• Profiling in action
– Tools?
– Lifecycle and need
– Classification
• Conclusion
Define Data Profiling
• Myths
– You need to buy a profiling tool
– Profiling tools do all the work to get the data metrics
– Profiling is a one-time activity
• Definitions
– Data profiling: The ongoing process of examining data
from one or multiple sources (e.g., databases, files)
and collecting meaningful metrics and statistics
to gain a greater understanding of the data and its
quality in its appropriate context.
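As a rough illustration of the "metrics and statistics" in the definition above, the sketch below profiles a single column using only the Python standard library. The helper name `profile_column`, the choice of metrics, and the sample values are assumptions made for illustration; they are not taken from the talk.

```python
from collections import Counter


def profile_column(values):
    """Return basic profile metrics for one column of raw values.

    Hypothetical helper: None and empty strings are treated as missing.
    """
    total = len(values)
    present = [v for v in values if v is not None and v != ""]
    counts = Counter(present)
    missing = total - len(present)
    return {
        "row_count": total,
        "null_count": missing,
        "null_rate": missing / total if total else 0.0,
        "distinct_count": len(counts),
        "most_common": counts.most_common(3),
        "min": min(present) if present else None,
        "max": max(present) if present else None,
    }


# Made-up sample: state codes with gaps and inconsistent casing
states = ["FL", "FL", "GA", None, "fl", "", "TX"]
profile = profile_column(states)
for metric, value in profile.items():
    print(f"{metric}: {value}")
```

The numbers alone do not say whether "fl" is a defect or a legitimate value; that judgment depends on the data's context, which is the point of the definition.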
Importance of Profiling
• Communicate Health and Status more
effectively
• Get people of different roles on the same page
• Identify Revenue Opportunities
• Identify Operational Gaps and Risks
Profiling in action: Tools?
• Tools: Uses
– Visibility
– Monitoring
– Trends
• Tools: To buy or not to buy
– Reminder: You do not need a fancy tool in order to profile
– Common tools: Informatica Data Quality (IDQ), Datamartist, Microsoft SSIS Data Profiling Task
• All tools have some sort of limitations
• Can get expensive
– Creating your own tools
• Seems more time-consuming up front
• More control and fewer limitations than off-the-shelf tools
In the end, what matters is how you use the data.
Profiling in action: Lifecycle and Need
• Dimensions of Profiling
– Properties of Data (Quality)
– Movement of Data (Flow)
– Usage of Data (Business Need)
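One possible way to profile the "movement" dimension is to reconcile what left a source against what arrived at a target. The sketch below is a minimal example under assumed conditions: records are plain dictionaries, and the `reconcile` helper and the `claim_id` key are hypothetical names, not something the slides specify.

```python
def reconcile(source_rows, target_rows, key):
    """Compare two lists of dict records by a key field (hypothetical helper)."""
    source_keys = {row[key] for row in source_rows}
    target_keys = {row[key] for row in target_rows}
    return {
        "source_count": len(source_rows),
        "target_count": len(target_rows),
        # Keys that left the source but never arrived
        "missing_in_target": sorted(source_keys - target_keys),
        # Keys that appear in the target with no source record
        "unexpected_in_target": sorted(target_keys - source_keys),
    }


# Made-up records standing in for a source extract and a loaded target
source = [{"claim_id": 1}, {"claim_id": 2}, {"claim_id": 3}]
target = [{"claim_id": 1}, {"claim_id": 3}, {"claim_id": 4}]
result = reconcile(source, target, "claim_id")
print(result)
```

Matching row counts (three and three here) can hide a dropped record and an unexpected one, which is why the key-level comparison is included.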
Profiling in action: Lifecycle and Need (continued)
• Profiling for planned changes
– e.g., new projects, enhancements
• Profiling for unplanned changes
– Changes within the data itself
– Monitoring
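Monitoring for unplanned changes can be approximated by comparing a current profile against a stored baseline. The sketch below assumes each profile is a dictionary of numeric metrics; the `compare_profiles` helper and the sample figures are illustrative assumptions, not the presenters' method.

```python
def compare_profiles(baseline, current):
    """Return per-metric change between two profile snapshots.

    Both arguments are dicts of metric name -> number (an assumed format).
    Only metrics present in both snapshots are compared.
    """
    changes = {}
    for metric in sorted(baseline.keys() & current.keys()):
        old, new = baseline[metric], current[metric]
        changes[metric] = {
            "baseline": old,
            "current": new,
            "delta": new - old,
            # Percentage change is undefined when the baseline is zero
            "pct_change": None if old == 0 else (new - old) * 100 / old,
        }
    return changes


# Made-up snapshots taken at two points in time
baseline = {"row_count": 1000, "null_count": 10, "distinct_count": 50}
current = {"row_count": 1100, "null_count": 40, "distinct_count": 50}
changes = compare_profiles(baseline, current)
for metric, change in changes.items():
    print(metric, change)
```

In this made-up case the row count grew modestly while nulls quadrupled, the kind of shift within the data itself that a one-time profile would miss.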
Profiling in action: Classification
• Datasets
• Velocity and Functional Flow
• Properties
• Relationships
• Tolerances
• Business Value
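Tolerances lend themselves to a simple automated check once the acceptable ranges have been agreed. The sketch below is one hypothetical shape for that check: the `check_tolerances` helper, the metric names, and the thresholds are all assumptions chosen for illustration.

```python
def check_tolerances(metrics, tolerances):
    """Return (metric, value, reason) for every metric outside its tolerance.

    `tolerances` maps metric name -> (low, high); None means unbounded.
    A metric with a tolerance but no measured value is also reported.
    """
    violations = []
    for name, (low, high) in tolerances.items():
        if name not in metrics:
            violations.append((name, None, "metric missing"))
            continue
        value = metrics[name]
        if low is not None and value < low:
            violations.append((name, value, f"below minimum {low}"))
        elif high is not None and value > high:
            violations.append((name, value, f"above maximum {high}"))
    return violations


# Made-up measurements and agreed ranges
metrics = {"null_rate": 0.08, "row_count": 950, "distinct_count": 50}
tolerances = {
    "null_rate": (None, 0.05),
    "row_count": (900, 1200),
    "duplicate_rate": (None, 0.01),
}
violations = check_tolerances(metrics, tolerances)
for violation in violations:
    print(violation)
```

Setting the ranges themselves remains a business decision; the code only reports where measured values fall outside them.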
Conclusion
• Profiling is dynamic.
• Technology can improve profiling.
• Profiling includes qualitative analysis that
cannot be done solely via machine.
• Profiling can help improve both strategic and
operational outcomes… if you do it right.

