© 2020 Cathrine Wilhelmsen (hi@cathrinew.net)
Understanding Azure Data Factory: The What, When, and Why
Cathrine Wilhelmsen
NIC · February 6th, 2019
Understanding Azure Data Factory
@cathrinew
cathrinew.net
Data Warehousing
Business Intelligence
Artificial Intelligence
Big Data and Analytics
Machine Learning
Data Science
What?
When?
Why?
Collect
Store
Transform
Integrate
Prepare
Azure Data Factory
What is Azure Data Factory?
What can you do in Azure Data Factory?
Copy Data
Transform Data
What is inside Azure Data Factory?
Pipelines
Activities
Datasets
Linked Services
Integration Runtimes
Triggers
Templates
DEMO
Let's look inside Azure Data Factory!
What is the Copy Data Activity?
*
* Cathrine's opinion :)
Copy Data Process: Binary Files
Source Sink
Copy Data Process: Complex Files
Source
Serialization / Deserialization
Compression / Decompression
Column Mapping
Sink
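The steps named on the "Copy Data Process: Complex Files" slide (decompression, deserialization, column mapping, serialization, compression) can be sketched in plain Python. This is only an illustration of the concept, not how Azure Data Factory implements the Copy Data activity. The function name, the gzip/CSV/JSON formats, and the column names are all assumptions made up for the example.

```python
import csv
import gzip
import io
import json


def copy_complex_file(source_bytes, column_mapping):
    """Illustrative copy: gzip-compressed CSV in, gzip-compressed JSON lines out.

    column_mapping maps source column names to sink column names
    (hypothetical names, chosen only for this sketch).
    """
    # Decompression: turn the compressed source bytes back into text.
    text = gzip.decompress(source_bytes).decode("utf-8")

    # Deserialization: parse the CSV text into row dictionaries.
    rows = list(csv.DictReader(io.StringIO(text)))

    # Column mapping: rename source columns to sink columns.
    mapped = [
        {sink_col: row[source_col] for source_col, sink_col in column_mapping.items()}
        for row in rows
    ]

    # Serialization: write each row as one JSON document per line.
    payload = "\n".join(json.dumps(row) for row in mapped)

    # Compression: compress the serialized payload before writing to the sink.
    return gzip.compress(payload.encode("utf-8"))


# Usage: a tiny in-memory "source file" copied to an in-memory "sink file".
source = gzip.compress(b"id,name\n1,Ada\n2,Grace\n")
sink = copy_complex_file(source, {"id": "CustomerId", "name": "CustomerName"})
print(gzip.decompress(sink).decode("utf-8"))
```

For a binary copy, as on the previous slide, none of these steps would be needed: the bytes would pass from source to sink unchanged.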
DEMO
Let's copy some data!
What if my systems are on-premises?
Hybrid Azure Data Factory
What are Integration Runtimes?
Azure Integration Runtime
Self-Hosted Integration Runtime
Copy Data Scenarios
DEMO
Let's connect to an on-prem SQL Server!
Ok, so we can copy data…
Copy Data
Transform Data
…what about transforming data?
Copy Data
Transform Data
Mapping or Wrangling
What are Mapping Data Flows?
How do Mapping Data Flows work?
What are Wrangling Data Flows?
How do Wrangling Data Flows work?
Mapping Data Flows
Wrangling Data Flows
DEMO
Let's transform some data!
How do we schedule data pipelines?
Trigger pipelines…
Triggers: Schedule
Triggers: Tumbling Window
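Tumbling windows are usually described as fixed-size, contiguous, non-overlapping time intervals. The sketch below only illustrates that idea in plain Python; it does not call Azure Data Factory, and the function name and parameters are assumptions chosen for the example.

```python
from datetime import datetime, timedelta


def tumbling_windows(start, end, size):
    """Return (window_start, window_end) pairs covering [start, end).

    Each window has the same size, and each window starts exactly
    where the previous one ended, so windows never overlap.
    """
    if size <= timedelta(0):
        raise ValueError("size must be a positive duration")

    windows = []
    window_start = start
    # Only emit complete windows; a trailing partial window is left out.
    while window_start + size <= end:
        windows.append((window_start, window_start + size))
        window_start += size
    return windows


# Usage: three one-hour windows, each of which could drive one pipeline run.
for window_start, window_end in tumbling_windows(
    datetime(2020, 2, 6, 0, 0), datetime(2020, 2, 6, 3, 0), timedelta(hours=1)
):
    print(window_start.isoformat(), "to", window_end.isoformat())
```

A schedule-style trigger, by contrast, would only need the wall-clock times at which to fire, without a window start and end attached to each run.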
Triggers: Event Based
Triggers: Now
Monitoring Triggers
DEMO
Let's schedule some pipelines!
Azure Data Architectures
Advanced Analytics on Big Data
https://azure.microsoft.com/en-us/solutions/architecture/advanced-analytics-on-big-data/
Real-time Analytics
https://azure.microsoft.com/en-us/solutions/architecture/real-time-analytics/
Modern Data Warehouse
https://azure.microsoft.com/en-us/solutions/architecture/modern-data-warehouse/
Sources: Cloud, SaaS, On-Premises
Ingest: Azure Data Factory
Store: Azure Data Lake Storage
Prepare: Wrangling Data Flows
Transform: Mapping Data Flows
Serve: Azure Synapse Analytics
Visualize: Power BI
Data Pipeline Orchestration and Monitoring: Azure Data Factory
Sources: Cloud, SaaS, On-Premises
Serve: Azure Synapse Analytics
Visualize: Power BI
Good luck!
@cathrinew
cathrinew.net
hi@cathrinew.net
thank you!

Understanding Azure Data Factory: The What, When, and Why (NIC 2020)