Azure Data Factory V2
The Data Flows
Your expectations and what we will cover;
• The Abstract - “Mapping Data Flows is the fantastic new feature of Azure Data Factory, which
combines a visual designer with the full power of Databricks to deliver a robust and hugely
scalable data flow pipeline, like a constantly evolving integration services cloud service. In this
hands-on session we'll design and build a data transformation in the data flow designer using
the new Data Factory flow user interface, and talk about the underlying architectural
components.”
• We will start with a high-level introduction to Azure Data Factory
• Then we’ll discuss Data Flows
• Then let us build something!
About me
• Thomas Sykes MCT, Azure Certified, MCSE
• Senior Consultant for Quorum based in Edinburgh, Scotland
• Working with SQL Server since version 7.0
• Now working ‘in the cloud’
• On twitter @sqltomato and use my notepad at sqltomato.com
About you
How many of you
have used SQL
Server Integration
Services (SSIS)?
How many of you
have used Azure?
Have you looked
into or used Azure
Data Factory?
What is Azure Data Factory?
• Azure Data Factory is a cloud-based data integration
service that allows you to create data-driven workflows
in the cloud for orchestrating and automating data
movement and data transformation
https://docs.microsoft.com/en-gb/azure/data-factory/introduction
What is the Data Flow?
• It is a visual designer native to Microsoft Azure that provides a
robust and scalable data flow pipeline, like a constantly
evolving integration services cloud service
• It transforms Azure Data Factory from a Data Movement
tool to a full Extract, Transform and Load tool with a
graphical interface
Azure Data Factory Data Flows ADFDF
• When typing this ADFDF acronym
• Looks like I’ve fallen asleep on the keyboard
• It’s almost meant for our standard Dvorak QWERTY
keyboard
QWERTY keyboard - https://en.wikipedia.org/wiki/Dvorak_Simplified_Keyboard#Original_Dvorak_layout
Pipelines
Linked Services
Data Flows (Preview)
Azure Data Factory Data Flows ADFDF
Pipelines
• “… pipeline is a logical grouping of activities that performs a
unit of work …”
• Within a pipeline we essentially have control flow and
data flows
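As a rough illustration of "a logical grouping of activities", the sketch below models a pipeline as a plain Python dictionary shaped like the JSON that Data Factory keeps for a pipeline (a `name`, plus `properties.activities` where each activity may list `dependsOn`). The pipeline and activity names are hypothetical, and `run_order` is only a toy to show the control-flow idea; it is not how the service schedules work.

```python
# Toy model of a Data Factory pipeline: a named group of activities.
# All names below (PostTownPipeline, CopyCustomerFile, ...) are hypothetical.
pipeline = {
    "name": "PostTownPipeline",
    "properties": {
        "activities": [
            {"name": "CopyCustomerFile", "type": "Copy", "dependsOn": []},
            {
                "name": "FixPostTowns",
                "type": "ExecuteDataFlow",
                "dependsOn": [
                    {
                        "activity": "CopyCustomerFile",
                        "dependencyConditions": ["Succeeded"],
                    }
                ],
            },
        ]
    },
}


def run_order(pipeline_def):
    """Return activity names in an order that respects dependsOn."""
    activities = pipeline_def["properties"]["activities"]
    done, order = set(), []
    while len(order) < len(activities):
        progressed = False
        for act in activities:
            deps = {d["activity"] for d in act.get("dependsOn", [])}
            if act["name"] not in done and deps <= done:
                done.add(act["name"])
                order.append(act["name"])
                progressed = True
        if not progressed:
            # Nothing could run: a dependency is circular or undefined.
            raise ValueError("circular or missing dependency")
    return order


print(run_order(pipeline))
```

The copy activity stands in for control flow and data movement, while the data flow activity is where the transformation happens.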
Linked Services
• “… Linked services are much like connection strings …”
• They can represent data stores (such as Azure Blob Storage
or Azure SQL Database) or a compute resource (such as the
HDInsightHive activity)
https://docs.microsoft.com/en-gb/azure/data-factory/introduction
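To make the "connection string" comparison concrete, here is a minimal sketch of two linked services written as Python dictionaries in the general shape of Data Factory's JSON definitions. The service names and the placeholder connection strings are assumptions for illustration only; real definitions would normally keep secrets in a vault rather than inline.

```python
# Hypothetical linked service definitions, shaped like Data Factory JSON.
# The connection strings are placeholders, not working values.
blob_linked_service = {
    "name": "CustomerBlobStorage",
    "properties": {
        "type": "AzureBlobStorage",
        "typeProperties": {"connectionString": "<storage-connection-string>"},
    },
}

sql_linked_service = {
    "name": "PostDistrictDatabase",
    "properties": {
        "type": "AzureSqlDatabase",
        "typeProperties": {"connectionString": "<sql-connection-string>"},
    },
}


def describe(linked_service):
    """Summarise a linked service as 'name -> type'."""
    return f'{linked_service["name"]} -> {linked_service["properties"]["type"]}'


for service in (blob_linked_service, sql_linked_service):
    print(describe(service))
```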
Data Flows
• “… Data Flows allow data engineers to develop graphical
data transformation logic without writing code. The resulting
data flows are executed as activities within Azure Data
Factory Pipelines using scaled-out Azure Databricks clusters
…”
• Similar to SQL Server Integration Services, the native Azure
Data Flows boast a graphical ‘no code’ interface with a rich
array of connectors
• Being actively developed
https://docs.microsoft.com/en-us/azure/data-factory/data-flow-expression-functions
Building a simple data flow – the problem
• A simple real world problem
• A courier required the post town to match the postcode;
otherwise its system would reject the packages
• Each postcode (e.g. EH11 4EP) has a post district (EH11), which has an
associated post town (EDINBURGH)
• Some of the data now has more than one entry; to keep this
simple I’ve used the first entry and used EH and G only
Post district data - https://en.wikipedia.org/wiki/List_of_postcode_districts_in_the_United_Kingdom
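The postcode rule above can be sketched in a few lines of Python. This is not the data flow expression used in the demo, just an illustration of the logic under one assumption: the post district is everything before the final three characters of the postcode (the inward code). The function names and the two-row lookup are hypothetical sample data.

```python
# Sample lookup only; the real post district table covers the EH and G areas.
POST_TOWNS = {"EH11": "EDINBURGH", "G1": "GLASGOW"}


def post_district(postcode: str) -> str:
    """Derive the post district, e.g. 'EH11 4EP' -> 'EH11'."""
    compact = "".join(postcode.split()).upper()
    if len(compact) < 5:
        raise ValueError(f"postcode too short: {postcode!r}")
    # The last three characters are the inward code; the rest is the district.
    return compact[:-3]


def expected_post_town(postcode: str):
    """Look up the post town for a postcode, or None if the district is unknown."""
    return POST_TOWNS.get(post_district(postcode))


print(post_district("EH11 4EP"))
print(expected_post_town("EH11 4EP"))
```

Normalising case and whitespace first means 'eh114ep' and 'EH11 4EP' resolve to the same district.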
Building a simple data flow - prerequisites
• Azure SQL Server and Database
• Azure Data Factory
• Azure Storage Blob
• Microsoft Azure Storage Explorer
• Post District Data – EH and G data loaded into database
• ‘Customer’ Mailing Data – Some fictitious customer data
Building a simple data flow – concepts
• Flat File data source stored in Azure Blob Storage
• Azure SQL Database – Azure PaaS Database
• Simple expression
• INNER JOIN
• Output to a ‘sink’
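The concepts listed above (flat file source, lookup table, simple expression, INNER JOIN, sink) can be mimicked end to end in plain Python. This is only a local sketch of the idea and does not use Data Factory at all; the customer rows, column names and the `run_flow` function are invented for illustration, and an in-memory list stands in for the sink.

```python
import csv
import io

# Stand-in for the flat file in Blob Storage (fictitious customers).
CUSTOMER_FILE = """name,post_town,postcode
A Customer,GLASGOW,EH11 4EP
B Customer,GLASGOW,G1 1AA
C Customer,LONDON,SW1A 1AA
"""

# Stand-in for the post district table in the SQL database.
POST_DISTRICTS = [
    {"post_district": "EH11", "post_town": "EDINBURGH"},
    {"post_district": "G1", "post_town": "GLASGOW"},
]


def run_flow(source_text, districts):
    """Source -> derive post district -> inner join -> sink (a list)."""
    towns = {d["post_district"]: d["post_town"] for d in districts}
    sink = []
    for row in csv.DictReader(io.StringIO(source_text)):
        # Simple expression: drop spaces, upper-case, remove the inward code.
        district = "".join(row["postcode"].split()).upper()[:-3]
        # INNER JOIN: rows without a matching district are dropped.
        if district in towns:
            sink.append(
                {
                    "name": row["name"],
                    "postcode": row["postcode"],
                    "post_town": towns[district],
                }
            )
    return sink


for record in run_flow(CUSTOMER_FILE, POST_DISTRICTS):
    print(record)
```

Because the join is an inner join, the customer whose district is missing from the lookup never reaches the sink, which is worth keeping in mind when only the EH and G areas are loaded.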
“This Page left Intentionally Blank” – Demo!
Building a simple data flow – How it looks
Database, Debug, Runtime and Costs
Want to build this yourself?
What will you need?
• Azure Subscription
• Microsoft Azure Storage Explorer
• Post District Data
• ‘Customer’ File
• All details at sqltomato.com blog post Data Flows
https://azure.microsoft.com/en-gb/free/
https://azure.microsoft.com/en-gb/features/storage-explorer/
https://en.wikipedia.org/wiki/List_of_postcode_districts_in_the_United_Kingdom
Azure Data Factory V2; The Data Flows
Thank you for attending, please complete feedback and visit our sponsors!
