SlideShare a Scribd company logo
Getting Started: Data Factory in Microsoft Fabric (Microsoft Fabric Community Conference 2024)
Getting Started:
Data Factory in
Microsoft Fabric
Shireen Bahadur
Product Manager, Microsoft Azure Data
Cathrine Wilhelmsen
Lead Consultant, Evidi
Tess Kassier
Senior Product Manager, Delphix
Shireen
Bahadur
Product Manager, Azure Data
Microsoft
/shireen-bahadur
Cathrine
Wilhelmsen
Lead Consultant
Evidi
/cathrinewilhelmsen
Tess
Kassier
Senior Product Manager
Delphix
/tessmaggio
Agenda
01
Introduction to
Data Factory
02
Ingest and Transform
using Dataflows
03
Ingest and Orchestrate
using Data Pipelines
04
Lineage and
Monitoring
05
Delphix
Compliance Services
06
Q&A
Introduction to
Data Factory
What is Data Factory in Microsoft Fabric?
No-code / low-code data integration experience with:
• Scale and power of Azure Data Factory
• Ease-of-use of Power Query
• Intelligence of Copilot
The Evolution of Data Factory
2014 2017 2019
2015
2023
Ingest Data Transform Data
Orchestrate Workflows
Ingest Data Transform Data
Orchestrate Workflows
Ingest Data Transform Data
Orchestrate Workflows
Pipelines
Ingest Data
Dataflows / Pipelines
Transform Data
Dataflows
Orchestrate Workflows
Pipelines
Ingest Data
Dataflows / Pipelines
Transform Data
Dataflows
Ingest and
Transform using
Dataflows
What are Dataflows Gen 2?
Self-serve Data preparation
Familiar experience with Power Query + M Language
File, relational, multi-dimensional, SaaS data
experiences
Embedded into Microsoft citizen experiences – Excel,
Power BI, Power Apps & Automate, Dynamics 365
Next Generation of Power BI
Dataflows
Shorter authoring experience
New output Data Destinations
New refresh history and monitoring
Integration with Data pipelines
Components of a Dataflow
1. Get Data (connectors)
2. Queries Pane
3. Diagram View
4. Data Preview pane
5. Query settings pane
Demo:
Ingest and Transform
using Dataflows
Orchestrate Workflows
Pipelines
Ingest Data
Dataflows / Pipelines
Transform Data
Dataflows
Orchestrate Workflows
Pipelines
Ingest Data
Dataflows / Pipelines
Transform Data
Dataflows
Ingest and
Orchestrate using
Data Pipelines
What are Data Pipelines?
Data Pipelines are what
you execute or run
They define workflows:
what to do in which order
What are Activities?
Activities are individual
steps inside pipelines
They can be chained
or run in parallel
Copy Data Activity: Powerful Capabilities
Source Destination
Copy Data Activity: Complex Data
Source
Convert
File Formats
Convert
Data Types
Map
Columns
Flatten
Hierarchies
Split &
Merge Files
Compress &
Decompress
Destination
What are Connections?
Connections define how
to connect and authorize
Choose between built-in
connection to workspace
or 30+ external sources
What are Runs?
Runs are individual
executions of pipelines
Run on-demand or
on a defined schedule
Demo:
Ingest and Orchestrate
using Data Pipelines
Decision Guide for Ingest and Transform
Copy Activity Dataflow
Use cases:
Data Ingestion
Data Conversion
Data Ingestion
Data Transformation
User interface: Wizard, Canvas Power Query
Sources: 30+ connectors 180+ connectors
Destinations: 18+ connectors
Workspace: Lakehouse / Warehouse
External: Azure SQL Database, Azure Data
Explorer, Azure Synapse Analytics
Transformations: Lightweight conversions 300+ transformation functions
Delphix
Compliance
Services
Solving for Sensitive Data in Microsoft Fabric
Enabling analysts to innovate at speed
Power BI
Run Secure
Analytics
Fabric Data Factory
Delphix Data
Masking API
Delphix Data
Discovery API
170+
Data
Sources
UAT
BACKUP
AI/ML
BI / ANALYTICS
QUALITY ASSURANCE
DEVELOPMENT
REPORTING
DWH
INTEGRATION TESTING
SANDBOX
PERF TESTING
DATA SCIENCE
FUNCTIONAL TESTING
DATA LAKE
G D P R
P D P L
L G P D
D P P A
D P A
P O P I A
P I P L / C L S
P I P A
A P P I
A P P S
PRIVACY ACT 2020
C C P A
H I P A A
C O P P A
G L B A
C C P A
Sensitive Data Exposed in Data Factory
170+ Targets
170+ Sources
Sensitive
Production Data
!
Sensitive data is extracted and loaded to target.
Fabric Data Factory
Non-compliant
Data Poses Risks
!
Unprotected
Pipeline / Dataflow
US SSN: CCPA
FIRST NAME: GDPR, LGPD, CCPA.
DISEASE: HIPAA
Analyst gets
access to
sensitive data
Compliant Data Delivered in Data Factory
170+ Targets
170+ Sources
Sensitive
Production Data
!
Compliant
Data
Sensitive data is extracted and loaded to target.
D I S C O V E R Y
AUTOMATED SENSITIVE
DATA DISCOVERY
Fabric Data Factory
M A S K
REFERENTIAL
INTEGRITY
L O A D
Timely access
to quality
compliant data
D I S C O V E R
AUTOMATED SENSITIVE DATA DISCOVERY
â—Ź Data Sampling
â—Ź Metadata Match
â—Ź Pattern Match
â—Ź Data Type
â—Ź List Checks
Employees
SSN
324-23-1920
450-22-3204
529-003-2314
â—Ź Name Match
â—Ź Nine Digit Match
â—Ź No Entries Over/ Under 9 Digits
â—Ź Data Type: VARCHAR
â—Ź Occasional Delimiter
M A S K
REFERENTIAL INTEGRITY
Siloed Production Data Secure Non-Production Data
Employees
last_name
Haas
Silverman
Yang
Employees
last_name
Lee
Rogers
Johnson
Payroll
SSN
291-24-5523
450-22-3204
529-03-2314
Payroll
SSN
324-23-1920
992-35-6523
857-42-4367 Orders
SSN last_name
291-24-5523 Haas
450-22-3204 Yang
857-432-4367 Silverman
Employee_Details
SSN last_name
324-23-1920 Lee
992-35-6523 Johnson
529-003-2314 Rogers
Demo:
Delphix Compliance Services
Questions?
Getting Started with Data
Factory
Speakers: Shireen Bahadur,
Cathrine Wilhelmsen (MVP)
TUESDAY @ 11:30am
Using Azure AI Services
with Data Factory
Speakers: Abhishek Narain, Joroen
Luitwieler
THURSDAY @ 1:30pm
Connecting to the World’s
data using Data Factory
Speakers: Matt Masson, Miguel
Escobar, Jianlei Shen
TUESDAY @ 11:30am
From Data to Decisions:
Leveraging Microsoft 365
with Data Factory
Speakers: Wilson Lee, Karan Shah,
Rishi Girish
TUESDAY @ 3:15pm
Empowering Self-service
BI on SAP Data with
Microsoft Fabric
Speakers: Abhishek Narain, Joroen
Luitwieler
THURSDAY @ 2:45pm
Performance Tuning
Secrets for Data Factory
Speakers: Sid Jayadevan, Mark
Kromer, Matt Masson
WEDNESDAY @ 11:15am
Implement Enterprise Data
Integration Patterns with
Data Factory
Speakers: Abhishek Narain,
Miquella de Boer, Noelle Li
WEDNESDAY @ 1:45pm
Upgrade Pathways and
Best Practices for Data
Factory
Speakers: Mark Kromer, Miguel
Escobar, Mike Carlo (MVP)
THURSDAY @ 11am
Behind the Design:
Crafting Data Factory
Experiences in Fabric
Speakers: Cristin Ford, Arian
Martinez, Vichita Jianjitlert
TUESDAY @ 2pm
Modern Data Integration
with Microsoft Fabric Data
Factory
Speakers: Wee Hyong, Shabnam
Watson (MVP), Penny Zhou
WEDNESDAY @ 8am
Data Factory in Microsoft
Fabric Technical Deep Dive
Speakers: Mohan Sankaran, John
Welch, Erwin de Kreuk (MVP)
WEDNESDAY @ 9:15am
WEDNESDAY
THURSDAY
Customer Stories: Data
Integration
Speakers: Andre Fomin,
Tom Peplow
WEDNESDAY @ 8am
TUESDAY
s
GENERAL AVAILABILITY
PREVIEW
SNEAK PEAK
Modern Get Data – browse
Azure Connections
Dataflow output destinations –
Support for schema changes for
Lakehouse & Azure SQL DB
Incremental Refresh for Dataflows
Data Pipelines access on-
premises data using “On
Premises Data Gateway” (OPDG)
Fast Copy for Dataflows
40 to 80 activity limit in Data
Pipelines
Semantic Model Refresh
CI/CD in Data Pipelines
Cancel Dataflow Refresh
SPN support for VNET Data
Gateway
VNET Data Gateway support with Private Links for Dataflows Gen 2 in Fabric
Microsoft Fabric Community Conference 2024 • Shireen Bahadur,
Cathrine Wilhelmsen & Tess Kassier
Interested in
Connecting with the
Product Group?
Please email us at
Fabcon-DI-Speakers@microsoft.com
for any questions!
Community Lounge Meet Ups
Check Whova for official meetups with
user group leaders, MVPs, Super Users
and more!
Meet Speakers & the Product Group
Check Whova for the full schedule of speaker Q&A and
PG meet & greets in the Community Lounge.
aka.ms/FabricCommunity
Ask and answer questions in
the Fabric Community forum
aka.ms/FabricUserGroups
Find a user group in your area
or to match your interests
Thank you!
Shireen Bahadur
Product Manager, Microsoft Azure Data
Cathrine Wilhelmsen
Lead Consultant, Evidi
Tess Kassier
Senior Product Manager, Delphix

More Related Content

What's hot (20)

PDF
Data Architecture Strategies: Data Architecture for Digital Transformation
DATAVERSITY
 
PPTX
Azure data factory
BizTalk360
 
PDF
Architect’s Open-Source Guide for a Data Mesh Architecture
Databricks
 
PDF
Logical Data Fabric: Architectural Components
Denodo
 
PDF
Time to Talk about Data Mesh
LibbySchulze
 
PDF
Got data?… now what? An introduction to modern data platforms
JamesAnderson599331
 
PDF
Moving to Databricks & Delta
Databricks
 
PDF
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
Dr. Arif Wider
 
PPTX
Databricks Fundamentals
Dalibor Wijas
 
PDF
Data platform architecture
Sudheer Kondla
 
PDF
Five Things to Consider About Data Mesh and Data Governance
DATAVERSITY
 
PDF
[XConf Brasil 2020] Data mesh
ThoughtWorks Brasil
 
PDF
DAS Slides: Data Governance - Combining Data Management with Organizational ...
DATAVERSITY
 
PDF
Webinar Data Mesh - Part 3
Jeffrey T. Pollock
 
PDF
Introduction to Azure Data Lake
Antonios Chatzipavlis
 
PDF
DataMinds 2022 Azure Purview Erwin de Kreuk
Erwin de Kreuk
 
PDF
Azure+Databricks+Course+Slide+Deck+V4.pdf
Chitresh Kaushik
 
PPTX
Data Quality Patterns in the Cloud with Azure Data Factory
Mark Kromer
 
PPTX
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
DataScienceConferenc1
 
PPTX
Collibra Data Citizen '19 - Bridging Data Privacy with Data Governance
BigID Inc
 
Data Architecture Strategies: Data Architecture for Digital Transformation
DATAVERSITY
 
Azure data factory
BizTalk360
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Databricks
 
Logical Data Fabric: Architectural Components
Denodo
 
Time to Talk about Data Mesh
LibbySchulze
 
Got data?… now what? An introduction to modern data platforms
JamesAnderson599331
 
Moving to Databricks & Delta
Databricks
 
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
Dr. Arif Wider
 
Databricks Fundamentals
Dalibor Wijas
 
Data platform architecture
Sudheer Kondla
 
Five Things to Consider About Data Mesh and Data Governance
DATAVERSITY
 
[XConf Brasil 2020] Data mesh
ThoughtWorks Brasil
 
DAS Slides: Data Governance - Combining Data Management with Organizational ...
DATAVERSITY
 
Webinar Data Mesh - Part 3
Jeffrey T. Pollock
 
Introduction to Azure Data Lake
Antonios Chatzipavlis
 
DataMinds 2022 Azure Purview Erwin de Kreuk
Erwin de Kreuk
 
Azure+Databricks+Course+Slide+Deck+V4.pdf
Chitresh Kaushik
 
Data Quality Patterns in the Cloud with Azure Data Factory
Mark Kromer
 
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
DataScienceConferenc1
 
Collibra Data Citizen '19 - Bridging Data Privacy with Data Governance
BigID Inc
 

Similar to Getting Started: Data Factory in Microsoft Fabric (Microsoft Fabric Community Conference 2024) (20)

PDF
Data Factory in Microsoft Fabric (MsBIP #82)
Cathrine Wilhelmsen
 
PDF
Data Integration with Data Factory (Microsoft Fabric Day Oslo 2023)
Cathrine Wilhelmsen
 
PDF
Data Integration using Data Factory in Microsoft Fabric (ESPC Microsoft Fabri...
Cathrine Wilhelmsen
 
PPTX
Intro to Azure Data Factory v1
Eric Bragas
 
PDF
Visually Transform Data in Azure Data Factory or Azure Synapse Analytics (PAS...
Cathrine Wilhelmsen
 
PDF
Comprehensive Guide for Microsoft Fabric to Master Data Analytics
Sparity1
 
PDF
Microsoft FabricPresentationIntroduction
dogma28
 
PDF
Microsoft Fabric Data Platform Next Step
SamValdez10
 
PPTX
Microsoft Fabric Online Training | Microsoft Fabric Training.pptx
TalluriRenuka
 
PDF
Competitive Advantage through Azure Service Fabric Analytics
Microsoft Dynamics
 
PDF
Azure Data Factory Introduction.pdf
MaheshPandit16
 
PPTX
Transform your data with Azure Data factory
Prometix Pty Ltd
 
PPTX
A lap around Azure Data Factory
BizTalk360
 
PDF
Data Culture Series - Keynote & Panel - Reading - 12th May 2015
Jonathan Woodward
 
PDF
Unleash the power of Azure Data Factory
Sergio Zenatti Filho
 
PPTX
Microsoft Fabric Training | Microsoft Fabric Certification Course.pptx
TalluriRenuka
 
PDF
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Denodo
 
PDF
Azure Data Factory v2
Sergio Zenatti Filho
 
PDF
Harnessing the Power of Distributed Processing: Managing Data Across Clouds a...
Safe Software
 
PPTX
What is Microsoft Fabric - a guide by Select Distinct
Select Distinct Limited
 
Data Factory in Microsoft Fabric (MsBIP #82)
Cathrine Wilhelmsen
 
Data Integration with Data Factory (Microsoft Fabric Day Oslo 2023)
Cathrine Wilhelmsen
 
Data Integration using Data Factory in Microsoft Fabric (ESPC Microsoft Fabri...
Cathrine Wilhelmsen
 
Intro to Azure Data Factory v1
Eric Bragas
 
Visually Transform Data in Azure Data Factory or Azure Synapse Analytics (PAS...
Cathrine Wilhelmsen
 
Comprehensive Guide for Microsoft Fabric to Master Data Analytics
Sparity1
 
Microsoft FabricPresentationIntroduction
dogma28
 
Microsoft Fabric Data Platform Next Step
SamValdez10
 
Microsoft Fabric Online Training | Microsoft Fabric Training.pptx
TalluriRenuka
 
Competitive Advantage through Azure Service Fabric Analytics
Microsoft Dynamics
 
Azure Data Factory Introduction.pdf
MaheshPandit16
 
Transform your data with Azure Data factory
Prometix Pty Ltd
 
A lap around Azure Data Factory
BizTalk360
 
Data Culture Series - Keynote & Panel - Reading - 12th May 2015
Jonathan Woodward
 
Unleash the power of Azure Data Factory
Sergio Zenatti Filho
 
Microsoft Fabric Training | Microsoft Fabric Certification Course.pptx
TalluriRenuka
 
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Denodo
 
Azure Data Factory v2
Sergio Zenatti Filho
 
Harnessing the Power of Distributed Processing: Managing Data Across Clouds a...
Safe Software
 
What is Microsoft Fabric - a guide by Select Distinct
Select Distinct Limited
 
Ad

More from Cathrine Wilhelmsen (20)

PDF
Fra utvikler til arkitekt: Skap din egen karrierevei ved ĂĄ utvikle din person...
Cathrine Wilhelmsen
 
PDF
One Year in Fabric: Lessons Learned from Implementing Real-World Projects (PA...
Cathrine Wilhelmsen
 
PDF
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Cathrine Wilhelmsen
 
PDF
Website Analytics in My Pocket using Microsoft Fabric (SQLBits 2024)
Cathrine Wilhelmsen
 
PDF
Choosing between Fabric, Synapse and Databricks (Data Left Unattended 2023)
Cathrine Wilhelmsen
 
PDF
The Battle of the Data Transformation Tools (PASS Data Community Summit 2023)
Cathrine Wilhelmsen
 
PDF
Building an End-to-End Solution in Microsoft Fabric: From Dataverse to Power ...
Cathrine Wilhelmsen
 
PDF
Website Analytics in my Pocket using Microsoft Fabric (AdaCon 2023)
Cathrine Wilhelmsen
 
PDF
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Cathrine Wilhelmsen
 
PDF
Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (D...
Cathrine Wilhelmsen
 
PDF
Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (S...
Cathrine Wilhelmsen
 
PDF
"I can't keep up!" - Turning Discomfort into Personal Growth in a Fast-Paced ...
Cathrine Wilhelmsen
 
PDF
Lessons Learned: Implementing Azure Synapse Analytics in a Rapidly-Changing S...
Cathrine Wilhelmsen
 
PDF
6 Tips for Building Confidence as a Public Speaker (SQLBits 2022)
Cathrine Wilhelmsen
 
PDF
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...
Cathrine Wilhelmsen
 
PDF
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Cathrine Wilhelmsen
 
PDF
Understanding Azure Data Factory: The What, When, and Why (NIC 2020)
Cathrine Wilhelmsen
 
PDF
Azure Data Factory for the SSIS Developer (SentryOne Webinar)
Cathrine Wilhelmsen
 
PDF
Azure Synapse Analytics Teaser (Microsoft TechX Oslo 2019)
Cathrine Wilhelmsen
 
PDF
Lessons Learned: Understanding Azure Data Factory Pricing (Microsoft Ignite 2...
Cathrine Wilhelmsen
 
Fra utvikler til arkitekt: Skap din egen karrierevei ved ĂĄ utvikle din person...
Cathrine Wilhelmsen
 
One Year in Fabric: Lessons Learned from Implementing Real-World Projects (PA...
Cathrine Wilhelmsen
 
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Cathrine Wilhelmsen
 
Website Analytics in My Pocket using Microsoft Fabric (SQLBits 2024)
Cathrine Wilhelmsen
 
Choosing between Fabric, Synapse and Databricks (Data Left Unattended 2023)
Cathrine Wilhelmsen
 
The Battle of the Data Transformation Tools (PASS Data Community Summit 2023)
Cathrine Wilhelmsen
 
Building an End-to-End Solution in Microsoft Fabric: From Dataverse to Power ...
Cathrine Wilhelmsen
 
Website Analytics in my Pocket using Microsoft Fabric (AdaCon 2023)
Cathrine Wilhelmsen
 
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Cathrine Wilhelmsen
 
Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (D...
Cathrine Wilhelmsen
 
Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (S...
Cathrine Wilhelmsen
 
"I can't keep up!" - Turning Discomfort into Personal Growth in a Fast-Paced ...
Cathrine Wilhelmsen
 
Lessons Learned: Implementing Azure Synapse Analytics in a Rapidly-Changing S...
Cathrine Wilhelmsen
 
6 Tips for Building Confidence as a Public Speaker (SQLBits 2022)
Cathrine Wilhelmsen
 
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...
Cathrine Wilhelmsen
 
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Cathrine Wilhelmsen
 
Understanding Azure Data Factory: The What, When, and Why (NIC 2020)
Cathrine Wilhelmsen
 
Azure Data Factory for the SSIS Developer (SentryOne Webinar)
Cathrine Wilhelmsen
 
Azure Synapse Analytics Teaser (Microsoft TechX Oslo 2019)
Cathrine Wilhelmsen
 
Lessons Learned: Understanding Azure Data Factory Pricing (Microsoft Ignite 2...
Cathrine Wilhelmsen
 
Ad

Recently uploaded (20)

PDF
apidays Singapore 2025 - Building a Federated Future, Alex Szomora (GSMA)
apidays
 
PDF
JavaScript - Good or Bad? Tips for Google Tag Manager
📊 Markus Baersch
 
PPTX
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
PPT
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
PDF
Data Science Course Certificate by Sigma Software University
Stepan Kalika
 
PDF
Research Methodology Overview Introduction
ayeshagul29594
 
PDF
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
PDF
Business implication of Artificial Intelligence.pdf
VishalChugh12
 
PDF
apidays Singapore 2025 - Surviving an interconnected world with API governanc...
apidays
 
PDF
apidays Singapore 2025 - How APIs can make - or break - trust in your AI by S...
apidays
 
PPTX
apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehre...
apidays
 
PDF
NIS2 Compliance for MSPs: Roadmap, Benefits & Cybersecurity Trends (2025 Guide)
GRC Kompas
 
PPTX
apidays Helsinki & North 2025 - APIs at Scale: Designing for Alignment, Trust...
apidays
 
PDF
apidays Singapore 2025 - The API Playbook for AI by Shin Wee Chuang (PAND AI)
apidays
 
PPTX
Powerful Uses of Data Analytics You Should Know
subhashenia
 
PPTX
big data eco system fundamentals of data science
arivukarasi
 
PDF
The Best NVIDIA GPUs for LLM Inference in 2025.pdf
Tamanna36
 
PPT
tuberculosiship-2106031cyyfuftufufufivifviviv
AkshaiRam
 
PDF
Development and validation of the Japanese version of the Organizational Matt...
Yoga Tokuyoshi
 
PPTX
apidays Helsinki & North 2025 - From Chaos to Clarity: Designing (AI-Ready) A...
apidays
 
apidays Singapore 2025 - Building a Federated Future, Alex Szomora (GSMA)
apidays
 
JavaScript - Good or Bad? Tips for Google Tag Manager
📊 Markus Baersch
 
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
Data Science Course Certificate by Sigma Software University
Stepan Kalika
 
Research Methodology Overview Introduction
ayeshagul29594
 
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
Business implication of Artificial Intelligence.pdf
VishalChugh12
 
apidays Singapore 2025 - Surviving an interconnected world with API governanc...
apidays
 
apidays Singapore 2025 - How APIs can make - or break - trust in your AI by S...
apidays
 
apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehre...
apidays
 
NIS2 Compliance for MSPs: Roadmap, Benefits & Cybersecurity Trends (2025 Guide)
GRC Kompas
 
apidays Helsinki & North 2025 - APIs at Scale: Designing for Alignment, Trust...
apidays
 
apidays Singapore 2025 - The API Playbook for AI by Shin Wee Chuang (PAND AI)
apidays
 
Powerful Uses of Data Analytics You Should Know
subhashenia
 
big data eco system fundamentals of data science
arivukarasi
 
The Best NVIDIA GPUs for LLM Inference in 2025.pdf
Tamanna36
 
tuberculosiship-2106031cyyfuftufufufivifviviv
AkshaiRam
 
Development and validation of the Japanese version of the Organizational Matt...
Yoga Tokuyoshi
 
apidays Helsinki & North 2025 - From Chaos to Clarity: Designing (AI-Ready) A...
apidays
 

Getting Started: Data Factory in Microsoft Fabric (Microsoft Fabric Community Conference 2024)

  • 2. Getting Started: Data Factory in Microsoft Fabric Shireen Bahadur Product Manager, Microsoft Azure Data Cathrine Wilhelmsen Lead Consultant, Evidi Tess Kassier Senior Product Manager, Delphix
  • 3. Shireen Bahadur Product Manager, Azure Data Microsoft /shireen-bahadur Cathrine Wilhelmsen Lead Consultant Evidi /cathrinewilhelmsen Tess Kassier Senior Product Manager Delphix /tessmaggio
  • 4. Agenda 01 Introduction to Data Factory 02 Ingest and Transform using Dataflows 03 Ingest and Orchestrate using Data Pipelines 04 Lineage and Monitoring 05 Delphix Compliance Services 06 Q&A
  • 6. What is Data Factory in Microsoft Fabric? No-code / low-code data integration experience with: • Scale and power of Azure Data Factory • Ease-of-use of Power Query • Intelligence of Copilot
  • 7. The Evolution of Data Factory 2014 2017 2019 2015 2023
  • 11. Orchestrate Workflows Pipelines Ingest Data Dataflows / Pipelines Transform Data Dataflows
  • 12. Orchestrate Workflows Pipelines Ingest Data Dataflows / Pipelines Transform Data Dataflows
  • 14. What are Dataflows Gen 2? Self-serve Data preparation Familiar experience with Power Query + M Language File, relational, multi-dimensional, SaaS data experiences Embedded into Microsoft citizen experiences – Excel, Power BI, Power Apps & Automate, Dynamics 365 Next Generation of Power BI Dataflows Shorter authoring experience New output Data Destinations New refresh history and monitoring Integration with Data pipelines
  • 15. Components of a Dataflow 1. Get Data (connectors) 2. Queries Pane 3. Diagram View 4. Data Preview pane 5. Query settings pane
  • 17. Orchestrate Workflows Pipelines Ingest Data Dataflows / Pipelines Transform Data Dataflows
  • 18. Orchestrate Workflows Pipelines Ingest Data Dataflows / Pipelines Transform Data Dataflows
  • 20. What are Data Pipelines? Data Pipelines are what you execute or run They define workflows: what to do in which order
  • 21. What are Activities? Activities are individual steps inside pipelines They can be chained or run in parallel
  • 22. Copy Data Activity: Powerful Capabilities Source Destination
  • 23. Copy Data Activity: Complex Data Source Convert File Formats Convert Data Types Map Columns Flatten Hierarchies Split & Merge Files Compress & Decompress Destination
  • 24. What are Connections? Connections define how to connect and authorize Choose between built-in connection to workspace or 30+ external sources
  • 25. What are Runs? Runs are individual executions of pipelines Run on-demand or on a defined schedule
  • 27. Decision Guide for Ingest and Transform Copy Activity Dataflow Use cases: Data Ingestion Data Conversion Data Ingestion Data Transformation User interface: Wizard, Canvas Power Query Sources: 30+ connectors 180+ connectors Destinations: 18+ connectors Workspace: Lakehouse / Warehouse External: Azure SQL Database, Azure Data Explorer, Azure Synapse Analytics Transformations: Lightweight conversions 300+ transformation functions
  • 29. Solving for Sensitive Data in Microsoft Fabric Enabling analysts to innovate at speed Power BI Run Secure Analytics Fabric Data Factory Delphix Data Masking API Delphix Data Discovery API 170+ Data Sources
  • 30. UAT BACKUP AI/ML BI / ANALYTICS QUALITY ASSURANCE DEVELOPMENT REPORTING DWH INTEGRATION TESTING SANDBOX PERF TESTING DATA SCIENCE FUNCTIONAL TESTING DATA LAKE
  • 31. G D P R P D P L L G P D D P P A D P A P O P I A P I P L / C L S P I P A A P P I A P P S PRIVACY ACT 2020 C C P A H I P A A C O P P A G L B A C C P A
  • 32. Sensitive Data Exposed in Data Factory 170+ Targets 170+ Sources Sensitive Production Data ! Sensitive data is extracted and loaded to target. Fabric Data Factory Non-compliant Data Poses Risks ! Unprotected Pipeline / Dataflow US SSN: CCPA FIRST NAME: GDPR, LGPD, CCPA. DISEASE: HIPAA Analyst gets access to sensitive data
  • 33. Compliant Data Delivered in Data Factory 170+ Targets 170+ Sources Sensitive Production Data ! Compliant Data Sensitive data is extracted and loaded to target. D I S C O V E R Y AUTOMATED SENSITIVE DATA DISCOVERY Fabric Data Factory M A S K REFERENTIAL INTEGRITY L O A D Timely access to quality compliant data D I S C O V E R AUTOMATED SENSITIVE DATA DISCOVERY â—Ź Data Sampling â—Ź Metadata Match â—Ź Pattern Match â—Ź Data Type â—Ź List Checks Employees SSN 324-23-1920 450-22-3204 529-003-2314 â—Ź Name Match â—Ź Nine Digit Match â—Ź No Entries Over/ Under 9 Digits â—Ź Data Type: VARCHAR â—Ź Occasional Delimiter M A S K REFERENTIAL INTEGRITY Siloed Production Data Secure Non-Production Data Employees last_name Haas Silverman Yang Employees last_name Lee Rogers Johnson Payroll SSN 291-24-5523 450-22-3204 529-03-2314 Payroll SSN 324-23-1920 992-35-6523 857-42-4367 Orders SSN last_name 291-24-5523 Haas 450-22-3204 Yang 857-432-4367 Silverman Employee_Details SSN last_name 324-23-1920 Lee 992-35-6523 Johnson 529-003-2314 Rogers
  • 36. Getting Started with Data Factory Speakers: Shireen Bahadur, Cathrine Wilhelmsen (MVP) TUESDAY @ 11:30am Using Azure AI Services with Data Factory Speakers: Abhishek Narain, Joroen Luitwieler THURSDAY @ 1:30pm Connecting to the World’s data using Data Factory Speakers: Matt Masson, Miguel Escobar, Jianlei Shen TUESDAY @ 11:30am From Data to Decisions: Leveraging Microsoft 365 with Data Factory Speakers: Wilson Lee, Karan Shah, Rishi Girish TUESDAY @ 3:15pm Empowering Self-service BI on SAP Data with Microsoft Fabric Speakers: Abhishek Narain, Joroen Luitwieler THURSDAY @ 2:45pm Performance Tuning Secrets for Data Factory Speakers: Sid Jayadevan, Mark Kromer, Matt Masson WEDNESDAY @ 11:15am Implement Enterprise Data Integration Patterns with Data Factory Speakers: Abhishek Narain, Miquella de Boer, Noelle Li WEDNESDAY @ 1:45pm Upgrade Pathways and Best Practices for Data Factory Speakers: Mark Kromer, Miguel Escobar, Mike Carlo (MVP) THURSDAY @ 11am Behind the Design: Crafting Data Factory Experiences in Fabric Speakers: Cristin Ford, Arian Martinez, Vichita Jianjitlert TUESDAY @ 2pm Modern Data Integration with Microsoft Fabric Data Factory Speakers: Wee Hyong, Shabnam Watson (MVP), Penny Zhou WEDNESDAY @ 8am Data Factory in Microsoft Fabric Technical Deep Dive Speakers: Mohan Sankaran, John Welch, Erwin de Kreuk (MVP) WEDNESDAY @ 9:15am WEDNESDAY THURSDAY Customer Stories: Data Integration Speakers: Andre Fomin, Tom Peplow WEDNESDAY @ 8am TUESDAY
  • 37. s GENERAL AVAILABILITY PREVIEW SNEAK PEAK Modern Get Data – browse Azure Connections Dataflow output destinations – Support for schema changes for Lakehouse & Azure SQL DB Incremental Refresh for Dataflows Data Pipelines access on- premises data using “On Premises Data Gateway” (OPDG) Fast Copy for Dataflows 40 to 80 activity limit in Data Pipelines Semantic Model Refresh CI/CD in Data Pipelines Cancel Dataflow Refresh SPN support for VNET Data Gateway VNET Data Gateway support with Private Links for Dataflows Gen 2 in Fabric
  • 38. Microsoft Fabric Community Conference 2024 • Shireen Bahadur, Cathrine Wilhelmsen & Tess Kassier Interested in Connecting with the Product Group? Please email us at [email protected] for any questions! Community Lounge Meet Ups Check Whova for official meetups with user group leaders, MVPs, Super Users and more! Meet Speakers & the Product Group Check Whova for the full schedule of speaker Q&A and PG meet & greets in the Community Lounge. aka.ms/FabricCommunity Ask and answer questions in the Fabric Community forum aka.ms/FabricUserGroups Find a user group in your area or to match your interests
  • 39. Thank you! Shireen Bahadur Product Manager, Microsoft Azure Data Cathrine Wilhelmsen Lead Consultant, Evidi Tess Kassier Senior Product Manager, Delphix