Azure Data Factory: Data Wrangling
Power Query in ADF
Updated Public Preview Q1 CY21
What is Data Wrangling?
• Code-free data exploration and data prep
• Operationalize Power Query as an activity by translating M script into ADF data flow script
• Execute Power Query as a pipeline activity on the serverless, scaled-out, ADF-managed Apache Spark engine used by ADF data flows
• Essentially acts as a data-first entry point to building ADF data flows
ADF Data Wrangling Use Cases
• A Data Engineer building an ETL process in ADF uses PQ to explore data through data profiling
• A Business Analyst who uses PQ on the desktop wants to operationalize their M query in a data pipeline that sinks data in the lake
• A Data Engineer needs to prep data for modeling and ETL using a data-first approach, so creates a PQ wrangling activity and adds it to a pipeline
• Trimming strings
• Data type conversions
• Rename columns
• Remove columns
• Value prop: “Data Wrangling in ADF”, not “Power Query lift-and-shift”
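The four prep steps named above (trimming strings, data type conversions, renaming columns, removing columns) can be illustrated outside ADF with a minimal Python sketch. This is not M script and says nothing about how ADF translates or runs a Power Query; the column names (cust_nm, amount, internal_id) and the sample row are hypothetical, chosen only to show what each step does to a record.

```python
def wrangle(rows):
    """Apply four basic prep steps to a list of dict rows (hypothetical columns)."""
    cleaned = []
    for row in rows:
        out = {}
        for name, value in row.items():
            if name == "internal_id":
                continue  # remove column: drop a field the sink does not need
            if isinstance(value, str):
                value = value.strip()  # trim strings: strip surrounding whitespace
            out[name] = value
        out["amount"] = float(out["amount"])  # data type conversion: text to number
        out["customer_name"] = out.pop("cust_nm")  # rename column
        cleaned.append(out)
    return cleaned


sample = [{"cust_nm": "  Contoso ", "amount": " 12.50", "internal_id": "x1"}]
result = wrangle(sample)
print(result)
```

In the Power Query editor the same steps would be recorded as applied steps in the query rather than written by hand.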
ADF Data Wrangling Roadmap – PQ Activity
• Continue to add more M functions to fold into Spark
• Add more native connectors that work in both ADF and Power Query
• Enable VNet in the Power Query Online data wrangling experience in ADF
• Launch PQ activity in Synapse Pipelines
• Enable interactive monitoring similar to Copy and Data Flow
Additional resources
Documentation
List of tutorial videos
Expression language reference
Performance guide
ADF twitter
ADF tech community blog

Azure Data Factory Data Wrangling with Power Query
