SlideShare a Scribd company logo
Copyright © 2018, edureka and/or its affiliates. All rights reserved.
f
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Offerings Of This Session
01 Why Azure Data Factory?
Microsoft Azure
Data Factory
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Offerings Of This Session
01 Why Azure Data Factory?
02 What Is Azure Data Factory?
Microsoft Azure
Data Factory
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Offerings Of This Session
01 Why Azure Data Factory?
02 What Is Azure Data Factory?
03 Azure Data Factory Concepts
Microsoft Azure
Data Factory
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Offerings Of This Session
01 Why Azure Data Factory?
02 What Is Azure Data Factory?
03 Azure Data Factory Concepts
04 What is Data Lake? Microsoft Azure
Data Factory
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Offerings Of This Session
01 Why Azure Data Factory?
02 What Is Azure Data Factory?
03 Azure Data Factory Concepts
04 What is Data Lake?
05 Data Lake Concepts
Microsoft Azure
Data Factory
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Offerings Of This Session
01 Why Azure Data Factory?
02 What Is Azure Data Factory?
03 Azure Data Factory Concepts
04 What is Data Lake?
05 Data Lake Concepts
06 Data Lake Vs Data Warehouse
Microsoft Azure
Data Factory
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Offerings Of This Session
01 Why Azure Data Factory?
02 What Is Azure Data Factory?
03 Azure Data Factory Concepts
04 What is Data Lake?
05 Data Lake Concepts
06 Data Lake Vs Data Warehouse
07 Demo
Microsoft Azure
Data Factory
Copyright © 2018, edureka and/or its affiliates. All rights reserved.
Why Azure Data Factory
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Why Data Factory?
Modern Data
handling requires you
to move from on
premise DB to Cloud
DW
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Why Data Factory?
Modern Data
handling requires you
to move from on
premise DB to Cloud
DW
This data needs
processing and goes
through a series of
steps, making the
process tedious
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Why Data Factory?
Modern Data
handling requires you
to move from on
premise DB to Cloud
DW
This data needs
processing and goes
through a series of
steps, making the
process tedious
Data Factory helps
you automate this
process and thus
serve the cause
Copyright © 2018, edureka and/or its affiliates. All rights reserved.
What Is Data Factory?
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
What Is Data Factory?
It is a cloud based integration service that allows to create data driven workflows in the cloud for
orchestrating and automating data movement and data transformation.
Data Factory
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
What Is Data Factory?
It is a cloud based integration service that allows to create data driven workflows in the cloud for
orchestrating and automating data movement and data transformation.
Data Factory
Using Azure Data Factory, you can create and schedule
data-driven workflows (called pipelines) that can ingest
data from disparate data stores.
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
What Is Data Factory?
It is a cloud based integration service that allows to create data driven workflows in the cloud for
orchestrating and automating data movement and data transformation.
Data Factory
It can process and transform the data by using compute
services such as Azure HDInsight Hadoop, Spark, Azure
Data Lake Analytics, and Azure Machine Learning.
Using Azure Data Factory, you can create and schedule
data-driven workflows (called pipelines) that can ingest
data from disparate data stores.
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
What Is Data Factory?
How does it work?
The pipelines (data-driven workflows) in Azure Data Factory typically perform the following four steps:
Connect & Collect
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
What Is Data Factory?
How does it work?
The pipelines (data-driven workflows) in Azure Data Factory typically perform the following four steps:
Connect & Collect Transform & Enrich
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
What Is Data Factory?
How does it work?
The pipelines (data-driven workflows) in Azure Data Factory typically perform the following four steps:
Connect & Collect Transform & Enrich Publish
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
What Is Data Factory?
How does it work?
The pipelines (data-driven workflows) in Azure Data Factory typically perform the following four steps:
Connect & Collect Transform & Enrich Publish Monitor
Copyright © 2018, edureka and/or its affiliates. All rights reserved.
Data Factory Concepts
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Data Factory Concepts
Pipeline
A pipeline is a
logical grouping of
activities that
performs a unit of
work
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Data Factory Concepts
Pipeline
A pipeline is a
logical grouping of
activities that
performs a unit of
work
Datasets
Datasets
represent data
structures within
the data stores
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Data Factory Concepts
Pipeline
A pipeline is a
logical grouping of
activities that
performs a unit of
work
Activity
Activities
represent
processing step in
a pipeline
Datasets
Datasets
represent data
structures within
the data stores
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Data Factory Concepts
Pipeline
A pipeline is a
logical grouping of
activities that
performs a unit of
work
Activity
Activities
represent
processing step in
a pipeline
Datasets
Datasets
represent data
structures within
the data stores
Linked Services
Information
needed to
connect to
external sources
Copyright © 2018, edureka and/or its affiliates. All rights reserved.
What Is Data Lake?
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
What Is Data Lake?
It is an enterprise wide hyperscale repository for Big Data Analytics workloads. Azure Data Lake holds
data of any size, type and allows you to do operational and exploratory analytics
Copyright © 2018, edureka and/or its affiliates. All rights reserved.
Data Lake Concepts
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Data Lake Concepts
Analytics Storage
HDInsight
Azure Data StoreAzure Data
Lake
Azure Data
Lake
Components
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Analytics on data of any size
All users productive on day one
Ready for your enterprise
Data Lake: Key
Things To Remember
Data Lake Concepts
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Structured Data
Semi Structured Data
Unstructured Data
Data Lake: Types Of
Data Stored
Data Lake Concepts
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Extents & Vertices
Supports parallel Reads and Writes
Supports Replication of data
Data Lake: How Is
Data Stored
Data Lake Concepts
Copyright © 2018, edureka and/or its affiliates. All rights reserved.
Data Lake Vs Data Warehouse
Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training
Data Lake Vs Data Warehouse
Complimentary to DW
Detailed Data
Schema on Read
One language to process Data of any
format
May be sourced to Data Lake
Filtered Summarized, Refined Data
Schema on Write
Processes SQL Complaint Data
Copyright © 2018, edureka and/or its affiliates. All rights reserved.
Demo- Move Data From SQL DB To Blog
Storage
Popularity

More Related Content

What's hot (20)

PPTX
Azure data factory
David Giard
 
PDF
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
Cathrine Wilhelmsen
 
PPTX
Azure Data Factory Data Flow
Mark Kromer
 
PPTX
Azure Data Factory for Azure Data Week
Mark Kromer
 
PPTX
ADF Demo_ppt.pptx
vamsytaurus
 
PPTX
Intro to Azure Data Factory v1
Eric Bragas
 
PDF
Azure Data Factory v2
inovex GmbH
 
PDF
Azure Active Directory | Microsoft Azure Tutorial for Beginners | Azure 70-53...
Edureka!
 
PPTX
Azure Migrate
Mustafa
 
PDF
Adf presentation
Kaunas Java User Group
 
PPTX
NOVA SQL User Group - Azure Synapse Analytics Overview - May 2020
Timothy McAliley
 
PPTX
Azure data factory
BizTalk360
 
PPTX
Azure Data Factory ETL Patterns in the Cloud
Mark Kromer
 
PDF
Introduction to Azure Data Factory
Slava Kokaev
 
PPTX
Microsoft Azure Technical Overview
gjuljo
 
PPTX
Azure DataBricks for Data Engineering by Eugene Polonichko
Dimko Zhluktenko
 
PDF
Snowflake for Data Engineering
Harald Erb
 
PPTX
Azure Synapse Analytics Overview (r1)
James Serra
 
PDF
Migrate to Microsoft Azure with Confidence
David J Rosenthal
 
PPTX
Deep Dive into Azure Data Factory v2
Eric Bragas
 
Azure data factory
David Giard
 
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
Cathrine Wilhelmsen
 
Azure Data Factory Data Flow
Mark Kromer
 
Azure Data Factory for Azure Data Week
Mark Kromer
 
ADF Demo_ppt.pptx
vamsytaurus
 
Intro to Azure Data Factory v1
Eric Bragas
 
Azure Data Factory v2
inovex GmbH
 
Azure Active Directory | Microsoft Azure Tutorial for Beginners | Azure 70-53...
Edureka!
 
Azure Migrate
Mustafa
 
Adf presentation
Kaunas Java User Group
 
NOVA SQL User Group - Azure Synapse Analytics Overview - May 2020
Timothy McAliley
 
Azure data factory
BizTalk360
 
Azure Data Factory ETL Patterns in the Cloud
Mark Kromer
 
Introduction to Azure Data Factory
Slava Kokaev
 
Microsoft Azure Technical Overview
gjuljo
 
Azure DataBricks for Data Engineering by Eugene Polonichko
Dimko Zhluktenko
 
Snowflake for Data Engineering
Harald Erb
 
Azure Synapse Analytics Overview (r1)
James Serra
 
Migrate to Microsoft Azure with Confidence
David J Rosenthal
 
Deep Dive into Azure Data Factory v2
Eric Bragas
 

Similar to Azure Data Factory | Moving On-Premise Data to Azure Cloud | Microsoft Azure Training | Edureka (20)

PDF
Azure Data Engineer Online Training | Microsoft Azure Data Engineer
eshwarvisualpath
 
PPTX
Transform your data with Azure Data factory
Prometix Pty Ltd
 
PDF
Azure Data Factory Interview Questions PDF By ScholarHat
Scholarhat
 
PPTX
Azure datafactory
Dimko Zhluktenko
 
PDF
Azure Data Factory in Hyderabad - overview
version IT
 
PPTX
A lap around Azure Data Factory
BizTalk360
 
PDF
Azure Data Engineer Training In Hyderabad | Azure Data Engineer Training
eshwarvisualpath
 
PPTX
Designing big data analytics solutions on azure
Mohamed Tawfik
 
PPTX
Microsoft Azure Big Data Analytics
Mark Kromer
 
PPTX
Next Generation of Data Integration with Azure Data Factory by Tom Kerkhove
Codit
 
PPTX
Next Generation Data Integration with Azure Data Factory
Tom Kerkhove
 
PPTX
Big Data Analytics in the Cloud with Microsoft Azure
Mark Kromer
 
PPTX
Azure Data Engineer Course | Azure Data Engineer Training Hyderabad.pptx
sivavisualpath
 
PPTX
Best Azure Data Engineer Training - Best Data Engineer Course in Hyderabad.pptx
eshwarvisualpath
 
PPTX
ADF Mapping Data Flows Level 300
Mark Kromer
 
PPTX
Modern ETL: Azure Data Factory, Data Lake, and SQL Database
Eric Bragas
 
PPTX
Azure Data Engineer Training Hyderabad - Azure Data Engineer Online Training....
eshwarvisualpath
 
DOCX
What are the core components of Azure Data Engineer courses.docx
kzayra69
 
PDF
Azure Data Engineer Course | Azure Data Engineer Trainin
Accentfuture
 
PPTX
Azure Data Engineer Training In Hyderabad | Microsoft Azure
eshwarvisualpath
 
Azure Data Engineer Online Training | Microsoft Azure Data Engineer
eshwarvisualpath
 
Transform your data with Azure Data factory
Prometix Pty Ltd
 
Azure Data Factory Interview Questions PDF By ScholarHat
Scholarhat
 
Azure datafactory
Dimko Zhluktenko
 
Azure Data Factory in Hyderabad - overview
version IT
 
A lap around Azure Data Factory
BizTalk360
 
Azure Data Engineer Training In Hyderabad | Azure Data Engineer Training
eshwarvisualpath
 
Designing big data analytics solutions on azure
Mohamed Tawfik
 
Microsoft Azure Big Data Analytics
Mark Kromer
 
Next Generation of Data Integration with Azure Data Factory by Tom Kerkhove
Codit
 
Next Generation Data Integration with Azure Data Factory
Tom Kerkhove
 
Big Data Analytics in the Cloud with Microsoft Azure
Mark Kromer
 
Azure Data Engineer Course | Azure Data Engineer Training Hyderabad.pptx
sivavisualpath
 
Best Azure Data Engineer Training - Best Data Engineer Course in Hyderabad.pptx
eshwarvisualpath
 
ADF Mapping Data Flows Level 300
Mark Kromer
 
Modern ETL: Azure Data Factory, Data Lake, and SQL Database
Eric Bragas
 
Azure Data Engineer Training Hyderabad - Azure Data Engineer Online Training....
eshwarvisualpath
 
What are the core components of Azure Data Engineer courses.docx
kzayra69
 
Azure Data Engineer Course | Azure Data Engineer Trainin
Accentfuture
 
Azure Data Engineer Training In Hyderabad | Microsoft Azure
eshwarvisualpath
 
Ad

More from Edureka! (20)

PDF
What to learn during the 21 days Lockdown | Edureka
Edureka!
 
PDF
Top 10 Dying Programming Languages in 2020 | Edureka
Edureka!
 
PDF
Top 5 Trending Business Intelligence Tools | Edureka
Edureka!
 
PDF
Tableau Tutorial for Data Science | Edureka
Edureka!
 
PDF
Python Programming Tutorial | Edureka
Edureka!
 
PDF
Top 5 PMP Certifications | Edureka
Edureka!
 
PDF
Top Maven Interview Questions in 2020 | Edureka
Edureka!
 
PDF
Linux Mint Tutorial | Edureka
Edureka!
 
PDF
How to Deploy Java Web App in AWS| Edureka
Edureka!
 
PDF
Importance of Digital Marketing | Edureka
Edureka!
 
PDF
RPA in 2020 | Edureka
Edureka!
 
PDF
Email Notifications in Jenkins | Edureka
Edureka!
 
PDF
EA Algorithm in Machine Learning | Edureka
Edureka!
 
PDF
Cognitive AI Tutorial | Edureka
Edureka!
 
PDF
AWS Cloud Practitioner Tutorial | Edureka
Edureka!
 
PDF
Blue Prism Top Interview Questions | Edureka
Edureka!
 
PDF
Big Data on AWS Tutorial | Edureka
Edureka!
 
PDF
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
Edureka!
 
PDF
Kubernetes Installation on Ubuntu | Edureka
Edureka!
 
PDF
Introduction to DevOps | Edureka
Edureka!
 
What to learn during the 21 days Lockdown | Edureka
Edureka!
 
Top 10 Dying Programming Languages in 2020 | Edureka
Edureka!
 
Top 5 Trending Business Intelligence Tools | Edureka
Edureka!
 
Tableau Tutorial for Data Science | Edureka
Edureka!
 
Python Programming Tutorial | Edureka
Edureka!
 
Top 5 PMP Certifications | Edureka
Edureka!
 
Top Maven Interview Questions in 2020 | Edureka
Edureka!
 
Linux Mint Tutorial | Edureka
Edureka!
 
How to Deploy Java Web App in AWS| Edureka
Edureka!
 
Importance of Digital Marketing | Edureka
Edureka!
 
RPA in 2020 | Edureka
Edureka!
 
Email Notifications in Jenkins | Edureka
Edureka!
 
EA Algorithm in Machine Learning | Edureka
Edureka!
 
Cognitive AI Tutorial | Edureka
Edureka!
 
AWS Cloud Practitioner Tutorial | Edureka
Edureka!
 
Blue Prism Top Interview Questions | Edureka
Edureka!
 
Big Data on AWS Tutorial | Edureka
Edureka!
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
Edureka!
 
Kubernetes Installation on Ubuntu | Edureka
Edureka!
 
Introduction to DevOps | Edureka
Edureka!
 
Ad

Recently uploaded (20)

PPTX
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
PDF
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
PDF
CIFDAQ Market Insights for July 7th 2025
CIFDAQ
 
PDF
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
PDF
Exolore The Essential AI Tools in 2025.pdf
Srinivasan M
 
PDF
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
PDF
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
PDF
IoT-Powered Industrial Transformation – Smart Manufacturing to Connected Heal...
Rejig Digital
 
PDF
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
PDF
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
Safe Software
 
PPTX
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
PPTX
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
PDF
Blockchain Transactions Explained For Everyone
CIFDAQ
 
PDF
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
PPTX
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PDF
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
PPTX
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
CIFDAQ Market Insights for July 7th 2025
CIFDAQ
 
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
Exolore The Essential AI Tools in 2025.pdf
Srinivasan M
 
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
IoT-Powered Industrial Transformation – Smart Manufacturing to Connected Heal...
Rejig Digital
 
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
Safe Software
 
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
Blockchain Transactions Explained For Everyone
CIFDAQ
 
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 

Azure Data Factory | Moving On-Premise Data to Azure Cloud | Microsoft Azure Training | Edureka

  • 1. Copyright © 2018, edureka and/or its affiliates. All rights reserved. f
  • 2. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Offerings Of This Session 01 Why Azure Data Factory? Microsoft Azure Data Factory
  • 3. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Offerings Of This Session 01 Why Azure Data Factory? 02 What Is Azure Data Factory? Microsoft Azure Data Factory
  • 4. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Offerings Of This Session 01 Why Azure Data Factory? 02 What Is Azure Data Factory? 03 Azure Data Factory Concepts Microsoft Azure Data Factory
  • 5. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Offerings Of This Session 01 Why Azure Data Factory? 02 What Is Azure Data Factory? 03 Azure Data Factory Concepts 04 What is Data Lake? Microsoft Azure Data Factory
  • 6. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Offerings Of This Session 01 Why Azure Data Factory? 02 What Is Azure Data Factory? 03 Azure Data Factory Concepts 04 What is Data Lake? 05 Data Lake Concepts Microsoft Azure Data Factory
  • 7. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Offerings Of This Session 01 Why Azure Data Factory? 02 What Is Azure Data Factory? 03 Azure Data Factory Concepts 04 What is Data Lake? 05 Data Lake Concepts 06 Data Lake Vs Data Warehouse Microsoft Azure Data Factory
  • 8. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Offerings Of This Session 01 Why Azure Data Factory? 02 What Is Azure Data Factory? 03 Azure Data Factory Concepts 04 What is Data Lake? 05 Data Lake Concepts 06 Data Lake Vs Data Warehouse 07 Demo Microsoft Azure Data Factory
  • 9. Copyright © 2018, edureka and/or its affiliates. All rights reserved. Why Azure Data Factory
  • 10. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Why Data Factory? Modern Data handling requires you to move from on premise DB to Cloud DW
  • 11. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Why Data Factory? Modern Data handling requires you to move from on premise DB to Cloud DW This data needs processing and goes through a series of steps, making the process tedious
  • 12. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Why Data Factory? Modern Data handling requires you to move from on premise DB to Cloud DW This data needs processing and goes through a series of steps, making the process tedious Data Factory helps you automate this process and thus serve the cause
  • 13. Copyright © 2018, edureka and/or its affiliates. All rights reserved. What Is Data Factory?
  • 14. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training What Is Data Factory? It is a cloud based integration service that allows to create data driven workflows in the cloud for orchestrating and automating data movement and data transformation. Data Factory
  • 15. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training What Is Data Factory? It is a cloud based integration service that allows to create data driven workflows in the cloud for orchestrating and automating data movement and data transformation. Data Factory Using Azure Data Factory, you can create and schedule data-driven workflows (called pipelines) that can ingest data from disparate data stores.
  • 16. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training What Is Data Factory? It is a cloud based integration service that allows to create data driven workflows in the cloud for orchestrating and automating data movement and data transformation. Data Factory It can process and transform the data by using compute services such as Azure HDInsight Hadoop, Spark, Azure Data Lake Analytics, and Azure Machine Learning. Using Azure Data Factory, you can create and schedule data-driven workflows (called pipelines) that can ingest data from disparate data stores.
  • 17. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training What Is Data Factory? How does it work? The pipelines (data-driven workflows) in Azure Data Factory typically perform the following four steps: Connect & Collect
  • 18. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training What Is Data Factory? How does it work? The pipelines (data-driven workflows) in Azure Data Factory typically perform the following four steps: Connect & Collect Transform & Enrich
  • 19. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training What Is Data Factory? How does it work? The pipelines (data-driven workflows) in Azure Data Factory typically perform the following four steps: Connect & Collect Transform & Enrich Publish
  • 20. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training What Is Data Factory? How does it work? The pipelines (data-driven workflows) in Azure Data Factory typically perform the following four steps: Connect & Collect Transform & Enrich Publish Monitor
  • 21. Copyright © 2018, edureka and/or its affiliates. All rights reserved. Data Factory Concepts
  • 22. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Data Factory Concepts Pipeline A pipeline is a logical grouping of activities that performs a unit of work
  • 23. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Data Factory Concepts Pipeline A pipeline is a logical grouping of activities that performs a unit of work Datasets Datasets represent data structures within the data stores
  • 24. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Data Factory Concepts Pipeline A pipeline is a logical grouping of activities that performs a unit of work Activity Activities represent processing step in a pipeline Datasets Datasets represent data structures within the data stores
  • 25. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Data Factory Concepts Pipeline A pipeline is a logical grouping of activities that performs a unit of work Activity Activities represent processing step in a pipeline Datasets Datasets represent data structures within the data stores Linked Services Information needed to connect to external sources
  • 26. Copyright © 2018, edureka and/or its affiliates. All rights reserved. What Is Data Lake?
  • 27. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training What Is Data Lake? It is an enterprise wide hyperscale repository for Big Data Analytics workloads. Azure Data Lake holds data of any size, type and allows you to do operational and exploratory analytics
  • 28. Copyright © 2018, edureka and/or its affiliates. All rights reserved. Data Lake Concepts
  • 29. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Data Lake Concepts Analytics Storage HDInsight Azure Data StoreAzure Data Lake Azure Data Lake Components
  • 30. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Analytics on data of any size All users productive on day one Ready for your enterprise Data Lake: Key Things To Remember Data Lake Concepts
  • 31. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Structured Data Semi Structured Data Unstructured Data Data Lake: Types Of Data Stored Data Lake Concepts
  • 32. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Extents & Vertices Supports parallel Reads and Writes Supports Replication of data Data Lake: How Is Data Stored Data Lake Concepts
  • 33. Copyright © 2018, edureka and/or its affiliates. All rights reserved. Data Lake Vs Data Warehouse
  • 34. Microsoft Azure Certification Training www.edureka.co/microsoft-azure-training Data Lake Vs Data Warehouse Complimentary to DW Detailed Data Schema on Read One language to process Data of any format May be sourced to Data Lake Filtered Summarized, Refined Data Schema on Write Processes SQL Complaint Data
  • 35. Copyright © 2018, edureka and/or its affiliates. All rights reserved. Demo- Move Data From SQL DB To Blog Storage