SlideShare a Scribd company logo
Automate your
Machine Learning
Ajit Ananthram
Principal Consultant, FTS Data & AI
Microsoft Data Wranglers Meetup - Dec 2019 (Sydney)
Machine Learning Lifecycle
Data pre-processing
ML algorithm(s) selection
Training
Testing
Tuning
Hosting
ML Algorithm Selection
Traditional ML - Challenges
Repetitive & time-
consuming
Resource-intensive Requires deep
statistical knowledge
Automated Machine Learning (AutoML)
Automation of time-
consuming processes
Incorporation of
best practices
Democratisation of
machine learning
Demo –
Titanic Dataset
Objective:
Predict if a passenger survived
Titanic’s maiden voyage based
on known parameters
Reference: https://www.kaggle.com/c/titanic/
Feature Definition Key
survival Survival 0 = No, 1 = Yes
pclass Ticket class 1 = 1st, 2 = 2nd, 3 = 3rd
sex Sex
Age Age in years
sibsp # of siblings / spouses
aboard the Titanic
parch # of parents / children
aboard the Titanic
ticket Ticket number
fare Passenger fare
cabin Cabin number
embarked Port of Embarkation C = Cherbourg, Q =
Queenstown, S =
Southampton
Data Pre-processing
• SQL
• Python
• R
• Etc.
Languages
• ‘Unknown’ added for missing values
• Age converted to a range
• Fare converted to a range
Steps taken in Titanic Dataset
Automated Machine Learning in
Microsoft Azure
Demo
Iterations
• Runs consist of multiple
combinations of ML
algorithms & hyperparameters
• Stops when exit criteria
reached
• Deploy best model as web
service
Measurements
Metric Purpose Equation
Accuracy Indication of success-rate for true positives & true negatives (Tp + Tn) / (Tp + Tn + Fp + Fn)
Precision Indication of success-rate for true positives Tp / (Tp + Fp)
Recall Indication of impact of false negatives Tp / (Tp + Fn)
(Tn) (Fp)
(Tp)(Fn)
Interpretability
• ‘Explainability’ of ML output
critical for success
• Explain globally (all data) or
locally (specific point)
• Run on compute target
Hosting
• Test in an Azure Container Instance
(ACI) container
• Productionise using an Azure
Kubernetes Services (AKS) cluster
• Consume REST web-service in
applications
Drift Detection
• Data drift causes model
performance degradation
• Detect drift on timeseries data
• Set-up alerts in Application
Insights
Reference - https://docs.microsoft.com/en-us/azure/machine-learning/service/how-to-monitor-datasets
AutoML –
Code-based
Experience
MLOps
Reference - https://github.com/Microsoft/MLOps
• Integrate Azure Machine
Learning with Azure DevOps
(AML extension)
• Enable collaboration between
Data Scientists and other
teams
• Set-up CI/CD pipelines using
an effecting branching
strategy
Conclusions
• AutoML brings data science activities to a wider audience
• It reduces repetitive processing time thereby freeing up time for
higher-level activities
• Microsoft Azure Machine Learning is constantly evolving

More Related Content

What's hot (20)

PDF
Hamburg Data Science Meetup - MLOps with a Feature Store
Moritz Meister
 
PDF
MLflow and Azure Machine Learning—The Power Couple for ML Lifecycle Management
Databricks
 
PPTX
Blind spots in big data erez koren @ forter
Ido Shilon
 
PDF
The A-Z of Data: Introduction to MLOps
DataPhoenix
 
PDF
ML-Ops: Philosophy, Best-Practices and Tools
Jorge Davila-Chacon
 
PPTX
Managing your ML lifecycle with Azure Databricks and Azure ML
Parashar Shah
 
PDF
Managing the Machine Learning Lifecycle with MLflow
Databricks
 
PDF
Moving a Fraud-Fighting Random Forest from scikit-learn to Spark with MLlib, ...
Databricks
 
PPTX
Nasscom ml ops webinar
Sameer Mahajan
 
PDF
Consolidating MLOps at One of Europe’s Biggest Airports
Databricks
 
PDF
Advanced MLflow: Multi-Step Workflows, Hyperparameter Tuning and Integrating ...
Databricks
 
PDF
Ml ops intro session
Avinash Patil
 
PDF
Lessons Learned from Using Spark for Evaluating Road Detection at BMW Autonom...
Databricks
 
PPTX
AzureML TechTalk
Udaya Kumar
 
PPTX
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
Provectus
 
PDF
Vertex AI - Unified ML Platform for the entire AI workflow on Google Cloud
Márton Kodok
 
PDF
Battling Model Decay with Deep Learning and Gamification
Databricks
 
PPTX
MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
Databricks
 
PDF
Multi runtime serving pipelines for machine learning
Stepan Pushkarev
 
PDF
A Microservices Framework for Real-Time Model Scoring Using Structured Stream...
Databricks
 
Hamburg Data Science Meetup - MLOps with a Feature Store
Moritz Meister
 
MLflow and Azure Machine Learning—The Power Couple for ML Lifecycle Management
Databricks
 
Blind spots in big data erez koren @ forter
Ido Shilon
 
The A-Z of Data: Introduction to MLOps
DataPhoenix
 
ML-Ops: Philosophy, Best-Practices and Tools
Jorge Davila-Chacon
 
Managing your ML lifecycle with Azure Databricks and Azure ML
Parashar Shah
 
Managing the Machine Learning Lifecycle with MLflow
Databricks
 
Moving a Fraud-Fighting Random Forest from scikit-learn to Spark with MLlib, ...
Databricks
 
Nasscom ml ops webinar
Sameer Mahajan
 
Consolidating MLOps at One of Europe’s Biggest Airports
Databricks
 
Advanced MLflow: Multi-Step Workflows, Hyperparameter Tuning and Integrating ...
Databricks
 
Ml ops intro session
Avinash Patil
 
Lessons Learned from Using Spark for Evaluating Road Detection at BMW Autonom...
Databricks
 
AzureML TechTalk
Udaya Kumar
 
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
Provectus
 
Vertex AI - Unified ML Platform for the entire AI workflow on Google Cloud
Márton Kodok
 
Battling Model Decay with Deep Learning and Gamification
Databricks
 
MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
Databricks
 
Multi runtime serving pipelines for machine learning
Stepan Pushkarev
 
A Microservices Framework for Real-Time Model Scoring Using Structured Stream...
Databricks
 

Similar to Automate your Machine Learning (20)

PDF
The Data Science Process - Do we need it and how to apply?
Ivo Andreev
 
PDF
Prepare your data for machine learning
Ivo Andreev
 
PDF
The Machine Learning Workflow with Azure
Ivo Andreev
 
PDF
The Power of Auto ML and How Does it Work
Ivo Andreev
 
PPTX
Machine Learning with Azure
Barbara Fusinska
 
PPTX
AzureML – zero to hero
Govind Kanshi
 
PPTX
Automated machine learning - Global AI night 2019
Marco Zamana
 
PPTX
Azure Machine Learning and its real-world use cases
Michaela Murray
 
PPTX
Azure Machine Learning Challenge_Speakers Presentation.pptx
DrSatwinderSingh3
 
PPTX
Getting Started with Azure AutoML
Vivek Raja P S
 
PDF
Azure Machine Learning
Mostafa
 
PDF
Machine learning for IoT - unpacking the blackbox
Ivo Andreev
 
PPTX
Machine learning
Saravanan Subburayal
 
PPTX
Azure machine learning tech mela
Yogendra Tamang
 
PPTX
Predicting the Future as a Service with Azure ML and R
Barbara Fusinska
 
PPTX
PREDICT THE FUTURE , MACHINE LEARNING & BIG DATA
DotNetCampus
 
PPTX
Net campus2015 antimomusone
DotNetCampus
 
PDF
201906 04 Overview of Automated ML June 2019
Mark Tabladillo
 
PDF
Azure Machine Learning and ML on Premises
Ivo Andreev
 
PPTX
Azure Machine Learning Dotnet Campus 2015
antimo musone
 
The Data Science Process - Do we need it and how to apply?
Ivo Andreev
 
Prepare your data for machine learning
Ivo Andreev
 
The Machine Learning Workflow with Azure
Ivo Andreev
 
The Power of Auto ML and How Does it Work
Ivo Andreev
 
Machine Learning with Azure
Barbara Fusinska
 
AzureML – zero to hero
Govind Kanshi
 
Automated machine learning - Global AI night 2019
Marco Zamana
 
Azure Machine Learning and its real-world use cases
Michaela Murray
 
Azure Machine Learning Challenge_Speakers Presentation.pptx
DrSatwinderSingh3
 
Getting Started with Azure AutoML
Vivek Raja P S
 
Azure Machine Learning
Mostafa
 
Machine learning for IoT - unpacking the blackbox
Ivo Andreev
 
Machine learning
Saravanan Subburayal
 
Azure machine learning tech mela
Yogendra Tamang
 
Predicting the Future as a Service with Azure ML and R
Barbara Fusinska
 
PREDICT THE FUTURE , MACHINE LEARNING & BIG DATA
DotNetCampus
 
Net campus2015 antimomusone
DotNetCampus
 
201906 04 Overview of Automated ML June 2019
Mark Tabladillo
 
Azure Machine Learning and ML on Premises
Ivo Andreev
 
Azure Machine Learning Dotnet Campus 2015
antimo musone
 
Ad

Recently uploaded (20)

PPTX
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
PDF
apidays Singapore 2025 - How APIs can make - or break - trust in your AI by S...
apidays
 
PPTX
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
PDF
1750162332_Snapshot-of-Indias-oil-Gas-data-May-2025.pdf
sandeep718278
 
PPTX
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
PPTX
What Is Data Integration and Transformation?
subhashenia
 
PPTX
Feb 2021 Ransomware Recovery presentation.pptx
enginsayin1
 
PPTX
Listify-Intelligent-Voice-to-Catalog-Agent.pptx
nareshkottees
 
PPT
tuberculosiship-2106031cyyfuftufufufivifviviv
AkshaiRam
 
PDF
JavaScript - Good or Bad? Tips for Google Tag Manager
📊 Markus Baersch
 
PDF
apidays Singapore 2025 - Streaming Lakehouse with Kafka, Flink and Iceberg by...
apidays
 
PDF
Using AI/ML for Space Biology Research
VICTOR MAESTRE RAMIREZ
 
PDF
Optimizing Large Language Models with vLLM and Related Tools.pdf
Tamanna36
 
PPTX
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
PPTX
apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)
apidays
 
PDF
Technical-Report-GPS_GIS_RS-for-MSF-finalv2.pdf
KPycho
 
PDF
Data Science Course Certificate by Sigma Software University
Stepan Kalika
 
PPTX
Powerful Uses of Data Analytics You Should Know
subhashenia
 
PDF
apidays Singapore 2025 - Building a Federated Future, Alex Szomora (GSMA)
apidays
 
PPTX
How to Add Columns and Rows in an R Data Frame
subhashenia
 
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
apidays Singapore 2025 - How APIs can make - or break - trust in your AI by S...
apidays
 
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
1750162332_Snapshot-of-Indias-oil-Gas-data-May-2025.pdf
sandeep718278
 
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
What Is Data Integration and Transformation?
subhashenia
 
Feb 2021 Ransomware Recovery presentation.pptx
enginsayin1
 
Listify-Intelligent-Voice-to-Catalog-Agent.pptx
nareshkottees
 
tuberculosiship-2106031cyyfuftufufufivifviviv
AkshaiRam
 
JavaScript - Good or Bad? Tips for Google Tag Manager
📊 Markus Baersch
 
apidays Singapore 2025 - Streaming Lakehouse with Kafka, Flink and Iceberg by...
apidays
 
Using AI/ML for Space Biology Research
VICTOR MAESTRE RAMIREZ
 
Optimizing Large Language Models with vLLM and Related Tools.pdf
Tamanna36
 
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)
apidays
 
Technical-Report-GPS_GIS_RS-for-MSF-finalv2.pdf
KPycho
 
Data Science Course Certificate by Sigma Software University
Stepan Kalika
 
Powerful Uses of Data Analytics You Should Know
subhashenia
 
apidays Singapore 2025 - Building a Federated Future, Alex Szomora (GSMA)
apidays
 
How to Add Columns and Rows in an R Data Frame
subhashenia
 
Ad

Automate your Machine Learning

  • 1. Automate your Machine Learning Ajit Ananthram Principal Consultant, FTS Data & AI Microsoft Data Wranglers Meetup - Dec 2019 (Sydney)
  • 2. Machine Learning Lifecycle Data pre-processing ML algorithm(s) selection Training Testing Tuning Hosting
  • 4. Traditional ML - Challenges Repetitive & time- consuming Resource-intensive Requires deep statistical knowledge
  • 5. Automated Machine Learning (AutoML) Automation of time- consuming processes Incorporation of best practices Democratisation of machine learning
  • 6. Demo – Titanic Dataset Objective: Predict if a passenger survived Titanic’s maiden voyage based on known parameters Reference: https://www.kaggle.com/c/titanic/ Feature Definition Key survival Survival 0 = No, 1 = Yes pclass Ticket class 1 = 1st, 2 = 2nd, 3 = 3rd sex Sex Age Age in years sibsp # of siblings / spouses aboard the Titanic parch # of parents / children aboard the Titanic ticket Ticket number fare Passenger fare cabin Cabin number embarked Port of Embarkation C = Cherbourg, Q = Queenstown, S = Southampton
  • 7. Data Pre-processing • SQL • Python • R • Etc. Languages • ‘Unknown’ added for missing values • Age converted to a range • Fare converted to a range Steps taken in Titanic Dataset
  • 8. Automated Machine Learning in Microsoft Azure Demo
  • 9. Iterations • Runs consist of multiple combinations of ML algorithms & hyperparameters • Stops when exit criteria reached • Deploy best model as web service
  • 10. Measurements Metric Purpose Equation Accuracy Indication of success-rate for true positives & true negatives (Tp + Tn) / (Tp + Tn + Fp + Fn) Precision Indication of success-rate for true positives Tp / (Tp + Fp) Recall Indication of impact of false negatives Tp / (Tp + Fn) (Tn) (Fp) (Tp)(Fn)
  • 11. Interpretability • ‘Explainability’ of ML output critical for success • Explain globally (all data) or locally (specific point) • Run on compute target
  • 12. Hosting • Test in an Azure Container Instance (ACI) container • Productionise using an Azure Kubernetes Services (AKS) cluster • Consume REST web-service in applications
  • 13. Drift Detection • Data drift causes model performance degradation • Detect drift on timeseries data • Set-up alerts in Application Insights Reference - https://docs.microsoft.com/en-us/azure/machine-learning/service/how-to-monitor-datasets
  • 15. MLOps Reference - https://github.com/Microsoft/MLOps • Integrate Azure Machine Learning with Azure DevOps (AML extension) • Enable collaboration between Data Scientists and other teams • Set-up CI/CD pipelines using an effecting branching strategy
  • 16. Conclusions • AutoML brings data science activities to a wider audience • It reduces repetitive processing time thereby freeing up time for higher-level activities • Microsoft Azure Machine Learning is constantly evolving

Editor's Notes

  • #2: Intro Poll Objectives
  • #9: Run 21
  • #14: Dataset monitors
  • #15: https://ml.azure.com/fileexplorer?wsid=/subscriptions/b5f23f19-6ff1-41aa-9846-de40165003d7/resourcegroups/MSMeetup_Ajit_rg/workspaces/aml_ws&activeFilePath=Users/Ajit.Ananthram/tutorials/regression-automated-ml.ipynb