SlideShare a Scribd company logo
Productionalizing Models through CI/CD Design with MLflow
Productionalizing Models Through
CI/CD Design and MLflow
Mary Grace Moesta & Pete Tamisin
June 2020
About the Speakers
Peter Tamisin
Technical Lead, Customer Success
● 20+ year career focused on data
analytics and engineering
● Member of Databricks Automation SME
team contributing multiple blogs and
published best practice guides on
CI/CD
● Based in Atlanta, GA enjoys cheering on
the Atlanta Hawks and playing
video/board games with his wife and 4
kids
Mary Grace Moesta
Customer Success Engineer
● Supporting customers in the
retail and CPG space
● Former data scientist with
focused work on customer
experience and brand
acceleration
● Databricks Labs AutoML
contributor
● Based in Detroit, MI, enjoys
running and golfing in her free
time
Agenda
▪ Setting Definition and
Assumptions
▪ Defining MLops
▪ The importance of MLops in a production system
▪ Basics of CI/CD
▪ How MLflow pivots CI/CD basics for ML
▪ An Example of promoting ML
Code and a model as artifacts
▪ Version control
▪ Interfacing with MLflow
▪ Registering the Model
▪ Building DevOps pipeline to trigger production training and
inference runs
Continuous Delivery for Machine Learning
Danilo Sato, Arif Wider, Christoph windheuser
“Continuous Delivery for Machine Learning (CD4ML) is a software
engineering approach in which a cross-functional team produces machine
learning applications based on code, data, and models in small and safe
increments that can be reproduced and reliably released at any time, in short
adaptation cycles.”
Why MLops is Relevant
▪ The data science and machine learning development framework is
traditionally centered around local development
▪ Keeping work scoped to a scientist’s local laptop, code save locally, limited to the amount of compute in local machine, etc.
▪ As data and process complexity grows, so does the number of
integration points
▪ Leveraging tools like Spark means there is more machines to manage, data is stored across various locations, the cost of kicking off a
single run now becomes much more complex and expensive
▪ Machine learning operations allow for development at scale and hands
off execution of production runs
▪ Automated execution enables the business to implement these powerful machine learning solutions with ease
The Basics of Continuous Integration and
Continuous Delivery
▪ Code
▪ Develop code and run tests in a local IDE, Databricks,
Databricks Connect, etc.
▪ Manually run tests
▪ Commit code and tests to a version controlled code branch
▪ Build
▪ Pull together new code + tests
▪ Run automated tests
▪ Build library and non notebook code
▪ Release
▪ Generate Release Artifact
▪ Deploy
▪ Two methods of deployment
▪ Deploying notebooks
▪ Deploying Libraries / release artifacts
▪ Test
▪ Run automated tests
▪ Report results
▪ Operate
▪ Programatically schedule downstream data
engineering, machine learning, and analytics
workloads
Continuous DeliveryContinuous Integration
The Basics of Continuous Integration and
Continuous Delivery with an ML Twist
▪ Code
▪ Develop code and run tests in a local IDE, Databricks,
Databricks Connect, etc.
▪ Manually run tests
▪ Commit code and tests to a version controlled code branch
▪ Build
▪ Pull together new code + tests
▪ Run automated tests
▪ Build library and non notebook code
▪ Release
▪ Generate Release Artifact
▪ Deploy
▪ Two methods of deployment
▪ Deploying notebooks
▪ Deploying Libraries / release artifacts
▪ Test
▪ Run automated tests
▪ Report results
▪ Operate
▪ Programatically schedule downstream data
engineering, machine learning, and analytics
workloads
Continuous DeliveryContinuous Integration
➔ Batching scoring, real time serving, containers,
cloud inference services
➔ Leveraging tools like Jenkins, AzDO, ect to
trigger new model builds in production when
new changes are merged to master
➔ Using MLflow to track experiments, runs,
hyperparameters, code, artifacts, etc.
➔ Notebook / IDE environment to develop on feature
branches using favorite ML tools: sklearn, SparkML,
TF, pytorch, etc.
➔ Training runs at scale with new model features,
hyperparams, etc. implemented
➔ Tracking different model version in production using
Model Registry
➔ Model, entire pipeline, image, code etc. as artifacts
➔ Writing tests for the machine learning code /
feature engineering
How Contributes to Seamless MLOps
Staging Production Archived
Data Scientists Deployment Engineers
v1
v2
v3
Models Tracking
Flavor 2Flavor 1
Model Registry
Custom
Models
In-Line Code
Containers
Batch & Stream
Scoring
Cloud Inference
Services
OSS Serving
Solutions
Serving
Parameters Metrics Artifacts
ModelsMetadata
Code and Version Control
● Pick your favorite version control
○ Github, AzDo, Bitbucket, etc.
● Branch from master for development
○ Hyperparam space search, alternative feature sets, algorithm refinements, etc.
● Tracking development metrics and criteria using MLflow tracking
▪ Tracking artifacts that will be used in
the build / release stages
Controlling Model Flow Through Build and
Release Stages
▪ After new models
have been trained
in the feature
branch:
Parameterizing the
right experiment
path using widgets
Setting the decision
criteria for a best run
Searching through filtered
runs to identify the best_run
and build the model URI to
programmatically reference
later
Controlling Model Flow Through Build and
Release Stages
▪ Once the best_run has been
identified, use Model
Registry to track the flow of
models in and out of
production
▪ Note that the stages defined
in the registry do not directly
translate to environments
▪ The registry also doesn’t span
multiple workspaces, they
map 1:1 to workspace.
Initially
register the
new model
Archive the current
model out of production
Promote to
production stage
Continuous Deployment
▪ Deployment depends on the business problem and the SLA attached
▪ Let’s look at batch inference as an example:
Pipeline is triggered anytime there’s a
change to master branch (release)
Environment variables + secrets that
have been redacted from the script
Continuous Deployment Example: AzDo Pipeline
Defining the instance
OS
Installing Python
Installing Databricks
CLI
Configuring the
Databricks CLI with a
secret token
Copying Code to Hands off Production
Environments
▪ Copy the code from the master branch to a desired location
▪ This could be a ‘hands off’ production environment, isolated within a single workspace, etc.
Building and Spinning Up the Cluster
Specifying the cluster
configuration
Booting up the
cluster
Make the Inference Job with Parameters
▪ Create the jobs using the name
defined as an environment variable
▪ Passing the parameters which specify
the experiment location
▪ Setting job configurations
▪ Increasing number of concurrent runs
allows to run multiple inference jobs at
the same time
Running the Inference Job
▪ Uses the run-now endpoint to run the job via API
▪ The job shows up in the Databricks UI
▪ Can navigate using the GUI or API to get more details about each job run
Wrapping it all Up
▪ MLops is an important piece of the machine learning framework that
enables the business to consume downstream results with ease
▪ The basics of CI/CD can be pivoted to fit the structure of a machine
learning project to help establish the feedback loop from development
to production
▪ can act that the governing body to help regulate and track
the entire lifecycle
▪ Automation tools like Jenkins, Azure DevOps, etc. are applied to
orchestrate the end to end process
Feedback
Your feedback is important to us.
Don’t forget to rate and
review the sessions.

More Related Content

What's hot (20)

PPTX
Apache Spark Components
Girish Khanzode
 
PDF
Introduction to MLflow
Databricks
 
PDF
Managing the Machine Learning Lifecycle with MLOps
Fatih Baltacı
 
PDF
Data pipelines from zero to solid
Lars Albertsson
 
PPTX
MLOps.pptx
AllenPeter7
 
PPTX
データ分析概略
Daiyu Hatakeyama
 
PPTX
Apache spark 소개 및 실습
동현 강
 
PDF
[2D1]Elasticsearch 성능 최적화
NAVER D2
 
PPTX
MLOps - The Assembly Line of ML
Jordan Birdsell
 
PDF
Airflow introduction
Chandler Huang
 
PDF
MLPerf an industry standard benchmark suite for machine learning performance
jemin lee
 
PDF
[부스트캠프 Tech Talk] 진명훈_datasets로 협업하기
CONNECT FOUNDATION
 
PDF
MLFlow: Platform for Complete Machine Learning Lifecycle
Databricks
 
PDF
Vectors are the new JSON in PostgreSQL
Jonathan Katz
 
PDF
MATS stack (MLFlow, Airflow, Tensorflow, Spark) for Cross-system Orchestratio...
Databricks
 
PDF
Intro to Vertex AI, unified MLOps platform for Data Scientists & ML Engineers
Daniel Zivkovic
 
PPTX
Airflow 101
SaarBergerbest
 
PDF
숨겨진 마이크로서비스: 초고속 응답과 고가용성을 위한 캐시 서비스 디자인
VMware Tanzu Korea
 
PDF
PostgreSQL のイケてるテクニック7選
Tomoya Kawanishi
 
PDF
Getting Started with Delta Lake on Databricks
Knoldus Inc.
 
Apache Spark Components
Girish Khanzode
 
Introduction to MLflow
Databricks
 
Managing the Machine Learning Lifecycle with MLOps
Fatih Baltacı
 
Data pipelines from zero to solid
Lars Albertsson
 
MLOps.pptx
AllenPeter7
 
データ分析概略
Daiyu Hatakeyama
 
Apache spark 소개 및 실습
동현 강
 
[2D1]Elasticsearch 성능 최적화
NAVER D2
 
MLOps - The Assembly Line of ML
Jordan Birdsell
 
Airflow introduction
Chandler Huang
 
MLPerf an industry standard benchmark suite for machine learning performance
jemin lee
 
[부스트캠프 Tech Talk] 진명훈_datasets로 협업하기
CONNECT FOUNDATION
 
MLFlow: Platform for Complete Machine Learning Lifecycle
Databricks
 
Vectors are the new JSON in PostgreSQL
Jonathan Katz
 
MATS stack (MLFlow, Airflow, Tensorflow, Spark) for Cross-system Orchestratio...
Databricks
 
Intro to Vertex AI, unified MLOps platform for Data Scientists & ML Engineers
Daniel Zivkovic
 
Airflow 101
SaarBergerbest
 
숨겨진 마이크로서비스: 초고속 응답과 고가용성을 위한 캐시 서비스 디자인
VMware Tanzu Korea
 
PostgreSQL のイケてるテクニック7選
Tomoya Kawanishi
 
Getting Started with Delta Lake on Databricks
Knoldus Inc.
 

Similar to Productionalizing Models through CI/CD Design with MLflow (20)

PDF
Continuous delivery for machine learning
Rajesh Muppalla
 
PPTX
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
DataWorks Summit
 
PDF
Enabling Scalable Data Science Pipeline with Mlflow at Thermo Fisher Scientific
Databricks
 
PDF
Building A Product Assortment Recommendation Engine
Databricks
 
PPTX
[DSC Europe 23] Petar Zecevic - ML in Production on Databricks
DataScienceConferenc1
 
PDF
ALM Tour 2013 - Entregar a tiempo y sin errores
Jose Luis Soria
 
PDF
PASS 2024 - Best Practices for Development on Azure Databricks
Dustin Vannoy
 
PDF
Productionizing Real-time Serving With MLflow
Databricks
 
PDF
Productionizing Machine Learning with a Microservices Architecture
Databricks
 
PPTX
DOES14 - David Ashman, Blackboard Learn - Keep Your Head in the Clouds Tuesda...
DevOps Enterprise Summmit
 
PPTX
DOES14 - David Ashman - Blackboard Learn - Keep Your Head in the Clouds
Gene Kim
 
PPTX
From Data Science to MLOps
Carl W. Handlin
 
PDF
DEVOPS AND MACHINE LEARNING
CodeOps Technologies LLP
 
PPTX
DevOps for Machine Learning overview en-us
eltonrodriguez11
 
PDF
Csharp corner toronto vs2019 post launch 10 apr 2019 nilesh shah
Nilesh Shah
 
PDF
Structured Streaming in Spark
Digital Vidya
 
PDF
Lessons from Building Large-Scale, Multi-Cloud, SaaS Software at Databricks
Databricks
 
PPTX
#spsclt18 vincent biret #spfx #devops
Vincent Biret
 
PDF
MLOPS By Amazon offered and free download
pouyan533
 
PDF
Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...
Sotrender
 
Continuous delivery for machine learning
Rajesh Muppalla
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
DataWorks Summit
 
Enabling Scalable Data Science Pipeline with Mlflow at Thermo Fisher Scientific
Databricks
 
Building A Product Assortment Recommendation Engine
Databricks
 
[DSC Europe 23] Petar Zecevic - ML in Production on Databricks
DataScienceConferenc1
 
ALM Tour 2013 - Entregar a tiempo y sin errores
Jose Luis Soria
 
PASS 2024 - Best Practices for Development on Azure Databricks
Dustin Vannoy
 
Productionizing Real-time Serving With MLflow
Databricks
 
Productionizing Machine Learning with a Microservices Architecture
Databricks
 
DOES14 - David Ashman, Blackboard Learn - Keep Your Head in the Clouds Tuesda...
DevOps Enterprise Summmit
 
DOES14 - David Ashman - Blackboard Learn - Keep Your Head in the Clouds
Gene Kim
 
From Data Science to MLOps
Carl W. Handlin
 
DEVOPS AND MACHINE LEARNING
CodeOps Technologies LLP
 
DevOps for Machine Learning overview en-us
eltonrodriguez11
 
Csharp corner toronto vs2019 post launch 10 apr 2019 nilesh shah
Nilesh Shah
 
Structured Streaming in Spark
Digital Vidya
 
Lessons from Building Large-Scale, Multi-Cloud, SaaS Software at Databricks
Databricks
 
#spsclt18 vincent biret #spfx #devops
Vincent Biret
 
MLOPS By Amazon offered and free download
pouyan533
 
Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...
Sotrender
 
Ad

More from Databricks (20)

PPTX
DW Migration Webinar-March 2022.pptx
Databricks
 
PPTX
Data Lakehouse Symposium | Day 1 | Part 1
Databricks
 
PPT
Data Lakehouse Symposium | Day 1 | Part 2
Databricks
 
PPTX
Data Lakehouse Symposium | Day 2
Databricks
 
PPTX
Data Lakehouse Symposium | Day 4
Databricks
 
PDF
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
Databricks
 
PDF
Democratizing Data Quality Through a Centralized Platform
Databricks
 
PDF
Learn to Use Databricks for Data Science
Databricks
 
PDF
Why APM Is Not the Same As ML Monitoring
Databricks
 
PDF
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
Databricks
 
PDF
Stage Level Scheduling Improving Big Data and AI Integration
Databricks
 
PDF
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Databricks
 
PDF
Scaling your Data Pipelines with Apache Spark on Kubernetes
Databricks
 
PDF
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Databricks
 
PDF
Sawtooth Windows for Feature Aggregations
Databricks
 
PDF
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Databricks
 
PDF
Re-imagine Data Monitoring with whylogs and Spark
Databricks
 
PDF
Raven: End-to-end Optimization of ML Prediction Queries
Databricks
 
PDF
Processing Large Datasets for ADAS Applications using Apache Spark
Databricks
 
PDF
Massive Data Processing in Adobe Using Delta Lake
Databricks
 
DW Migration Webinar-March 2022.pptx
Databricks
 
Data Lakehouse Symposium | Day 1 | Part 1
Databricks
 
Data Lakehouse Symposium | Day 1 | Part 2
Databricks
 
Data Lakehouse Symposium | Day 2
Databricks
 
Data Lakehouse Symposium | Day 4
Databricks
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
Databricks
 
Democratizing Data Quality Through a Centralized Platform
Databricks
 
Learn to Use Databricks for Data Science
Databricks
 
Why APM Is Not the Same As ML Monitoring
Databricks
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
Databricks
 
Stage Level Scheduling Improving Big Data and AI Integration
Databricks
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Databricks
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Databricks
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Databricks
 
Sawtooth Windows for Feature Aggregations
Databricks
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Databricks
 
Re-imagine Data Monitoring with whylogs and Spark
Databricks
 
Raven: End-to-end Optimization of ML Prediction Queries
Databricks
 
Processing Large Datasets for ADAS Applications using Apache Spark
Databricks
 
Massive Data Processing in Adobe Using Delta Lake
Databricks
 
Ad

Recently uploaded (20)

PDF
JavaScript - Good or Bad? Tips for Google Tag Manager
📊 Markus Baersch
 
PDF
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
PPTX
ER_Model_with_Diagrams_Presentation.pptx
dharaadhvaryu1992
 
PPTX
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
PPTX
apidays Helsinki & North 2025 - Running a Successful API Program: Best Practi...
apidays
 
PDF
Web Scraping with Google Gemini 2.0 .pdf
Tamanna
 
PPTX
GenAI-Introduction-to-Copilot-for-Bing-March-2025-FOR-HUB.pptx
cleydsonborges1
 
PDF
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
PPT
deep dive data management sharepoint apps.ppt
novaprofk
 
PDF
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
PDF
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
PPTX
apidays Helsinki & North 2025 - APIs at Scale: Designing for Alignment, Trust...
apidays
 
PPTX
ER_Model_Relationship_in_DBMS_Presentation.pptx
dharaadhvaryu1992
 
PPTX
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
PPTX
recruitment Presentation.pptxhdhshhshshhehh
devraj40467
 
PDF
apidays Helsinki & North 2025 - How (not) to run a Graphql Stewardship Group,...
apidays
 
PDF
apidays Helsinki & North 2025 - Monetizing AI APIs: The New API Economy, Alla...
apidays
 
PDF
Merits and Demerits of DBMS over File System & 3-Tier Architecture in DBMS
MD RIZWAN MOLLA
 
PDF
What does good look like - CRAP Brighton 8 July 2025
Jan Kierzyk
 
PPTX
Numbers of a nation: how we estimate population statistics | Accessible slides
Office for National Statistics
 
JavaScript - Good or Bad? Tips for Google Tag Manager
📊 Markus Baersch
 
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
ER_Model_with_Diagrams_Presentation.pptx
dharaadhvaryu1992
 
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
apidays Helsinki & North 2025 - Running a Successful API Program: Best Practi...
apidays
 
Web Scraping with Google Gemini 2.0 .pdf
Tamanna
 
GenAI-Introduction-to-Copilot-for-Bing-March-2025-FOR-HUB.pptx
cleydsonborges1
 
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
deep dive data management sharepoint apps.ppt
novaprofk
 
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
apidays Helsinki & North 2025 - APIs at Scale: Designing for Alignment, Trust...
apidays
 
ER_Model_Relationship_in_DBMS_Presentation.pptx
dharaadhvaryu1992
 
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
recruitment Presentation.pptxhdhshhshshhehh
devraj40467
 
apidays Helsinki & North 2025 - How (not) to run a Graphql Stewardship Group,...
apidays
 
apidays Helsinki & North 2025 - Monetizing AI APIs: The New API Economy, Alla...
apidays
 
Merits and Demerits of DBMS over File System & 3-Tier Architecture in DBMS
MD RIZWAN MOLLA
 
What does good look like - CRAP Brighton 8 July 2025
Jan Kierzyk
 
Numbers of a nation: how we estimate population statistics | Accessible slides
Office for National Statistics
 

Productionalizing Models through CI/CD Design with MLflow

  • 2. Productionalizing Models Through CI/CD Design and MLflow Mary Grace Moesta & Pete Tamisin June 2020
  • 3. About the Speakers Peter Tamisin Technical Lead, Customer Success ● 20+ year career focused on data analytics and engineering ● Member of Databricks Automation SME team contributing multiple blogs and published best practice guides on CI/CD ● Based in Atlanta, GA enjoys cheering on the Atlanta Hawks and playing video/board games with his wife and 4 kids Mary Grace Moesta Customer Success Engineer ● Supporting customers in the retail and CPG space ● Former data scientist with focused work on customer experience and brand acceleration ● Databricks Labs AutoML contributor ● Based in Detroit, MI, enjoys running and golfing in her free time
  • 4. Agenda ▪ Setting Definition and Assumptions ▪ Defining MLops ▪ The importance of MLops in a production system ▪ Basics of CI/CD ▪ How MLflow pivots CI/CD basics for ML ▪ An Example of promoting ML Code and a model as artifacts ▪ Version control ▪ Interfacing with MLflow ▪ Registering the Model ▪ Building DevOps pipeline to trigger production training and inference runs
  • 5. Continuous Delivery for Machine Learning Danilo Sato, Arif Wider, Christoph windheuser “Continuous Delivery for Machine Learning (CD4ML) is a software engineering approach in which a cross-functional team produces machine learning applications based on code, data, and models in small and safe increments that can be reproduced and reliably released at any time, in short adaptation cycles.”
  • 6. Why MLops is Relevant ▪ The data science and machine learning development framework is traditionally centered around local development ▪ Keeping work scoped to a scientist’s local laptop, code save locally, limited to the amount of compute in local machine, etc. ▪ As data and process complexity grows, so does the number of integration points ▪ Leveraging tools like Spark means there is more machines to manage, data is stored across various locations, the cost of kicking off a single run now becomes much more complex and expensive ▪ Machine learning operations allow for development at scale and hands off execution of production runs ▪ Automated execution enables the business to implement these powerful machine learning solutions with ease
  • 7. The Basics of Continuous Integration and Continuous Delivery ▪ Code ▪ Develop code and run tests in a local IDE, Databricks, Databricks Connect, etc. ▪ Manually run tests ▪ Commit code and tests to a version controlled code branch ▪ Build ▪ Pull together new code + tests ▪ Run automated tests ▪ Build library and non notebook code ▪ Release ▪ Generate Release Artifact ▪ Deploy ▪ Two methods of deployment ▪ Deploying notebooks ▪ Deploying Libraries / release artifacts ▪ Test ▪ Run automated tests ▪ Report results ▪ Operate ▪ Programatically schedule downstream data engineering, machine learning, and analytics workloads Continuous DeliveryContinuous Integration
  • 8. The Basics of Continuous Integration and Continuous Delivery with an ML Twist ▪ Code ▪ Develop code and run tests in a local IDE, Databricks, Databricks Connect, etc. ▪ Manually run tests ▪ Commit code and tests to a version controlled code branch ▪ Build ▪ Pull together new code + tests ▪ Run automated tests ▪ Build library and non notebook code ▪ Release ▪ Generate Release Artifact ▪ Deploy ▪ Two methods of deployment ▪ Deploying notebooks ▪ Deploying Libraries / release artifacts ▪ Test ▪ Run automated tests ▪ Report results ▪ Operate ▪ Programatically schedule downstream data engineering, machine learning, and analytics workloads Continuous DeliveryContinuous Integration ➔ Batching scoring, real time serving, containers, cloud inference services ➔ Leveraging tools like Jenkins, AzDO, ect to trigger new model builds in production when new changes are merged to master ➔ Using MLflow to track experiments, runs, hyperparameters, code, artifacts, etc. ➔ Notebook / IDE environment to develop on feature branches using favorite ML tools: sklearn, SparkML, TF, pytorch, etc. ➔ Training runs at scale with new model features, hyperparams, etc. implemented ➔ Tracking different model version in production using Model Registry ➔ Model, entire pipeline, image, code etc. as artifacts ➔ Writing tests for the machine learning code / feature engineering
  • 9. How Contributes to Seamless MLOps Staging Production Archived Data Scientists Deployment Engineers v1 v2 v3 Models Tracking Flavor 2Flavor 1 Model Registry Custom Models In-Line Code Containers Batch & Stream Scoring Cloud Inference Services OSS Serving Solutions Serving Parameters Metrics Artifacts ModelsMetadata
  • 10. Code and Version Control ● Pick your favorite version control ○ Github, AzDo, Bitbucket, etc. ● Branch from master for development ○ Hyperparam space search, alternative feature sets, algorithm refinements, etc. ● Tracking development metrics and criteria using MLflow tracking ▪ Tracking artifacts that will be used in the build / release stages
  • 11. Controlling Model Flow Through Build and Release Stages ▪ After new models have been trained in the feature branch: Parameterizing the right experiment path using widgets Setting the decision criteria for a best run Searching through filtered runs to identify the best_run and build the model URI to programmatically reference later
  • 12. Controlling Model Flow Through Build and Release Stages ▪ Once the best_run has been identified, use Model Registry to track the flow of models in and out of production ▪ Note that the stages defined in the registry do not directly translate to environments ▪ The registry also doesn’t span multiple workspaces, they map 1:1 to workspace. Initially register the new model Archive the current model out of production Promote to production stage
  • 13. Continuous Deployment ▪ Deployment depends on the business problem and the SLA attached ▪ Let’s look at batch inference as an example: Pipeline is triggered anytime there’s a change to master branch (release) Environment variables + secrets that have been redacted from the script
  • 14. Continuous Deployment Example: AzDo Pipeline Defining the instance OS Installing Python Installing Databricks CLI Configuring the Databricks CLI with a secret token
  • 15. Copying Code to Hands off Production Environments ▪ Copy the code from the master branch to a desired location ▪ This could be a ‘hands off’ production environment, isolated within a single workspace, etc.
  • 16. Building and Spinning Up the Cluster Specifying the cluster configuration Booting up the cluster
  • 17. Make the Inference Job with Parameters ▪ Create the jobs using the name defined as an environment variable ▪ Passing the parameters which specify the experiment location ▪ Setting job configurations ▪ Increasing number of concurrent runs allows to run multiple inference jobs at the same time
  • 18. Running the Inference Job ▪ Uses the run-now endpoint to run the job via API ▪ The job shows up in the Databricks UI ▪ Can navigate using the GUI or API to get more details about each job run
  • 19. Wrapping it all Up ▪ MLops is an important piece of the machine learning framework that enables the business to consume downstream results with ease ▪ The basics of CI/CD can be pivoted to fit the structure of a machine learning project to help establish the feedback loop from development to production ▪ can act that the governing body to help regulate and track the entire lifecycle ▪ Automation tools like Jenkins, Azure DevOps, etc. are applied to orchestrate the end to end process
  • 20. Feedback Your feedback is important to us. Don’t forget to rate and review the sessions.