DagsHub
Solving MLOps
from first principles
November ‘21, Dean Pleban
What we’ll cover
1. First principles thinking and mental models – a brief introduction
2. MLOps buyer’s remorse
3. Assumptions about the problem
4. Assumptions about ideal solutions
5. Example 1: Data Versioning
6. Generalizing to a framework
– CONFIDENTIAL –
About Me
Dean Pleban
Building tools for ML teamwork
Strongly believe in open source
Follow me:
@DeanPlbn
DeanPleban
Definitions
Mental models First principles thinking
Remember “that” statistic…
The ML world is changing
– CONFIDENTIAL –
OF TEAMS HAVE MODELS
IN PRODUCTION
0%
10%
20%
30%
40%
50%
60%
70%
0 1-10 10-100 Over 100
# of models in production
80%
Not just GAFAM…
THE NEXT CHALLENGE IS
SCALING
FROM 1 TO 10
(OR 10 TO 100) (Poll with over 2000 votes)
MLOps Fatigue – Too many tools, manually synchronized
– CONFIDENTIAL –
MLOPS TOOLS
TO CHOOSE FROM &
INTEGRATE WITH
280
The signs
Everything is
manual
Analysis
Paralysis
All or nothing Building
everything in-
house
Minimal Assumptions about the PROBLEM
You are not Google
/ Facebook
MLOps is still in
early days
Save time / future
proof / production
ready tradeoff
Minimal Assumptions about the SOLUTION
Problem vs
Feature focus
Hard part starts
when the first
model goes to
production
Data scientists !=
developers and
how this affects
tooling
Building on OSS
makes sense for
most cases
Example 1: Data Versioning
I want to version my data
My data is regularly changing and I want to
revert back to an older version for disaster
recovery / governance
Step 1: Define the problem
Step 1: Define the problem
Revert in case
of bug
Compare
different
versions
Knowing
which data is
used where
Add/modify
data without
breaking
Step 1: Define the problem
• Do you actually suffer from “all the above”?
• Prioritizing is important, separating must-have and nice-to-
have
Example 1: Data Versioning
The type of
data you work
with
The type of
data changes
you expect
What are the
organizational
constraints
Who am I?
Step 2: Define the problem parameters
Step 2: Define the problem parameters
• Flexibility to anything is tempting, but answering each
question differently will lead to very different tooling, so being
specific is important
• Organizational constraints are specifically critical, since they
are many times the largest limitations on the tools to use.
This also ties into modularity. E.g. does your org only work
with Azure cloud tools?
• This can also be the step where we define a “user story” or
workflow that includes this problem – e.g. are we going to
version the DB directly, or just the outputs of our queries?
Example 1: Data Versioning
Step 3: Google the problem
Step 3: Google the problem
• Specifically, budget a reasonable amount of time (at least 2-3
hours) to research existing solutions
• Now that you’ve defined the problem, and not just features, search
for those (as well as experimenting with problem parameters), this
will give you more tools, that prioritize different problem aspects
• Build out an info page so that other people in the org can review
and add inputs
• You will probably learn that you were searching for the wrong
keywords
• Read blogs and forum posts and see what TERMS people are
using, and search again
• Ask friends, use Reddit as a tool to discover keywords – describe
your problem and people will recommend the tools and
categories you need.
Step 3: Google the problem
Reddit
example
Googling
examples
Example of a tool
research output
Recommend
ed blogs
Example 1: Data Versioning
Pre-technical
evaluation
Operating
principles
“Hello World” Kick the tires –
mechanically
Step 4: Evaluate solutions
Step 4: Evaluate solutions
• Is there a hosted solution?
• How much does it cost?
• If I go for a hosted solution, how easy will it be to bring it in-
house in the future, or customize it to my needs
• How easy is it to get out of them
• How easy is it to get out of them if they prove less useful
Step 4: Evaluate solutions
Comparing 2 data
versioning tools
from a “face value”
perspective
Looking at the
operating
principles of DVC
Get started
tutorial
Try to add a
dataset with
10K images
Example 1: Data Versioning
Start simple – 1
project, 1 user
Define criteria for
success, or don’t
Review and
extrapolate
Step 5: Integrate
The 5 step process
1. Define the problem
2. Define the problem parameters
3. Google the problem
4. Evaluate solutions
5. Integrate
Thank You!

More Related Content

PPTX
ONE-SIZE DOESN'T FIT ALL - EFFECTIVELY (RE)EVALUATE A DATA SOLUTION FOR YOUR ...
PPTX
Lies Enterprise Architects Tell - Data Day Texas 2018 Keynote
PPTX
HOW TO OPTIMIZE NON-CODING TIME, ORI KEREN, LinearB
PPTX
The Devops Handbook
PDF
DevOps Beyond the Buzzwords: What it Means to Embrace the DevOps Lifestyle
PDF
RecSysOps: Best Practices for Operating a Large-Scale Recommender System
PPTX
DevOps: A Value Proposition
PPTX
2016 State of DevOps Report Webinar
ONE-SIZE DOESN'T FIT ALL - EFFECTIVELY (RE)EVALUATE A DATA SOLUTION FOR YOUR ...
Lies Enterprise Architects Tell - Data Day Texas 2018 Keynote
HOW TO OPTIMIZE NON-CODING TIME, ORI KEREN, LinearB
The Devops Handbook
DevOps Beyond the Buzzwords: What it Means to Embrace the DevOps Lifestyle
RecSysOps: Best Practices for Operating a Large-Scale Recommender System
DevOps: A Value Proposition
2016 State of DevOps Report Webinar

What's hot (20)

PPTX
Top Lessons Learned From The DevOps Handbook
PDF
CampDevOps keynote - DevOps: Using 'Lean' to eliminate Bottlenecks
PDF
DOES 2016 Sciencing the Crap Out of DevOps
PDF
What we learned from three years sciencing the crap out of devops
PDF
Continuous Delivery - the missing parts - Paul Stack
PDF
Drupal and Devops , the Survey Results
PPTX
Nf final chef-lisa-metrics-2015-ss
PPTX
DOES16 San Francisco - David Blank-Edelman - Lessons Learned from a Parallel ...
PPTX
One Terrible Day at Google, and How It Made Us Better
PPTX
Making disaster routine
PDF
DevOps Beyond the Buzzwords: Culture, Tools, & Straight Talk
PPTX
Why Everyone Needs DevOps Now: 15 Year Study Of High Performing Technology Orgs
PPTX
Five Ways Automation Has Increased Application Deployment and Changed Culture
PPTX
Tools Won't Fix Your Broken DevOps
PDF
DevOPs Transformation Workshop
PDF
The 7 Habits of Effective Data Driven Companies
PPTX
DOES15 - Randy Shoup - Ten (Hard-Won) Lessons of the DevOps Transition
PDF
What I learned from 5 years of sciencing the crap out of DevOps
PPTX
DevOps: The Key to IT Performance
PDF
IoT to Cloud the DevOps Way
Top Lessons Learned From The DevOps Handbook
CampDevOps keynote - DevOps: Using 'Lean' to eliminate Bottlenecks
DOES 2016 Sciencing the Crap Out of DevOps
What we learned from three years sciencing the crap out of devops
Continuous Delivery - the missing parts - Paul Stack
Drupal and Devops , the Survey Results
Nf final chef-lisa-metrics-2015-ss
DOES16 San Francisco - David Blank-Edelman - Lessons Learned from a Parallel ...
One Terrible Day at Google, and How It Made Us Better
Making disaster routine
DevOps Beyond the Buzzwords: Culture, Tools, & Straight Talk
Why Everyone Needs DevOps Now: 15 Year Study Of High Performing Technology Orgs
Five Ways Automation Has Increased Application Deployment and Changed Culture
Tools Won't Fix Your Broken DevOps
DevOPs Transformation Workshop
The 7 Habits of Effective Data Driven Companies
DOES15 - Randy Shoup - Ten (Hard-Won) Lessons of the DevOps Transition
What I learned from 5 years of sciencing the crap out of DevOps
DevOps: The Key to IT Performance
IoT to Cloud the DevOps Way
Ad

Similar to SOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHub (20)

PDF
Artur Suchwalko “What are common mistakes in Data Science projects and how to...
PPTX
Coaching teams in creative problem solving
PPTX
Resource and technology design process
PDF
Code mashadvancedtopicsworkshop
PPTX
BooK of EMC Introduction to Big data Analytics Module 2.pptx
PPT
STARCANADA 2013 Keynote: Lightning Strikes the Keynotes
PPTX
Learning Analytics Primer: Getting Started with Learning and Performance Anal...
PPTX
Application of analytics
PPTX
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
PDF
Clare Corthell: Learning Data Science Online
PDF
Deep learning for NLP
PDF
Machine Learning Product Managers Meetup Event
PPT
Decision making & problem solving
PPTX
Product Management in the Era of Data Science
PPTX
Operationalizing Machine Learning
PDF
"Startups, comment gérer une équipe de développeurs" par Laurent Cerveau
PDF
Barga Data Science lecture 2
PDF
Doing Analytics Right - Building the Analytics Environment
PPTX
Teacher training material
PDF
Binary crosswords
Artur Suchwalko “What are common mistakes in Data Science projects and how to...
Coaching teams in creative problem solving
Resource and technology design process
Code mashadvancedtopicsworkshop
BooK of EMC Introduction to Big data Analytics Module 2.pptx
STARCANADA 2013 Keynote: Lightning Strikes the Keynotes
Learning Analytics Primer: Getting Started with Learning and Performance Anal...
Application of analytics
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
Clare Corthell: Learning Data Science Online
Deep learning for NLP
Machine Learning Product Managers Meetup Event
Decision making & problem solving
Product Management in the Era of Data Science
Operationalizing Machine Learning
"Startups, comment gérer une équipe de développeurs" par Laurent Cerveau
Barga Data Science lecture 2
Doing Analytics Right - Building the Analytics Environment
Teacher training material
Binary crosswords
Ad

More from DevOpsDays Tel Aviv (20)

PDF
YOUR OPEN SOURCE PROJECT IS LIKE A STARTUP, TREAT IT LIKE ONE, EYAR ZILBERMAN...
PPTX
GRAPHQL TO THE RES(T)CUE, ELLA SHARAKANSKI, Salto
PPTX
MICROSERVICES ABOVE THE CLOUD - DESIGNING THE INTERNATIONAL SPACE STATION FOR...
PPTX
THE (IR)RATIONAL INCIDENT RESPONSE: HOW PSYCHOLOGICAL BIASES AFFECT INCIDENT ...
PPTX
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PPTX
NUDGE AND SLUDGE: DRIVING SECURITY WITH DESIGN // J. WOLFGANG GOERLICH, Duo S...
PPTX
(Ignite) TAKE A HIKE: PREVENTING BATTERY CORROSION - LEAH VOGEL, CHEGG
PPTX
BUILDING A DR PLAN FOR YOUR CLOUD INFRASTRUCTURE FROM THE GROUND UP, MOSHE BE...
PPTX
THE THREE DISCIPLINES OF CI/CD SECURITY, DANIEL KRIVELEVICH, Cider Security
PDF
THE PLEASURES OF ON-PREM, TOMER GABEL
PPTX
CONFIGURATION MANAGEMENT IN THE CLOUD NATIVE ERA, SHAHAR MINTZ, EggPack
PPTX
SOLVING THE DEVOPS CRISIS, ONE PERSON AT A TIME, CHRISTINA BABITSKI, Develeap
PPTX
OPTIMIZING PERFORMANCE USING CONTINUOUS PRODUCTION PROFILING ,YONATAN GOLDSCH...
PPTX
HOW TO SCALE YOUR ONCALL OPERATION, AND SURVIVE TO TELL, ANTON DRUKH
PPTX
FLYING BLIND - ACCESSIBILITY IN MONITORING, FEU MOUREK, Icinga
PPTX
(Ignite) WHAT'S BURNING THROUGH YOUR CLOUD BILL - GIL BAHAT, CIDER SECURITY
PPTX
SLO DRIVEN DEVELOPMENT, ALON NATIV, Tomorrow.io
PPTX
ONBOARDING IN LOCKDOWN, HILA FOX, Augury
PPTX
DON'T PANIC: GETTING YOUR INFRASTRUCTURE DRIFT UNDER CONTROL, ERAN BIBI, Firefly
PPTX
KEYNOTE | WHAT'S COMING IN THE NEXT 10 YEARS OF DEVOPS? // ELLEN CHISA, bolds...
YOUR OPEN SOURCE PROJECT IS LIKE A STARTUP, TREAT IT LIKE ONE, EYAR ZILBERMAN...
GRAPHQL TO THE RES(T)CUE, ELLA SHARAKANSKI, Salto
MICROSERVICES ABOVE THE CLOUD - DESIGNING THE INTERNATIONAL SPACE STATION FOR...
THE (IR)RATIONAL INCIDENT RESPONSE: HOW PSYCHOLOGICAL BIASES AFFECT INCIDENT ...
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
NUDGE AND SLUDGE: DRIVING SECURITY WITH DESIGN // J. WOLFGANG GOERLICH, Duo S...
(Ignite) TAKE A HIKE: PREVENTING BATTERY CORROSION - LEAH VOGEL, CHEGG
BUILDING A DR PLAN FOR YOUR CLOUD INFRASTRUCTURE FROM THE GROUND UP, MOSHE BE...
THE THREE DISCIPLINES OF CI/CD SECURITY, DANIEL KRIVELEVICH, Cider Security
THE PLEASURES OF ON-PREM, TOMER GABEL
CONFIGURATION MANAGEMENT IN THE CLOUD NATIVE ERA, SHAHAR MINTZ, EggPack
SOLVING THE DEVOPS CRISIS, ONE PERSON AT A TIME, CHRISTINA BABITSKI, Develeap
OPTIMIZING PERFORMANCE USING CONTINUOUS PRODUCTION PROFILING ,YONATAN GOLDSCH...
HOW TO SCALE YOUR ONCALL OPERATION, AND SURVIVE TO TELL, ANTON DRUKH
FLYING BLIND - ACCESSIBILITY IN MONITORING, FEU MOUREK, Icinga
(Ignite) WHAT'S BURNING THROUGH YOUR CLOUD BILL - GIL BAHAT, CIDER SECURITY
SLO DRIVEN DEVELOPMENT, ALON NATIV, Tomorrow.io
ONBOARDING IN LOCKDOWN, HILA FOX, Augury
DON'T PANIC: GETTING YOUR INFRASTRUCTURE DRIFT UNDER CONTROL, ERAN BIBI, Firefly
KEYNOTE | WHAT'S COMING IN THE NEXT 10 YEARS OF DEVOPS? // ELLEN CHISA, bolds...

Recently uploaded (20)

PDF
Comparative analysis of machine learning models for fake news detection in so...
PDF
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
PDF
STKI Israel Market Study 2025 version august
PPTX
Internet of Everything -Basic concepts details
PDF
Five Habits of High-Impact Board Members
PPTX
AI IN MARKETING- PRESENTED BY ANWAR KABIR 1st June 2025.pptx
PPTX
Build Your First AI Agent with UiPath.pptx
PDF
Improvisation in detection of pomegranate leaf disease using transfer learni...
PPTX
Configure Apache Mutual Authentication
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PDF
Transform-Your-Supply-Chain-with-AI-Driven-Quality-Engineering.pdf
PDF
UiPath Agentic Automation session 1: RPA to Agents
PDF
CXOs-Are-you-still-doing-manual-DevOps-in-the-age-of-AI.pdf
PDF
CloudStack 4.21: First Look Webinar slides
PDF
Flame analysis and combustion estimation using large language and vision assi...
PPTX
Custom Battery Pack Design Considerations for Performance and Safety
PPTX
Training Program for knowledge in solar cell and solar industry
PDF
5-Ways-AI-is-Revolutionizing-Telecom-Quality-Engineering.pdf
PDF
4 layer Arch & Reference Arch of IoT.pdf
PPT
Galois Field Theory of Risk: A Perspective, Protocol, and Mathematical Backgr...
Comparative analysis of machine learning models for fake news detection in so...
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
STKI Israel Market Study 2025 version august
Internet of Everything -Basic concepts details
Five Habits of High-Impact Board Members
AI IN MARKETING- PRESENTED BY ANWAR KABIR 1st June 2025.pptx
Build Your First AI Agent with UiPath.pptx
Improvisation in detection of pomegranate leaf disease using transfer learni...
Configure Apache Mutual Authentication
NewMind AI Weekly Chronicles – August ’25 Week III
Transform-Your-Supply-Chain-with-AI-Driven-Quality-Engineering.pdf
UiPath Agentic Automation session 1: RPA to Agents
CXOs-Are-you-still-doing-manual-DevOps-in-the-age-of-AI.pdf
CloudStack 4.21: First Look Webinar slides
Flame analysis and combustion estimation using large language and vision assi...
Custom Battery Pack Design Considerations for Performance and Safety
Training Program for knowledge in solar cell and solar industry
5-Ways-AI-is-Revolutionizing-Telecom-Quality-Engineering.pdf
4 layer Arch & Reference Arch of IoT.pdf
Galois Field Theory of Risk: A Perspective, Protocol, and Mathematical Backgr...

SOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHub

  • 1. DagsHub Solving MLOps from first principles November ‘21, Dean Pleban
  • 2. What we’ll cover 1. First principles thinking and mental models – a brief introduction 2. MLOps buyer’s remorse 3. Assumptions about the problem 4. Assumptions about ideal solutions 5. Example 1: Data Versioning 6. Generalizing to a framework – CONFIDENTIAL –
  • 3. About Me Dean Pleban Building tools for ML teamwork Strongly believe in open source Follow me: @DeanPlbn DeanPleban
  • 4. Definitions Mental models First principles thinking
  • 6. The ML world is changing – CONFIDENTIAL – OF TEAMS HAVE MODELS IN PRODUCTION 0% 10% 20% 30% 40% 50% 60% 70% 0 1-10 10-100 Over 100 # of models in production 80% Not just GAFAM… THE NEXT CHALLENGE IS SCALING FROM 1 TO 10 (OR 10 TO 100) (Poll with over 2000 votes)
  • 7. MLOps Fatigue – Too many tools, manually synchronized – CONFIDENTIAL – MLOPS TOOLS TO CHOOSE FROM & INTEGRATE WITH 280
  • 8. The signs Everything is manual Analysis Paralysis All or nothing Building everything in- house
  • 9. Minimal Assumptions about the PROBLEM You are not Google / Facebook MLOps is still in early days Save time / future proof / production ready tradeoff
  • 10. Minimal Assumptions about the SOLUTION Problem vs Feature focus Hard part starts when the first model goes to production Data scientists != developers and how this affects tooling Building on OSS makes sense for most cases
  • 11. Example 1: Data Versioning I want to version my data My data is regularly changing and I want to revert back to an older version for disaster recovery / governance Step 1: Define the problem
  • 12. Step 1: Define the problem Revert in case of bug Compare different versions Knowing which data is used where Add/modify data without breaking
  • 13. Step 1: Define the problem • Do you actually suffer from “all the above”? • Prioritizing is important, separating must-have and nice-to- have
  • 14. Example 1: Data Versioning The type of data you work with The type of data changes you expect What are the organizational constraints Who am I? Step 2: Define the problem parameters
  • 15. Step 2: Define the problem parameters • Flexibility to anything is tempting, but answering each question differently will lead to very different tooling, so being specific is important • Organizational constraints are specifically critical, since they are many times the largest limitations on the tools to use. This also ties into modularity. E.g. does your org only work with Azure cloud tools? • This can also be the step where we define a “user story” or workflow that includes this problem – e.g. are we going to version the DB directly, or just the outputs of our queries?
  • 16. Example 1: Data Versioning Step 3: Google the problem
  • 17. Step 3: Google the problem • Specifically, budget a reasonable amount of time (at least 2-3 hours) to research existing solutions • Now that you’ve defined the problem, and not just features, search for those (as well as experimenting with problem parameters), this will give you more tools, that prioritize different problem aspects • Build out an info page so that other people in the org can review and add inputs • You will probably learn that you were searching for the wrong keywords • Read blogs and forum posts and see what TERMS people are using, and search again • Ask friends, use Reddit as a tool to discover keywords – describe your problem and people will recommend the tools and categories you need.
  • 18. Step 3: Google the problem Reddit example Googling examples Example of a tool research output Recommend ed blogs
  • 19. Example 1: Data Versioning Pre-technical evaluation Operating principles “Hello World” Kick the tires – mechanically Step 4: Evaluate solutions
  • 20. Step 4: Evaluate solutions • Is there a hosted solution? • How much does it cost? • If I go for a hosted solution, how easy will it be to bring it in- house in the future, or customize it to my needs • How easy is it to get out of them • How easy is it to get out of them if they prove less useful
  • 21. Step 4: Evaluate solutions Comparing 2 data versioning tools from a “face value” perspective Looking at the operating principles of DVC Get started tutorial Try to add a dataset with 10K images
  • 22. Example 1: Data Versioning Start simple – 1 project, 1 user Define criteria for success, or don’t Review and extrapolate Step 5: Integrate
  • 23. The 5 step process 1. Define the problem 2. Define the problem parameters 3. Google the problem 4. Evaluate solutions 5. Integrate