Multi-Runtime Serving Pipelines
Stepan Pushkarev
CTO of Hydrosphere.io
About
Mission: Accelerate Machine Learning to Production
Open-source products:
- Mist: Serverless proxy for Spark
- ML Lambda: ML Function as a Service
- Sonar: Data and ML Monitoring
Business model: Subscription services and hands-on consulting
Deployment | Serving | Scoring | Inference
@Nvidia https://www.nvidia.com/en-us/deep-learning-ai/solutions/
From Single Model to Meta Pipelines
Product Matching
Does this pair describe the same thing?
Item 1
  Title: Authentic HERMES Bijouterie Fantaisie Selle Clip-On Earrings Silvertone #S1742 E
  Specs: Brand: HERMES; Size(cm): W1.8 x H1.8 cm (Approx); Color: Silver; Size(inch): W0.7 x H0.7" (Approx); Style: Earrings; Rank: B
  Description: ...
Item 2
  Title: Auth HERMES Earrings Sellier Clip-on Silver Tone Round $0 Ship 25130490900 S06B
  Specs: Brand: Hermes; Fastening: Clip-On; Style: Clip on; Country/Region of Manufacture: Unknown; Metal: Silver Plated; Main Color: Silver; Color: Silver
  Description: ...
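To make the question concrete, here is a minimal sketch of a naive baseline: token-level Jaccard similarity on the two listing titles above. This is not the method from the talk; the helper names `tokens` and `jaccard` are hypothetical and chosen only for illustration.

```python
import re


def tokens(text: str) -> set:
    """Lowercase the text and split it into alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a: str, b: str) -> float:
    """Jaccard similarity of the two token sets: |A & B| / |A | B|."""
    ta, tb = tokens(a), tokens(b)
    if not ta and not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


title_1 = ("Authentic HERMES Bijouterie Fantaisie Selle Clip-On "
           "Earrings Silvertone #S1742 E")
title_2 = ("Auth HERMES Earrings Sellier Clip-on Silver Tone Round "
           "$0 Ship 25130490900 S06B")

# Shared tokens: hermes, earrings, clip, on (4 of 20 distinct tokens).
score = jaccard(title_1, title_2)
print(round(score, 2))  # 0.2
```

The two listings share only 4 of 20 distinct title tokens, so a single string-overlap score rates them as dissimilar even though a human would likely call them the same product. That gap is one reason to expect several cooperating stages rather than a single model.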
Model Artifact:
Ops perspective
API & Logistics
- HTTP/1.1, HTTP/2, gRPC
- Kafka, Flink, Kinesis
- Protobuf, Avro
- Service Discovery
- Pipelining
- Tracing
- Monitoring
- Autoscaling
- Versioning
- A/B, Canary
- Testing
- CPU, GPU
Monitoring
Shifting experimentation to production
Sidecar Architecture
- Functions registry: responsible for the model life cycle and all the business logic required to configure models for serving
- Mesh of serving runtimes: the actual serving cluster
- Infrastructure integration: ECS for AWS, Kubernetes for GCE and on-premise
UX: Models and Applications
Applications provide public virtual endpoints for models and compositions of models.
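The idea of an application as a composition of models behind one endpoint can be sketched as follows. This is an illustrative sketch only, not the Hydrosphere API: `Application`, `serve`, and the two stand-in stages are hypothetical names, and each stage is a plain function where a real deployment would call a model in its own runtime.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List

Payload = Dict[str, Any]
Stage = Callable[[Payload], Payload]


@dataclass
class Application:
    """A named endpoint that chains model stages in order (hypothetical type)."""
    name: str
    stages: List[Stage]

    def serve(self, request: Payload) -> Payload:
        payload = dict(request)  # copy so the caller's dict is left untouched
        for stage in self.stages:
            payload = stage(payload)
        return payload


# Stand-ins for models that could each live in a different runtime.
def normalize_title(p: Payload) -> Payload:
    return {**p, "title": p["title"].lower().strip()}


def extract_brand(p: Payload) -> Payload:
    known = ("hermes",)  # toy dictionary, assumed for the example
    brand = next((b for b in known if b in p["title"]), None)
    return {**p, "brand": brand}


app = Application("product-features", [normalize_title, extract_brand])
result = app.serve({"title": "  Auth HERMES Earrings Sellier Clip-on "})
print(result)
```

Callers only see the application name and its request/response shape, so a stage can be swapped or re-versioned without changing the public endpoint.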
Why Not just one Big Neural Network?
● Not always possible
● Stages could be independent
● Ad-hoc rule based models
● Physics models (e.g. LIDAR)
● Big end-to-end deep learning requires black-magic skills
Why Not just one Python script?
● Modularity. Stages could be developed by different teams
● Traceability and Monitoring
● Versioning
● Independent deployment, A/B testing and Canary
● Request Shadowing and other cool stuff
● Could require different ML runtimes (TF, Scikit, Spark ML, etc.)
● We need more microservices :)
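Two of the bullets above, canary deployment and request shadowing, can be illustrated with a small routing sketch. It assumes models are plain callables and that routing happens in-process; in a real serving mesh this logic would normally sit in a proxy or sidecar. The `route` function and its parameters are hypothetical.

```python
import random
from typing import Callable, List, Optional, Tuple

Model = Callable[[str], str]


def route(request: str, stable: Model, canary: Model, canary_share: float,
          rng: random.Random, shadow: Optional[Model] = None,
          shadow_log: Optional[List[Tuple[str, str]]] = None) -> str:
    # Shadowing: the shadow model sees every request, but its answer
    # is only logged and never returned to the caller.
    if shadow is not None and shadow_log is not None:
        shadow_log.append((request, shadow(request)))
    # Canary: a configurable fraction of traffic goes to the new version.
    chosen = canary if rng.random() < canary_share else stable
    return chosen(request)


def model_v1(x: str) -> str:
    return "v1:" + x


def model_v2(x: str) -> str:
    return "v2:" + x


rng = random.Random(0)  # seeded so the run is repeatable
log: List[Tuple[str, str]] = []
answers = [
    route(f"req{i}", model_v1, model_v2, 0.1, rng,
          shadow=model_v2, shadow_log=log)
    for i in range(1000)
]
canary_hits = sum(a.startswith("v2:") for a in answers)
print(canary_hits, len(log))
```

With a 10% canary share, roughly one request in ten is answered by the new version, while the shadow log records the new version's output for every request. Keeping each stage as a separately deployed service is what makes this kind of per-stage traffic control possible.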
Why Not just TF Serving?
● Other ML runtimes (DL4J, Scikit, Spark ML). Servables are overkill.
● Need better versioning and immutability (Docker per version)
● Don’t want to deal with state (model loaded, offloaded, etc.)
● Want to re-use microservices stack (tracing, logging, metrics)
● Need better scalability
Demo
Thank you
- @hydrospheredata
- https://github.com/Hydrospheredata
- https://hydrosphere.io/
- spushkarev@hydrosphere.io
