SlideShare a Scribd company logo
robertwdempsey.com
Building a
Production-Level
Machine Learning Pipeline
Robert Dempsey, CEO
Atlantic Dominion Solutions
robertwdempsey.com Production ML Pipelines
Robert Dempsey
2
Entrepreneur, Software Engineer
Books and online courses
Lotus Guides, District Data Labs
Atlantic Dominion Solutions, LLC
Professional
Author
Instructor
Owner
robertwdempsey.com Production ML Pipelines
We’ve mastered three jobs so you can
focus on one - growing your business.
3
robertwdempsey.com Production ML Pipelines
The Three Jobs
At Atlantic Dominion Solutions we perform three functions for our
customers:
Consulting: we assess and advise in the areas of technology, team and
process to determine how machine learning can have the biggest impact on
your business.
Implementation: after a strategy session to determine the work you need we
get to work using our proven methodology and begin delivering smarter
applications.
Training: continuous improvement requires continuous learning. We provide
both on-premises and online training.
4
robertwdempsey.com Production ML Pipelines
Writing the Book
Co-authoring the book Building
Machine Learning Pipelines.
Written for software developers and
data scientists, Building Machine
Learning Pipelines teaches the skills
required to create and use the
infrastructure needed to run modern
intelligent systems.
machinelearningpipelines.com
5
robertwdempsey.com Production ML Pipelines6
What’s your biggest issue?
robertwdempsey.com Production ML Pipelines7
Technology is LEAST important
robertwdempsey.com Production ML Pipelines8
The REPORT Framework™
robertwdempsey.com Production ML Pipelines
REPORT Framework™
Risk Tolerance
Expectations
Product
Operations
Results
Team
9
robertwdempsey.com Production ML Pipelines
Risk Tolerance
Question: How risk averse are you?
Some companies happily deploy beta and release candidate versions of cutting
edge open source software. Others enjoy the freedom of open source and look
for only mature applications. And yet a third category swear off open source
all together and only buy software that comes with a license and a support
contract. Where does your company sit on the risk aversion spectrum?
Question: What are your non-technology risks?
Technology aside, what happens if your project fails? Do you get fired? Does
the entire team get fired? Do the naysayers get to say “I told you so” in a
meeting?
10
robertwdempsey.com Production ML Pipelines
Expectations
Question: What are the expectations around the project?
Here are a few questions to get you started:
• Non-Technical
• How long do you think the project will take? How much do you
expect it to cost?
• What are others expecting the system will be able to do?
• Technical
• How much volume does the system need to be able to process? In
what amount of time?
• What level of downtime can you absorb?
11
robertwdempsey.com Production ML Pipelines
Product
Question: What does the product roadmap say?
At a minimum a bullet point list will help set the expectations of others,
and allow you to make trade-offs as the project moves forward. It also
helps you measure results - discussed later - on an incremental basis,
which will help your team know if they are making progress, or not.
Question: What’s the budget and estimated ROI?
As with expectations and product roadmap, whether formalized or not,
there is always, or should always be a budget as well as an estimated
ROI. Write it down and use it as one of your metrics.
12
robertwdempsey.com Production ML Pipelines
Operations
Question: Got DevOps?
DevOps, sometimes called TechOps, is a group that manages
and maintains the technology infrastructure of the organization.
Just because you have a DevOps team doesn’t mean you want
to add additional strain on them by firing up more servers.
With cloud providers like AWS you still have to do some
infrastructure support and maintenance. The larger your
business the more support work there will be.
13
robertwdempsey.com Production ML Pipelines
Results
Question: What does the end result look like?
Here’s a very partial list of results we’ve seen measured:
• The project was completed on X date by X time.
• The project cost $X amount of money to complete.
• The team worked no more than 40 hours each week to get
the project done.
• X, Y and Z features are in the product and have 90%
automated test coverage.
14
robertwdempsey.com Production ML Pipelines
Team
Question: Are the right people on the bus to get the project completed?
Having the right people with the right skills, both hard and soft, can
make or break a project.
Question: Does each team member have the tools and support they
need to be successful?
• Does the team have the support of senior leadership?
• Are they going to encounter a deluge of bureaucratic red tape that
will slow their progress?
• Are development and testing environments available?
15
robertwdempsey.com Production ML Pipelines
ML Pipeline
Toolbox
16
robertwdempsey.com Production ML Pipelines
The “Standard” ML Pipeline
17
Collect Store Enrich
Train /
Apply
Visualize
Infrastructure
robertwdempsey.com Production ML Pipelines
Infrastructure
• Servers
• Amazon EC2
• Data center
• Container Technologies
• Docker
• Amazon Elastic Container Service (ECS)
18
robertwdempsey.com Production ML Pipelines
Collect
• Programming Languages
• Python
• Scala
• Go
• R
• Pre-Built Tools
• Pentaho Data Integration
• Various web scraping tools
19
robertwdempsey.com Production ML Pipelines
Store
• Elasticsearch
• Apache Kafka
• Redis
• Cassandra
• MongoDB
• SQL
• Amazon S3
• HDFS
• Many others
20
robertwdempsey.com Production ML Pipelines
Enrich
• Apache Storm
• Apache Spark
• Amazon Elastic MapReduce (EMR)
• Apache Nifi
• Airflow (Airbnb)
21
robertwdempsey.com Production ML Pipelines
Train / Apply
• Python Libraries
• Scikit-learn
• Pandas
• Spark Libraries
• MLlib
• Deep Learning
• Tensorflow
• PyTorch
22
robertwdempsey.com Production ML Pipelines
Visualize
• Kibana
• Grafana
• Amazon Athena (for S3)
• Flask
• D3.js
23
robertwdempsey.com Production ML Pipelines
Machine Learning
Pipeline Architectures
24
robertwdempsey.com Production ML Pipelines
Architecture 1
25
Agent
File
System
Apache
Spark
File
System
Agent ES
1 2 3
robertwdempsey.com Production ML Pipelines
Architecture 1 Choices
This pipeline was built at a company building a new platform
using all leading-edge technologies, and was a temporary
solution until another pipeline was built.
• Risk Aversion: not an issue.
• Expectations: the pipeline needed to be run in production
and be able to handle the amount of data the company had
in a timely fashion.
• Product: this was a short-term solution to process data until
the desired pipeline was ready to be deployed into
production.
26
robertwdempsey.com Production ML Pipelines
Architecture 1 Choices
• Operations: due to its simplicity and limited functionality,
the solution became a one-server solution deployed by an
engineer working in unison with an internal devops team
member.
• Results: the pipeline was deployed on time and was able to
process all the data within the parameters
• Team: after a consultant built the first version of the
application an internal team member took over and
deployed it into production.
27
robertwdempsey.com Production ML Pipelines
Architecture 2
28
Agent
1 2 3
Agent
Agent
ES
S3
HDFS
Apache
Kafka
Apache
Storm
robertwdempsey.com Production ML Pipelines
Architecture 2 Choices
This pipeline was built at a startup focused on data collection
and was core to the product.
• Risk Aversion: this was the second version of a previously
developed and well proven pipeline so risk aversion was low.
• Expectations: as a core product the pipeline was expected to
be continuously evolving, able to be horizontally scaled, able
to handle a growing amount of data, and have 100% uptime.
• Product: the functionality built was in line with a product
roadmap that was reviewed on a monthly basis.
29
robertwdempsey.com Production ML Pipelines
Architecture 2 Choices
• Operations: an internal devops team managed the
infrastructure while engineers were expected to support the
associated applications and data processors
• Results: the pipeline could be horizontally scaled, handled
between 1-2TB of data per day, and had 99.9% uptime.
• Team: the devops and engineering teams worked together
to produce and support it.
30
robertwdempsey.com Production ML Pipelines
Architecture 3
31
Agent
1 2 3
Agent
Agent
Athena
S3
S3
Apache
Spark
robertwdempsey.com Production ML Pipelines
Architecture 3 Choices
This pipeline was built at a company building a new platform
using all leading-edge technologies, and was a temporary
solution until another pipeline was built.
• Risk Aversion: this system was mission critical for delivering
data in real-time to customers. Failure was not an option so
best in class practices needed to be implemented included
using hosted solutions such as Databricks and S3.
• Expectations: this system would scale as data collection
efforts grew and would be extremely fault tolerant.
32
robertwdempsey.com Production ML Pipelines
Architecture 3 Choices
• Product: this system would be extended to accommodate
additional product offerings so flexibility was important.
• Operations: this system was maintained by the engineers
who built it as there no separate devops team.
• Results: the system processed several TBs of data per hour
(need to double check this) with minimal downtime.
• Team: the team supporting the pipeline set up monitoring
and alerting to ensure uptime and worked with other
engineering groups to deconflict deployments that might
impact the pipeline.
33
robertwdempsey.com Production ML Pipelines
Architecture 4
34
Agent
1 2 3
Agent
Agent
ES
S3
HDFS
Apache
Kafka
Apache
Spark
HBase
robertwdempsey.com Production ML Pipelines
Architecture 4 Choices
This pipeline was built at a company building a new platform using all
leading-edge technologies, and was a temporary solution until another
pipeline was built.
• Risk Aversion: this system supported a key customer and was being
implemented as a means to resolve data loss and data discrepancies
that had plagued a legacy system.
• Expectations: this system would be resilient in the event of an outage
so that no data would be lost.
• Product: this system would ultimately be replaced by a more general
system designed to support multiple customers, so it was considered
extremely critical yet a one-off.
35
robertwdempsey.com Production ML Pipelines
Architecture 4 Choices
• Operations: this system was maintained by the engineers
who built it as at the time there was no technical operations
team in place.
• Results: the system processed hundreds of GBs of data per
day with infrequent outages.
• Team: once deployed, the team of developers who built this
pipeline began work on incorporating its features into a
more generalized stream processing platform.
36
robertwdempsey.com Production ML Pipelines
Q&A
37
robertwdempsey.com Production ML Pipelines
Free Guide
robertwdempsey.com/machineryai
38
robertwdempsey.com Production ML Pipelines
Where to Find Me
Website
Lotus Guides
LinkedIn
Twitter
Github
39
robertwdempsey.com
lotusguides.com
robertwdempsey
rdempsey
rdempsey
robertwdempsey.com Production ML Pipelines
Thank You!
40

More Related Content

What's hot (20)

PDF
ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens...
Databricks
 
PPTX
Magdalena Stenius: MLOPS Will Change Machine Learning
Lviv Startup Club
 
PDF
Seamless MLOps with Seldon and MLflow
Databricks
 
PDF
Developing ML-enabled Data Pipelines on Databricks using IDE & CI/CD at Runta...
Databricks
 
PDF
Provenance in Production-Grade Machine Learning
Anand Sampat
 
PPTX
Richard Coffey (x18140785) - Research in Computing CA2
Richard Coffey
 
PDF
Version Control in Machine Learning + AI (Stanford)
Anand Sampat
 
PDF
Challenges of Operationalising Data Science in Production
iguazio
 
PDF
MLOps - Build pipelines with Tensor Flow Extended & Kubeflow
Jan Kirenz
 
PPTX
MLOps and Data Quality: Deploying Reliable ML Models in Production
Provectus
 
PPTX
Ai use cases
Sparsh Agarwal
 
PDF
Deep Learning for Natural Language Processing Using Apache Spark and TensorFl...
Databricks
 
PDF
Managers guide to effective building of machine learning products
Gianmario Spacagna
 
PDF
Feature drift monitoring as a service for machine learning models at scale
Noriaki Tatsumi
 
PPTX
Infrastructure Solutions for Deploying AI/ML/DL Workloads at Scale
Robb Boyd
 
PDF
Ml infra at an early stage
Nick Handel
 
PDF
Vertex AI: Pipelines for your MLOps workflows
Márton Kodok
 
PPTX
Py data scikit-production
Turi, Inc.
 
PDF
Simplifying AI integration on Apache Spark
Databricks
 
PDF
Machine Learning Teams - Full Stack Deep Learning
Sergey Karayev
 
ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens...
Databricks
 
Magdalena Stenius: MLOPS Will Change Machine Learning
Lviv Startup Club
 
Seamless MLOps with Seldon and MLflow
Databricks
 
Developing ML-enabled Data Pipelines on Databricks using IDE & CI/CD at Runta...
Databricks
 
Provenance in Production-Grade Machine Learning
Anand Sampat
 
Richard Coffey (x18140785) - Research in Computing CA2
Richard Coffey
 
Version Control in Machine Learning + AI (Stanford)
Anand Sampat
 
Challenges of Operationalising Data Science in Production
iguazio
 
MLOps - Build pipelines with Tensor Flow Extended & Kubeflow
Jan Kirenz
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
Provectus
 
Ai use cases
Sparsh Agarwal
 
Deep Learning for Natural Language Processing Using Apache Spark and TensorFl...
Databricks
 
Managers guide to effective building of machine learning products
Gianmario Spacagna
 
Feature drift monitoring as a service for machine learning models at scale
Noriaki Tatsumi
 
Infrastructure Solutions for Deploying AI/ML/DL Workloads at Scale
Robb Boyd
 
Ml infra at an early stage
Nick Handel
 
Vertex AI: Pipelines for your MLOps workflows
Márton Kodok
 
Py data scikit-production
Turi, Inc.
 
Simplifying AI integration on Apache Spark
Databricks
 
Machine Learning Teams - Full Stack Deep Learning
Sergey Karayev
 

Viewers also liked (14)

PPTX
Production machine learning_infrastructure
joshwills
 
PDF
Serverless machine learning operations
Stepan Pushkarev
 
PDF
PostgreSQL + Kafka: The Delight of Change Data Capture
Jeff Klukas
 
PDF
Square's Machine Learning Infrastructure and Applications - Rong Yan
Hakka Labs
 
PDF
Python as part of a production machine learning stack by Michael Manapat PyDa...
PyData
 
PPTX
Production and Beyond: Deploying and Managing Machine Learning Models
Turi, Inc.
 
PDF
Multi runtime serving pipelines for machine learning
Stepan Pushkarev
 
PDF
Machine learning in production with scikit-learn
Jeff Klukas
 
PDF
Using PySpark to Process Boat Loads of Data
Robert Dempsey
 
PPTX
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
Jose Quesada (hiring)
 
PPTX
Machine Learning In Production
Samir Bessalah
 
PDF
Machine Learning Pipelines
jeykottalam
 
PDF
Spark and machine learning in microservices architecture
Stepan Pushkarev
 
PPTX
AI and Machine Learning Demystified by Carol Smith at Midwest UX 2017
Carol Smith
 
Production machine learning_infrastructure
joshwills
 
Serverless machine learning operations
Stepan Pushkarev
 
PostgreSQL + Kafka: The Delight of Change Data Capture
Jeff Klukas
 
Square's Machine Learning Infrastructure and Applications - Rong Yan
Hakka Labs
 
Python as part of a production machine learning stack by Michael Manapat PyDa...
PyData
 
Production and Beyond: Deploying and Managing Machine Learning Models
Turi, Inc.
 
Multi runtime serving pipelines for machine learning
Stepan Pushkarev
 
Machine learning in production with scikit-learn
Jeff Klukas
 
Using PySpark to Process Boat Loads of Data
Robert Dempsey
 
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
Jose Quesada (hiring)
 
Machine Learning In Production
Samir Bessalah
 
Machine Learning Pipelines
jeykottalam
 
Spark and machine learning in microservices architecture
Stepan Pushkarev
 
AI and Machine Learning Demystified by Carol Smith at Midwest UX 2017
Carol Smith
 
Ad

Similar to Building A Production-Level Machine Learning Pipeline (20)

PPTX
Deploying ML models to production (frequently and safely) - PYCON 2018
David Tan
 
PDF
Productionising Machine Learning Models
Tash Bickley
 
PPTX
A machine learning and data science pipeline for real companies
DataWorks Summit
 
PDF
Azure Engineering MLOps
Minesh A. Jethva
 
PDF
Rsqrd AI: From R&D to ROI of AI
Sanjana Chowdhury
 
PPTX
From Data Science to MLOps
Carl W. Handlin
 
PPTX
03_aiops-1.pptx
FarazulHoda2
 
PDF
How to Build an MLOps Pipeline - SoluLab
SoluLab1231
 
PDF
MLOps – Applying DevOps to Competitive Advantage
DATAVERSITY
 
PPTX
Deploying ML models in the enterprise
doppenhe
 
PDF
Machine Learning Model Deployment: Strategy to Implementation
DataWorks Summit
 
PDF
Limited Budget but Effective End to End MLOps Practices (Machine Learning Mod...
IRJET Journal
 
PDF
MACHINE LEARNING AUTOMATIONS PIPELINE WITH CI/CD
IRJET Journal
 
PPTX
DevOps and Machine Learning (Geekwire Cloud Tech Summit)
Jasjeet Thind
 
PDF
Build machine learning pipelines from research to production
cnvrg.io AI OS - Hands-on ML Workshops
 
PDF
Tecton - The state of applied ML 2023.pdf
adjie131
 
PDF
Continuous delivery for machine learning
Rajesh Muppalla
 
PPTX
Deploying ML models to production (frequently and safely) - PYCON 2018
David Tan
 
PDF
ML in Production at FunTech Meetup (Feb 2019)
Mark Andreev
 
PDF
A survey on Machine Learning In Production (July 2018)
Arnab Biswas
 
Deploying ML models to production (frequently and safely) - PYCON 2018
David Tan
 
Productionising Machine Learning Models
Tash Bickley
 
A machine learning and data science pipeline for real companies
DataWorks Summit
 
Azure Engineering MLOps
Minesh A. Jethva
 
Rsqrd AI: From R&D to ROI of AI
Sanjana Chowdhury
 
From Data Science to MLOps
Carl W. Handlin
 
03_aiops-1.pptx
FarazulHoda2
 
How to Build an MLOps Pipeline - SoluLab
SoluLab1231
 
MLOps – Applying DevOps to Competitive Advantage
DATAVERSITY
 
Deploying ML models in the enterprise
doppenhe
 
Machine Learning Model Deployment: Strategy to Implementation
DataWorks Summit
 
Limited Budget but Effective End to End MLOps Practices (Machine Learning Mod...
IRJET Journal
 
MACHINE LEARNING AUTOMATIONS PIPELINE WITH CI/CD
IRJET Journal
 
DevOps and Machine Learning (Geekwire Cloud Tech Summit)
Jasjeet Thind
 
Build machine learning pipelines from research to production
cnvrg.io AI OS - Hands-on ML Workshops
 
Tecton - The state of applied ML 2023.pdf
adjie131
 
Continuous delivery for machine learning
Rajesh Muppalla
 
Deploying ML models to production (frequently and safely) - PYCON 2018
David Tan
 
ML in Production at FunTech Meetup (Feb 2019)
Mark Andreev
 
A survey on Machine Learning In Production (July 2018)
Arnab Biswas
 
Ad

More from Robert Dempsey (20)

PDF
Analyzing Semi-Structured Data At Volume In The Cloud
Robert Dempsey
 
PDF
Practical Predictive Modeling in Python
Robert Dempsey
 
PDF
Creating Your First Predictive Model In Python
Robert Dempsey
 
PDF
Growth Hacking 101
Robert Dempsey
 
PPTX
Web Scraping With Python
Robert Dempsey
 
PPTX
DC Python Intro Slides - Rob's Version
Robert Dempsey
 
PDF
Content Marketing Strategy for 2013
Robert Dempsey
 
PDF
Creating Lead-Generating Social Media Campaigns
Robert Dempsey
 
PDF
Goal Writing Workshop
Robert Dempsey
 
PDF
Google AdWords Introduction
Robert Dempsey
 
PDF
20 Tips For Freelance Success
Robert Dempsey
 
PDF
How To Turn Your Business Into A Media Powerhouse
Robert Dempsey
 
PDF
Agile Teams as Innovation Teams
Robert Dempsey
 
PDF
Introduction to kanban
Robert Dempsey
 
PDF
Get The **** Up And Market
Robert Dempsey
 
PDF
Introduction To Inbound Marketing
Robert Dempsey
 
PDF
Writing Agile Requirements
Robert Dempsey
 
PDF
Twitter For Business
Robert Dempsey
 
PDF
Introduction To Scrum For Managers
Robert Dempsey
 
PDF
Introduction to Agile for Managers
Robert Dempsey
 
Analyzing Semi-Structured Data At Volume In The Cloud
Robert Dempsey
 
Practical Predictive Modeling in Python
Robert Dempsey
 
Creating Your First Predictive Model In Python
Robert Dempsey
 
Growth Hacking 101
Robert Dempsey
 
Web Scraping With Python
Robert Dempsey
 
DC Python Intro Slides - Rob's Version
Robert Dempsey
 
Content Marketing Strategy for 2013
Robert Dempsey
 
Creating Lead-Generating Social Media Campaigns
Robert Dempsey
 
Goal Writing Workshop
Robert Dempsey
 
Google AdWords Introduction
Robert Dempsey
 
20 Tips For Freelance Success
Robert Dempsey
 
How To Turn Your Business Into A Media Powerhouse
Robert Dempsey
 
Agile Teams as Innovation Teams
Robert Dempsey
 
Introduction to kanban
Robert Dempsey
 
Get The **** Up And Market
Robert Dempsey
 
Introduction To Inbound Marketing
Robert Dempsey
 
Writing Agile Requirements
Robert Dempsey
 
Twitter For Business
Robert Dempsey
 
Introduction To Scrum For Managers
Robert Dempsey
 
Introduction to Agile for Managers
Robert Dempsey
 

Recently uploaded (20)

PDF
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PPTX
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
PPTX
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
PDF
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
PPTX
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
PPTX
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
PDF
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
PDF
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
PDF
CIFDAQ Weekly Market Wrap for 11th July 2025
CIFDAQ
 
PDF
From Code to Challenge: Crafting Skill-Based Games That Engage and Reward
aiyshauae
 
PDF
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
PDF
IoT-Powered Industrial Transformation – Smart Manufacturing to Connected Heal...
Rejig Digital
 
PDF
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
PDF
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
HubSpot Main Hub: A Unified Growth Platform
Jaswinder Singh
 
PDF
Fl Studio 24.2.2 Build 4597 Crack for Windows Free Download 2025
faizk77g
 
PDF
July Patch Tuesday
Ivanti
 
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
CIFDAQ Weekly Market Wrap for 11th July 2025
CIFDAQ
 
From Code to Challenge: Crafting Skill-Based Games That Engage and Reward
aiyshauae
 
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
IoT-Powered Industrial Transformation – Smart Manufacturing to Connected Heal...
Rejig Digital
 
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
HubSpot Main Hub: A Unified Growth Platform
Jaswinder Singh
 
Fl Studio 24.2.2 Build 4597 Crack for Windows Free Download 2025
faizk77g
 
July Patch Tuesday
Ivanti
 

Building A Production-Level Machine Learning Pipeline

  • 1. robertwdempsey.com Building a Production-Level Machine Learning Pipeline Robert Dempsey, CEO Atlantic Dominion Solutions
  • 2. robertwdempsey.com Production ML Pipelines Robert Dempsey 2 Entrepreneur, Software Engineer Books and online courses Lotus Guides, District Data Labs Atlantic Dominion Solutions, LLC Professional Author Instructor Owner
  • 3. robertwdempsey.com Production ML Pipelines We’ve mastered three jobs so you can focus on one - growing your business. 3
  • 4. robertwdempsey.com Production ML Pipelines The Three Jobs At Atlantic Dominion Solutions we perform three functions for our customers: Consulting: we assess and advise in the areas of technology, team and process to determine how machine learning can have the biggest impact on your business. Implementation: after a strategy session to determine the work you need we get to work using our proven methodology and begin delivering smarter applications. Training: continuous improvement requires continuous learning. We provide both on-premises and online training. 4
  • 5. robertwdempsey.com Production ML Pipelines Writing the Book Co-authoring the book Building Machine Learning Pipelines. Written for software developers and data scientists, Building Machine Learning Pipelines teaches the skills required to create and use the infrastructure needed to run modern intelligent systems. machinelearningpipelines.com 5
  • 6. robertwdempsey.com Production ML Pipelines6 What’s your biggest issue?
  • 7. robertwdempsey.com Production ML Pipelines7 Technology is LEAST important
  • 8. robertwdempsey.com Production ML Pipelines8 The REPORT Framework™
  • 9. robertwdempsey.com Production ML Pipelines REPORT Framework™ Risk Tolerance Expectations Product Operations Results Team 9
  • 10. robertwdempsey.com Production ML Pipelines Risk Tolerance Question: How risk averse are you? Some companies happily deploy beta and release candidate versions of cutting edge open source software. Others enjoy the freedom of open source and look for only mature applications. And yet a third category swear off open source all together and only buy software that comes with a license and a support contract. Where does your company sit on the risk aversion spectrum? Question: What are your non-technology risks? Technology aside, what happens if your project fails? Do you get fired? Does the entire team get fired? Do the naysayers get to say “I told you so” in a meeting? 10
  • 11. robertwdempsey.com Production ML Pipelines Expectations Question: What are the expectations around the project? Here are a few questions to get you started: • Non-Technical • How long do you think the project will take? How much do you expect it to cost? • What are others expecting the system will be able to do? • Technical • How much volume does the system need to be able to process? In what amount of time? • What level of downtime can you absorb? 11
  • 12. robertwdempsey.com Production ML Pipelines Product Question: What does the product roadmap say? At a minimum a bullet point list will help set the expectations of others, and allow you to make trade-offs as the project moves forward. It also helps you measure results - discussed later - on an incremental basis, which will help your team know if they are making progress, or not. Question: What’s the budget and estimated ROI? As with expectations and product roadmap, whether formalized or not, there is always, or should always be a budget as well as an estimated ROI. Write it down and use it as one of your metrics. 12
  • 13. robertwdempsey.com Production ML Pipelines Operations Question: Got DevOps? DevOps, sometimes called TechOps, is a group that manages and maintains the technology infrastructure of the organization. Just because you have a DevOps team doesn’t mean you want to add additional strain on them by firing up more servers. With cloud providers like AWS you still have to do some infrastructure support and maintenance. The larger your business the more support work there will be. 13
  • 14. robertwdempsey.com Production ML Pipelines Results Question: What does the end result look like? Here’s a very partial list of results we’ve seen measured: • The project was completed on X date by X time. • The project cost $X amount of money to complete. • The team worked no more than 40 hours each week to get the project done. • X, Y and Z features are in the product and have 90% automated test coverage. 14
  • 15. robertwdempsey.com Production ML Pipelines Team Question: Are the right people on the bus to get the project completed? Having the right people with the right skills, both hard and soft, can make or break a project. Question: Does each team member have the tools and support they need to be successful? • Does the team have the support of senior leadership? • Are they going to encounter a deluge of bureaucratic red tape that will slow their progress? • Are development and testing environments available? 15
  • 16. robertwdempsey.com Production ML Pipelines ML Pipeline Toolbox 16
  • 17. robertwdempsey.com Production ML Pipelines The “Standard” ML Pipeline 17 Collect Store Enrich Train / Apply Visualize Infrastructure
  • 18. robertwdempsey.com Production ML Pipelines Infrastructure • Servers • Amazon EC2 • Data center • Container Technologies • Docker • Amazon Elastic Container Service (ECS) 18
  • 19. robertwdempsey.com Production ML Pipelines Collect • Programming Languages • Python • Scala • Go • R • Pre-Built Tools • Pentaho Data Integration • Various web scraping tools 19
  • 20. robertwdempsey.com Production ML Pipelines Store • Elasticsearch • Apache Kafka • Redis • Cassandra • MongoDB • SQL • Amazon S3 • HDFS • Many others 20
  • 21. robertwdempsey.com Production ML Pipelines Enrich • Apache Storm • Apache Spark • Amazon Elastic MapReduce (EMR) • Apache Nifi • Airflow (Airbnb) 21
  • 22. robertwdempsey.com Production ML Pipelines Train / Apply • Python Libraries • Scikit-learn • Pandas • Spark Libraries • MLlib • Deep Learning • Tensorflow • PyTorch 22
  • 23. robertwdempsey.com Production ML Pipelines Visualize • Kibana • Grafana • Amazon Athena (for S3) • Flask • D3.js 23
  • 24. robertwdempsey.com Production ML Pipelines Machine Learning Pipeline Architectures 24
  • 25. robertwdempsey.com Production ML Pipelines Architecture 1 25 Agent File System Apache Spark File System Agent ES 1 2 3
  • 26. robertwdempsey.com Production ML Pipelines Architecture 1 Choices This pipeline was built at a company building a new platform using all leading-edge technologies, and was a temporary solution until another pipeline was built. • Risk Aversion: not an issue. • Expectations: the pipeline needed to be run in production and be able to handle the amount of data the company had in a timely fashion. • Product: this was a short-term solution to process data until the desired pipeline was ready to be deployed into production. 26
  • 27. robertwdempsey.com Production ML Pipelines Architecture 1 Choices • Operations: due to its simplicity and limited functionality, the solution became a one-server solution deployed by an engineer working in unison with an internal devops team member. • Results: the pipeline was deployed on time and was able to process all the data within the parameters • Team: after a consultant built the first version of the application an internal team member took over and deployed it into production. 27
  • 28. robertwdempsey.com Production ML Pipelines Architecture 2 28 Agent 1 2 3 Agent Agent ES S3 HDFS Apache Kafka Apache Storm
  • 29. robertwdempsey.com Production ML Pipelines Architecture 2 Choices This pipeline was built at a startup focused on data collection and was core to the product. • Risk Aversion: this was the second version of a previously developed and well proven pipeline so risk aversion was low. • Expectations: as a core product the pipeline was expected to be continuously evolving, able to be horizontally scaled, able to handle a growing amount of data, and have 100% uptime. • Product: the functionality built was in line with a product roadmap that was reviewed on a monthly basis. 29
  • 30. robertwdempsey.com Production ML Pipelines Architecture 2 Choices • Operations: an internal devops team managed the infrastructure while engineers were expected to support the associated applications and data processors • Results: the pipeline could be horizontally scaled, handled between 1-2TB of data per day, and had 99.9% uptime. • Team: the devops and engineering teams worked together to produce and support it. 30
  • 31. robertwdempsey.com Production ML Pipelines Architecture 3 31 Agent 1 2 3 Agent Agent Athena S3 S3 Apache Spark
  • 32. robertwdempsey.com Production ML Pipelines Architecture 3 Choices This pipeline was built at a company building a new platform using all leading-edge technologies, and was a temporary solution until another pipeline was built. • Risk Aversion: this system was mission critical for delivering data in real-time to customers. Failure was not an option so best in class practices needed to be implemented included using hosted solutions such as Databricks and S3. • Expectations: this system would scale as data collection efforts grew and would be extremely fault tolerant. 32
  • 33. robertwdempsey.com Production ML Pipelines Architecture 3 Choices • Product: this system would be extended to accommodate additional product offerings so flexibility was important. • Operations: this system was maintained by the engineers who built it as there no separate devops team. • Results: the system processed several TBs of data per hour (need to double check this) with minimal downtime. • Team: the team supporting the pipeline set up monitoring and alerting to ensure uptime and worked with other engineering groups to deconflict deployments that might impact the pipeline. 33
  • 34. robertwdempsey.com Production ML Pipelines Architecture 4 34 Agent 1 2 3 Agent Agent ES S3 HDFS Apache Kafka Apache Spark HBase
  • 35. robertwdempsey.com Production ML Pipelines Architecture 4 Choices This pipeline was built at a company building a new platform using all leading-edge technologies, and was a temporary solution until another pipeline was built. • Risk Aversion: this system supported a key customer and was being implemented as a means to resolve data loss and data discrepancies that had plagued a legacy system. • Expectations: this system would be resilient in the event of an outage so that no data would be lost. • Product: this system would ultimately be replaced by a more general system designed to support multiple customers, so it was considered extremely critical yet a one-off. 35
  • 36. robertwdempsey.com Production ML Pipelines Architecture 4 Choices • Operations: this system was maintained by the engineers who built it as at the time there was no technical operations team in place. • Results: the system processed hundreds of GBs of data per day with infrequent outages. • Team: once deployed, the team of developers who built this pipeline began work on incorporating its features into a more generalized stream processing platform. 36
  • 38. robertwdempsey.com Production ML Pipelines Free Guide robertwdempsey.com/machineryai 38
  • 39. robertwdempsey.com Production ML Pipelines Where to Find Me Website Lotus Guides LinkedIn Twitter Github 39 robertwdempsey.com lotusguides.com robertwdempsey rdempsey rdempsey
  • 40. robertwdempsey.com Production ML Pipelines Thank You! 40