Learn to Use Databricks for the Full ML Lifecycle
Rafi Kurlansik, Sr. Solutions Architect
[Diagram: Lakehouse Platform]
Data types: Structured, Semi-structured, Unstructured, Streaming
Workloads: BI & SQL Analytics; Data Science & Engineering; Machine Learning; Real-time Data Applications
Foundation: Data Management & Governance; Open Data Storage
Simple | Open | Collaborative
Reliable | Scalable | Secure
Our focus today
Essential Capabilities for Full ML Lifecycle
To lower risk and maintain stability of our ML pipeline, we need to think about:
● Robust Data Processing and Management
● Secure Collaboration
● Testing
● Monitoring
● Reproducibility
● Documentation
...for code, data, and models.
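
As a rough illustration of what reproducibility "for code, data, and models" can mean in practice, the sketch below records a fingerprint of each input to a training run so that two runs can be compared later. It is a minimal sketch using only the Python standard library, not the tooling shown in the talk; the function names (`fingerprint`, `make_run_record`, `same_inputs`) and the record fields are hypothetical.

```python
import hashlib
import json


def fingerprint(obj) -> str:
    """Return a stable SHA-256 hex digest of a JSON-serializable object."""
    # sort_keys makes the digest independent of dict key order.
    payload = json.dumps(obj, sort_keys=True, separators=(",", ":")).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def make_run_record(code_version: str, training_rows: list, model_params: dict) -> dict:
    """Capture what a training run depended on: code, data, and model settings."""
    return {
        "code_version": code_version,  # e.g. a Git commit hash (assumed input)
        "data_fingerprint": fingerprint(training_rows),
        "params_fingerprint": fingerprint(model_params),
    }


def same_inputs(record_a: dict, record_b: dict) -> bool:
    """Two runs are comparable only if code, data, and parameters all match."""
    return record_a == record_b
```

If two run records differ, the differing field shows whether the code, the data, or the model configuration changed between runs.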
Business Context: Customer Retention
You are on a marketing analytics team with a large amount of demographic and
historical service data on your customers who have churned. That data has
already been put into a SQL Analytics dashboard.
Business stakeholders have asked the data team to go further and
predict which customers are likely to churn. Knowing this will allow the business
to take action and retain revenue.
Sounds simple enough. What steps do we need to take?
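
One way to frame the prediction step is as binary classification: each customer gets a probability of churning, learned from past customers whose outcome is known. The sketch below is a toy logistic regression in plain Python, trained by batch gradient descent. It is only an illustration under stated assumptions: the two features (tenure in years, number of support tickets) and the six training rows are hypothetical, and a real pipeline would use a proper ML library and far more data.

```python
import math


def sigmoid(z: float) -> float:
    """Map a real-valued score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def churn_probability(features, weights, bias) -> float:
    """Predicted probability that a customer with these features churns."""
    return sigmoid(bias + sum(w * x for w, x in zip(weights, features)))


def train_logistic(rows, labels, lr=0.1, epochs=2000):
    """Fit weights and bias by batch gradient descent on the log loss."""
    n = len(rows)
    weights = [0.0] * len(rows[0])
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * len(weights)
        grad_b = 0.0
        for x, y in zip(rows, labels):
            err = churn_probability(x, weights, bias) - y
            for j, xj in enumerate(x):
                grad_w[j] += err * xj
            grad_b += err
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


# Hypothetical toy data: [tenure_years, support_tickets]; label 1 = churned.
history = [[0.5, 4], [1.0, 5], [0.8, 3], [4.0, 0], [5.0, 1], [6.0, 0]]
churned = [1, 1, 1, 0, 0, 0]

weights, bias = train_logistic(history, churned)
at_risk = churn_probability([0.5, 5], weights, bias)   # short tenure, many tickets
loyal = churn_probability([6.0, 0], weights, bias)     # long tenure, no tickets
```

Customers can then be ranked by predicted probability so that retention efforts go to those most likely to leave.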
To the demo we go!
The Full ML Lifecycle
To learn more:
We’ll be releasing a series of blogs on MLOps and ML Engineering throughout 2021:
● The Need for Data-centric ML Platforms
● Selecting Technologies and Platforms for Data Science and Machine Learning
● Model and Data Monitoring on Databricks
● … and more
Check out the other MLOps talks at DAIS!
Thank you!