ADVANCED SPARK
AND TENSORFLOW MEETUP
Deploy Spark and TensorFlow Models:
From Notebook Dev to Microservice Prod
London - November 15, 2016
Thank You, PipelineIO, Rise, Barclays!
http://pipeline.io
MEETUP AGENDA
• Meetup Updates, Metrics, and Announcements
• Spark Updates
• TensorFlow Updates
• Deploying and Scaling ML Models from
Notebook to Production with PipelineIO
(Chris Fregly, PipelineIO)
WHO AM I?
Chris Fregly
--------
Research Scientist @ PipelineIO
(Formerly Netflix and Databricks)
--------
http://pipeline.io
WHO ARE YOU?
-- Techies of London --
FUN WORKSHOP LAST MONTH
(HERE IN LONDON)
MEETUP METRICS
• 10,000+ Members Globally in 1 Year!
• 5,000 Members in San Francisco (Home)
• GitHub Repo: 900 Stars, 300 Forks
• DockerHub Repo: 6,200 Pulls
APACHE SPARK UPDATE
SPARK SUMMIT EU 2016 (BRUSSELS)
• https://spark-summit.org/eu-2016/schedule/
• Structured Streaming (2 talks)
• ML + Structured Streaming (1 talk)
• ML Model Deployment (2 talks)
• Spark Performance (4 talks)
TENSORFLOW UPDATE
TENSORFLOW WORKSHOP
• Great Workshop From Google Directly
• Finally, training involving more than MNIST!
• Slides
• http://bit.ly/tf-workshop-slides
• Workshop
• https://github.com/amygdala/tensorflow-workshop
WHAT IS PIPELINE.IO?
Extending Your ML Pipelines into Production
100% Open Source!
http://pipeline.io
BRAINSTORMING AND VALIDATING
• Major Gaming Company
• Large Ride Sharing Service
• Popular Q & A Site
• Online Clothing Retailer
• Dominant Video Streaming
PIPELINE.IO FOCUS
• Model Deploying and Testing
• Model Scaling and Serving
• Online Model Training
• Dynamic Model Optimizing
MODEL DEPLOYING AND TESTING
Continuously Test
and Deploy Models
in Production!
MODEL SCALING AND SERVING
ONLINE MODEL TRAINING
• Continuous, Incremental, and Partial Training
• Kafka + Spark Streaming + Spark ML
• Real-time, Dynamic Recommendations
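The idea of continuous, incremental training can be sketched without any cluster. The example below is a minimal illustration only, not PipelineIO code: a plain Python generator (`make_stream`, a hypothetical name) stands in for the Kafka topic and Spark Streaming micro-batches, and a hand-written SGD step (`partial_fit`, also hypothetical) stands in for a Spark ML estimator. The synthetic target function and learning rate are assumptions chosen for the demonstration.

```python
import random


def partial_fit(weights, bias, batch, learning_rate=0.05):
    """Apply one SGD pass over a mini-batch and return the updated parameters."""
    for features, label in batch:
        prediction = sum(w * x for w, x in zip(weights, features)) + bias
        error = prediction - label
        # Gradient step for squared error on a single example.
        weights = [w - learning_rate * error * x for w, x in zip(weights, features)]
        bias -= learning_rate * error
    return weights, bias


def predict(weights, bias, features):
    """Score one example with the current model."""
    return sum(w * x for w, x in zip(weights, features)) + bias


def make_stream(num_batches=200, batch_size=8, seed=42):
    """Yield synthetic mini-batches drawn from y = 2*x1 - 1*x2 + 0.5.

    Stand-in for micro-batches that would arrive from a message queue.
    """
    rng = random.Random(seed)
    for _ in range(num_batches):
        batch = []
        for _ in range(batch_size):
            x1, x2 = rng.uniform(-1, 1), rng.uniform(-1, 1)
            batch.append(((x1, x2), 2.0 * x1 - 1.0 * x2 + 0.5))
        yield batch


# The model is never retrained from scratch: each arriving batch
# only nudges the existing parameters.
weights, bias = [0.0, 0.0], 0.0
for batch in make_stream():
    weights, bias = partial_fit(weights, bias, batch)
```

Because the model is usable after every batch, `predict` can serve requests while training continues, which is the property that makes real-time recommendations possible.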
DYNAMIC MODEL OPTIMIZING
Generate Optimized
Code from Spark ML!
BECOME A CONTRIBUTOR!
PIPELINE.IO PLAN FOR 2017
• Performance
• Code Generation: CPU and GPU
• Continued Global Expansion
MORE WORKSHOPS IN 2017
WE’RE HIRING!!
• Kafka, Spark ML, and TensorFlow Contributors
• Systems Engineers
• GPU/CUDA Engineers
• C++, Java, Scala, Python
WE ONLY HIRE
NICE PEOPLE!!
DEMO!
Circuit Breakers and Request Batching
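For readers who were not at the demo, the two patterns can be sketched in a few lines. This is a simplified illustration under stated assumptions, not the code shown on stage: the class `CircuitBreaker`, the helper `batch_requests`, the thresholds, and the injectable `clock` are all hypothetical names and values.

```python
import time

_NO_FALLBACK = object()  # sentinel so that None can be a valid fallback value


class CircuitOpenError(Exception):
    """Raised when the breaker is open and no fallback was supplied."""


class CircuitBreaker:
    """Fail fast after repeated errors, then retry once the timeout elapses."""

    def __init__(self, max_failures=3, reset_timeout=30.0, clock=time.monotonic):
        self.max_failures = max_failures
        self.reset_timeout = reset_timeout
        self.clock = clock
        self.failures = 0
        self.opened_at = None  # None means the circuit is closed

    def call(self, func, *args, fallback=_NO_FALLBACK):
        if self.opened_at is not None:
            if self.clock() - self.opened_at < self.reset_timeout:
                # Open: skip the backend entirely.
                if fallback is not _NO_FALLBACK:
                    return fallback
                raise CircuitOpenError("circuit is open")
            # Timeout elapsed: half-open, let one trial call through.
        try:
            result = func(*args)
        except Exception:
            self.failures += 1
            if self.failures >= self.max_failures or self.opened_at is not None:
                self.opened_at = self.clock()  # open (or re-open) the circuit
            if fallback is not _NO_FALLBACK:
                return fallback
            raise
        # Success closes the circuit and clears the failure count.
        self.failures = 0
        self.opened_at = None
        return result


def batch_requests(requests, max_batch_size):
    """Group individual requests so the model is invoked once per batch."""
    return [requests[i:i + max_batch_size]
            for i in range(0, len(requests), max_batch_size)]


# Usage with a fake clock and a backend that always fails.
backend_calls = []


def flaky_model_server(features):
    backend_calls.append(features)
    raise ConnectionError("model server unavailable")


fake_now = [0.0]
breaker = CircuitBreaker(max_failures=2, reset_timeout=10.0,
                         clock=lambda: fake_now[0])
results = [breaker.call(flaky_model_server, [1.0], fallback=0.0)
           for _ in range(3)]

batches = batch_requests(list(range(5)), max_batch_size=2)
```

After two failures the third request never reaches the backend; it gets the fallback prediction immediately. Production libraries add per-window error rates, metrics, and thread isolation on top of this basic state machine.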
DEMO!
Deploy Spark ML DecisionTree to Production
Deploy to Cloud
or On-Premise!
DEMO!
Dynamic Code Generation of DecisionTree
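The general technique behind this demo can be illustrated as follows. This is a toy sketch, not PipelineIO's generator: it assumes the trained tree has already been exported to nested dictionaries (the keys `feature`, `threshold`, `left`, `right`, and `prediction` are hypothetical), and it emits Python source rather than optimized CPU or GPU code. It uses the convention that a sample goes left when the feature value is less than or equal to the threshold.

```python
def generate_predict_source(node, indent=1):
    """Recursively turn a tree of nested dicts into if/else source lines."""
    pad = "    " * indent
    if "prediction" in node:
        return [f"{pad}return {node['prediction']!r}"]
    lines = [f"{pad}if features[{node['feature']}] <= {node['threshold']!r}:"]
    lines += generate_predict_source(node["left"], indent + 1)
    lines.append(f"{pad}else:")
    lines += generate_predict_source(node["right"], indent + 1)
    return lines


def compile_tree(tree):
    """Generate source for the tree, compile it, and return (function, source)."""
    source = "\n".join(["def predict(features):"] + generate_predict_source(tree))
    namespace = {}
    exec(source, namespace)  # acceptable here because we generated the source
    return namespace["predict"], source


def interpret(node, features):
    """Reference implementation: walk the tree structure at prediction time."""
    while "prediction" not in node:
        go_left = features[node["feature"]] <= node["threshold"]
        node = node["left"] if go_left else node["right"]
    return node["prediction"]


# A small hand-written tree standing in for an exported model.
tree = {
    "feature": 0, "threshold": 0.5,
    "left": {"prediction": 0.0},
    "right": {
        "feature": 1, "threshold": 2.0,
        "left": {"prediction": 1.0},
        "right": {"prediction": 2.0},
    },
}

predict, source = compile_tree(tree)
print(source)
```

The generated function contains only comparisons and returns, with no dictionary lookups or loop over nodes, which is why compiling a model into straight-line code can be faster than interpreting the tree at serving time.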
THANK YOU!!
http://pipeline.io

Advanced Spark and Tensorflow Meetup - London - Nov 15, 2016 - Deploy Spark ML and Tensorflow AI Models from Dev Notebooks to Prod Microservices