SlideShare a Scribd company logo
Heterogeneous Workflows with
Spark at Netflix
0
Antony Arokiasamy | Kedar Sadekar | Personalization Infrastructure
1
Help members find content to watch and enjoy to
maximize member satisfaction and retention
Everything is a Recommendation
2
Recommendations
are driven by
Machine Learning
Ranking
Rows
Machine Learning Pipeline
3
User Selection
Feature
Generation
Model
Validation
Publish
Model
Model
Training
Machine Learning Pipeline Challenges
4
• Innovation
• Heterogeneous Environments
• Spark
• Native Support
• Separate Orchestration and Execution
• Multi Tenancy
• Machine Learning Constructs
• Parameter Sweep – 30k Dockers
Meson Workflow System
5
• General Purpose Workflow Orchestration and Scheduling framework
• Delegates execution to resource managers like Mesos
• Optimized for Machine Learning Pipelines and Visualization
• Checkout the Blog
• http://bit.ly/mesonws or techblog.netflix.com
• Plan to Open Sourced soon
Meson Architecture
6
Standard and Custom Step Types
7
Parameter Passing
8
Hive Query User DataSet Regional DataSet
Global DataSet
Get Users
Regional Model
Global Model
User DataSet
Wrangle Data
Structured Constructs
9
Top Down or Bottom Up
10
Two Way Communication
11
Spark Step
12
Artifacts
13
• Step outputs tracked as Artifacts
• Visualization
• Memoization
Multi Tenancy
14
• Resource Attributes • spark.cores.max
• spark.executor.memory
• spark.mesos.constraints
• Dynamic Resource Allocation
Cluster Management
15
• Red-Black software updates
• Scale up/Scale down
Meson/Spark Cluster
16
• 100s of Concurrent Jobs
• 700 Nodes
• 5000 Cores
• 25 TB Memory
• Apps: Meson Workflow System, Spark and Dockers
• Few smaller clusters
17
Antony Arokiasamy Kedar Sadekar
@aasamy
/aasamy
aarokiasamy@netflix.com
@kedar_sadekar
/kedar-sadekar
ksadekar@netflix.com

More Related Content

PPTX
Meson: Building a Machine Learning Orchestration Framework on Mesos
Antony Arokiasamy
 
PPTX
Microservices in Azure
Doug Vanderweide
 
PPTX
MVC 6 - the new unified Web programming model
Alex Thissen
 
PDF
Infrastructure as Code for Azure: ARM or Terraform?
Katherine Golovinova
 
PPTX
JavaEE Microservices platforms
Payara
 
PDF
"In love with Open Source : Past, Present and Future" : Keynote OSDConf 2014
Piyush Kumar
 
PDF
Securing Containers From Day One | null Ahmedabad Meetup
Kumar Ashwin
 
PPTX
Cloud Compiler
Being Topper
 
Meson: Building a Machine Learning Orchestration Framework on Mesos
Antony Arokiasamy
 
Microservices in Azure
Doug Vanderweide
 
MVC 6 - the new unified Web programming model
Alex Thissen
 
Infrastructure as Code for Azure: ARM or Terraform?
Katherine Golovinova
 
JavaEE Microservices platforms
Payara
 
"In love with Open Source : Past, Present and Future" : Keynote OSDConf 2014
Piyush Kumar
 
Securing Containers From Day One | null Ahmedabad Meetup
Kumar Ashwin
 
Cloud Compiler
Being Topper
 

What's hot (20)

PDF
Monitor Micro-service with MicroProfile metrics
Rudy De Busscher
 
PDF
Azure Days 2019: Infrastructure as Code auf Azure (Jonas Wanninger & Daniel H...
Trivadis
 
PPTX
Exploring Microservices in a Microsoft Landscape
Alex Thissen
 
PDF
Cloud infrastructure on Apache Mesos
Ahmed Bacha
 
PDF
Cnam azure ze cloud resource manager
Aymeric Weinbach
 
PPTX
ServerTemplates - The Next Generation
RightScale
 
PDF
Breaking the Monolith: Organizing Your Team to Embrace Microservices
Paul Osman
 
PPTX
Azure PaaS (WebApp & SQL Database) workshop solution
Gelis Wu
 
PDF
Jelastic Enterprise
Julio Pari
 
PDF
Scale your docker containers with Mesos
Timothy Chen
 
PDF
WSO2Con USA 2017: Scalable Real-time Complex Event Processing at Uber
WSO2
 
PDF
Google app engine
esmaillhasanzadeh1
 
PPTX
Cloud patterns
Nicolas De Loof
 
PPT
Scalability using Node.js
ratankadam
 
PDF
Must Know Azure Kubernetes Best Practices And Features For Better Resiliency ...
CodeOps Technologies LLP
 
PPTX
Cnam azure 2015 storage
Aymeric Weinbach
 
PPTX
Azure sql introduction
ManishK55
 
PPT
Dashboard project.
ratankadam
 
PDF
Online Cloud Based Compilers System
uday chauhan
 
PPTX
AKS
girish goudar
 
Monitor Micro-service with MicroProfile metrics
Rudy De Busscher
 
Azure Days 2019: Infrastructure as Code auf Azure (Jonas Wanninger & Daniel H...
Trivadis
 
Exploring Microservices in a Microsoft Landscape
Alex Thissen
 
Cloud infrastructure on Apache Mesos
Ahmed Bacha
 
Cnam azure ze cloud resource manager
Aymeric Weinbach
 
ServerTemplates - The Next Generation
RightScale
 
Breaking the Monolith: Organizing Your Team to Embrace Microservices
Paul Osman
 
Azure PaaS (WebApp & SQL Database) workshop solution
Gelis Wu
 
Jelastic Enterprise
Julio Pari
 
Scale your docker containers with Mesos
Timothy Chen
 
WSO2Con USA 2017: Scalable Real-time Complex Event Processing at Uber
WSO2
 
Google app engine
esmaillhasanzadeh1
 
Cloud patterns
Nicolas De Loof
 
Scalability using Node.js
ratankadam
 
Must Know Azure Kubernetes Best Practices And Features For Better Resiliency ...
CodeOps Technologies LLP
 
Cnam azure 2015 storage
Aymeric Weinbach
 
Azure sql introduction
ManishK55
 
Dashboard project.
ratankadam
 
Online Cloud Based Compilers System
uday chauhan
 
Ad

Viewers also liked (20)

PPTX
Big Data Ingestion @ Flipkart Data Platform
Navneet Gupta
 
PPTX
Mesos sys adminday
Javier Cortejoso de Andrés
 
PDF
Open source data ingestion
Treasure Data, Inc.
 
PDF
Creating a Mesos python framework
Olivier Sallou
 
PPTX
Gobblin: Unifying Data Ingestion for Hadoop
Yinan Li
 
PPTX
High Speed Continuous & Reliable Data Ingest into Hadoop
DataWorks Summit
 
PPTX
Mesos framework API v1
Mesosphere Inc.
 
PDF
Heterogeneous Workflows With Spark At Netflix
Jen Aman
 
PPTX
DC/OS: The definitive platform for modern apps
Datio Big Data
 
PPTX
Data Ingestion, Extraction & Parsing on Hadoop
skaluska
 
PDF
Deploying Docker Containers at Scale with Mesos and Marathon
Discover Pinterest
 
PPTX
Spark Summit East 2017: Apache spark and object stores
Steve Loughran
 
PDF
TDC2016POA | Trilha Infraestrutura - Apache Mesos & Marathon: gerenciando rem...
tdc-globalcode
 
PDF
Use of Spark MLib for Predicting the Offlining of Digital Media-(Christopher ...
Spark Summit
 
PDF
Container Orchestration Wars (Micro Edition)
Karl Isenberg
 
PDF
CI/CD with Docker, DC/OS, and Jenkins
Karl Isenberg
 
PDF
Mesos introduction
haosdent huang
 
PDF
MesosCon EU - HTTP API Framework
Marco Massenzio
 
PDF
Piloter un loadbalancer pour exposer les microservoces de mon cluster Mesos/M...
Kodo Kojo
 
PDF
Machine Learning Pipelines
jeykottalam
 
Big Data Ingestion @ Flipkart Data Platform
Navneet Gupta
 
Mesos sys adminday
Javier Cortejoso de Andrés
 
Open source data ingestion
Treasure Data, Inc.
 
Creating a Mesos python framework
Olivier Sallou
 
Gobblin: Unifying Data Ingestion for Hadoop
Yinan Li
 
High Speed Continuous & Reliable Data Ingest into Hadoop
DataWorks Summit
 
Mesos framework API v1
Mesosphere Inc.
 
Heterogeneous Workflows With Spark At Netflix
Jen Aman
 
DC/OS: The definitive platform for modern apps
Datio Big Data
 
Data Ingestion, Extraction & Parsing on Hadoop
skaluska
 
Deploying Docker Containers at Scale with Mesos and Marathon
Discover Pinterest
 
Spark Summit East 2017: Apache spark and object stores
Steve Loughran
 
TDC2016POA | Trilha Infraestrutura - Apache Mesos & Marathon: gerenciando rem...
tdc-globalcode
 
Use of Spark MLib for Predicting the Offlining of Digital Media-(Christopher ...
Spark Summit
 
Container Orchestration Wars (Micro Edition)
Karl Isenberg
 
CI/CD with Docker, DC/OS, and Jenkins
Karl Isenberg
 
Mesos introduction
haosdent huang
 
MesosCon EU - HTTP API Framework
Marco Massenzio
 
Piloter un loadbalancer pour exposer les microservoces de mon cluster Mesos/M...
Kodo Kojo
 
Machine Learning Pipelines
jeykottalam
 
Ad

Similar to Meson: Heterogeneous Workflows with Spark at Netflix (20)

PDF
Apereo OAE - Bootcamp
Nicolaas Matthijs
 
PPTX
Geek Sync | Deployment and Management of Complex Azure Environments
IDERA Software
 
PDF
Software design with Domain-driven design
Allan Mangune
 
PPTX
Serverless spark
MamathaBusi
 
PPTX
Cross Site Collection Navigation using SPFx, Powershell PnP & PnP-JS
Thomas Daly
 
PPTX
ASP.NET MVC - Latest & Greatest So Far
Lohith Goudagere Nagaraj
 
PPTX
Electricity_Monitoring_Presentation.pptx
HarshSingh268350
 
PPTX
Play Architecture, Implementation, Shiny Objects, and a Proposal
Mike Slinn
 
PPTX
Cross Site Collection Navigation
Thomas Daly
 
PPTX
Portal and Intranets
Redar Ismail
 
PPTX
Cross Site Collection Navigation with SPFX, PowerShell PnP, PnP-JS, Office UI
Thomas Daly
 
PDF
Apereo OAE - Architectural overview
Nicolaas Matthijs
 
PPTX
Getting started with SparkSQL - Desert Code Camp 2016
clairvoyantllc
 
PPTX
Webpage mining services
najahfasihah
 
PPTX
Sitecore9 key features by jitendra soni - Presented in Sitecore User Group UK
Jitendra Soni
 
PDF
Application of Library Management Software: NewGenLib
David Nzoputa Ofili
 
PPTX
Microservices in the Enterprise
Jesus Rodriguez
 
PPTX
EBSCO Digital Transformation with AWS
Kenzan
 
PPTX
Exposing services with Azure API Management
Callon Campbell
 
PDF
F8 tech talk_pinterest_v4
malorie_pinterest
 
Apereo OAE - Bootcamp
Nicolaas Matthijs
 
Geek Sync | Deployment and Management of Complex Azure Environments
IDERA Software
 
Software design with Domain-driven design
Allan Mangune
 
Serverless spark
MamathaBusi
 
Cross Site Collection Navigation using SPFx, Powershell PnP & PnP-JS
Thomas Daly
 
ASP.NET MVC - Latest & Greatest So Far
Lohith Goudagere Nagaraj
 
Electricity_Monitoring_Presentation.pptx
HarshSingh268350
 
Play Architecture, Implementation, Shiny Objects, and a Proposal
Mike Slinn
 
Cross Site Collection Navigation
Thomas Daly
 
Portal and Intranets
Redar Ismail
 
Cross Site Collection Navigation with SPFX, PowerShell PnP, PnP-JS, Office UI
Thomas Daly
 
Apereo OAE - Architectural overview
Nicolaas Matthijs
 
Getting started with SparkSQL - Desert Code Camp 2016
clairvoyantllc
 
Webpage mining services
najahfasihah
 
Sitecore9 key features by jitendra soni - Presented in Sitecore User Group UK
Jitendra Soni
 
Application of Library Management Software: NewGenLib
David Nzoputa Ofili
 
Microservices in the Enterprise
Jesus Rodriguez
 
EBSCO Digital Transformation with AWS
Kenzan
 
Exposing services with Azure API Management
Callon Campbell
 
F8 tech talk_pinterest_v4
malorie_pinterest
 

Recently uploaded (20)

PPTX
Introduction to Data Analytics and Data Science
KavithaCIT
 
PPTX
White Blue Simple Modern Enhancing Sales Strategy Presentation_20250724_21093...
RamNeymarjr
 
PPTX
Presentation on animal welfare a good topic
kidscream385
 
PPTX
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
PPTX
short term internship project on Data visualization
JMJCollegeComputerde
 
PPTX
Fluvial_Civilizations_Presentation (1).pptx
alisslovemendoza7
 
PDF
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
PDF
Mastering Financial Analysis Materials.pdf
SalamiAbdullahi
 
PPT
Real Life Application of Set theory, Relations and Functions
manavparmar205
 
PPTX
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
PPT
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
PDF
D9110.pdfdsfvsdfvsdfvsdfvfvfsvfsvffsdfvsdfvsd
minhn6673
 
PPTX
Databricks-DE-Associate Certification Questions-june-2024.pptx
pedelli41
 
PDF
blockchain123456789012345678901234567890
tanvikhunt1003
 
PPTX
Data Security Breach: Immediate Action Plan
varmabhuvan266
 
PDF
Key_Statistical_Techniques_in_Analytics_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PDF
Blitz Campinas - Dia 24 de maio - Piettro.pdf
fabigreek
 
PPTX
IP_Journal_Articles_2025IP_Journal_Articles_2025
mishell212144
 
PPTX
Introduction to Biostatistics Presentation.pptx
AtemJoshua
 
PDF
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
Introduction to Data Analytics and Data Science
KavithaCIT
 
White Blue Simple Modern Enhancing Sales Strategy Presentation_20250724_21093...
RamNeymarjr
 
Presentation on animal welfare a good topic
kidscream385
 
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
short term internship project on Data visualization
JMJCollegeComputerde
 
Fluvial_Civilizations_Presentation (1).pptx
alisslovemendoza7
 
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
Mastering Financial Analysis Materials.pdf
SalamiAbdullahi
 
Real Life Application of Set theory, Relations and Functions
manavparmar205
 
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
D9110.pdfdsfvsdfvsdfvsdfvfvfsvfsvffsdfvsdfvsd
minhn6673
 
Databricks-DE-Associate Certification Questions-june-2024.pptx
pedelli41
 
blockchain123456789012345678901234567890
tanvikhunt1003
 
Data Security Breach: Immediate Action Plan
varmabhuvan266
 
Key_Statistical_Techniques_in_Analytics_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
Blitz Campinas - Dia 24 de maio - Piettro.pdf
fabigreek
 
IP_Journal_Articles_2025IP_Journal_Articles_2025
mishell212144
 
Introduction to Biostatistics Presentation.pptx
AtemJoshua
 
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 

Meson: Heterogeneous Workflows with Spark at Netflix

Editor's Notes

  • #7: We build Meson…