SlideShare a Scribd company logo
How “Stranger Things” can
happen with Visual
Analytics
Jason Flittner
Senior Analytics Engineer / Manager
Netflix - Content Data Engineering and Analytics
#NetflixData
● About Netflix
● Tableau + Big Data
○ Lessons Learned
○ Where we are today
● Analytics and Iterating Quickly
What is Netflix?
Netflix Big Data Paris 2017
Netflix Big Data Paris 2017
● 93+ million members
● 190 countries
● 1,000+ devices
● 10B hours/qtr
We plan on spending ~$6B in 2017 on
content for our members
Metrics
● ~60 PB DW on S3
● ~1400 Tableau users
● Live & extract connections
● Analytics on billions of rows
(Hadoop
clusters)
Storage Compute Data Interface Data Access, Analytics and Visualization
AWS
S3
● About Netflix
● Tableau + Big Data
○ Lessons Learned
○ Where we are today
● Analytics and Iterating Quickly
Choosing a source
● Hive
● Spark
● Presto
● Redshift
● Published Data Source
● etc...
● Powerful and scalable
backend
● “Slower” 1,000,000,000/hr
● Hive + Tableau
○ Thrift Servers
○ Custom SQL vs Tables
○ Metadata
○ ODBC Optimization
● Scalable
● Faster than Hive in many
cases
● Spark + Tableau
○ Thrift Servers
○ Long running job on
Cluster
○ Query reliability
● Fast query engine
● Great for experimenting and
“smaller” data sets
● Connecting to Tableau
○ Web data connector
○ ODBC
● About Netflix
● Tableau + Big Data
○ Lessons Learned
○ Where we are today
● Analytics and Iterating Quickly
Tableau Data
Extract Publish to Server
Tableau Extract
API
Create Tableau Data ExtractProvision Container ResourceIssues Command Create
Extract
Publish to Server
Distributed Tableau Extract API
● Very fast loads from S3
● Native Tableau connector
● Quick Tableau Iteration
● Live or Extract
● Concurrency
Amazon
Redshift
BIG Data ● Too big to extract?
● Optimized live connections
○ SQL
● Custom data viz with Druid
● Tableau + Hyper!?
● About Netflix
● Tableau + Big Data
○ Lessons Learned
○ Where we are today
● Analytics and Iterating Quickly
Business
users
Analytics
Engineer
Analytics:
● Binge Analysis
● Viewing Patterns
● Hours Viewed
● Customer Joy
● Content Quality
Bringing it all
together
● Content analytics
● Iterate quickly
● Move between backend sources
● Strong user adoption
Netflix Big Data Paris 2017
Netflix Big Data Paris 2017
Netflix Big Data Paris 2017
Merci
Thank you
Jason Flittner -

More Related Content

PDF
Cloud Connect 2012, Big Data @ Netflix
Jerome Boulon
 
PPTX
The evolution of the big data platform @ Netflix (OSCON 2015)
Eva Tse
 
PDF
A unified analytics platform with Kafka and Flink | Stephan Ewen, Ververica
HostedbyConfluent
 
PDF
The Netflix data platform: Now and in the future by Kurt Brown
Data Con LA
 
PDF
Real Time Data Infrastructure team overview
Monal Daxini
 
PDF
Introducing the Hub for Data Orchestration
Alluxio, Inc.
 
PDF
Streaming Data in the Cloud with Confluent and MongoDB Atlas | Robert Walters...
HostedbyConfluent
 
PDF
Event & Data Mesh as a Service: Industrializing Microservices in the Enterpri...
HostedbyConfluent
 
Cloud Connect 2012, Big Data @ Netflix
Jerome Boulon
 
The evolution of the big data platform @ Netflix (OSCON 2015)
Eva Tse
 
A unified analytics platform with Kafka and Flink | Stephan Ewen, Ververica
HostedbyConfluent
 
The Netflix data platform: Now and in the future by Kurt Brown
Data Con LA
 
Real Time Data Infrastructure team overview
Monal Daxini
 
Introducing the Hub for Data Orchestration
Alluxio, Inc.
 
Streaming Data in the Cloud with Confluent and MongoDB Atlas | Robert Walters...
HostedbyConfluent
 
Event & Data Mesh as a Service: Industrializing Microservices in the Enterpri...
HostedbyConfluent
 

What's hot (20)

PDF
AWS Re-Invent 2017 Netflix Keystone SPaaS - Monal Daxini - Abd320 2017
Monal Daxini
 
PPTX
Watching Pigs Fly with the Netflix Hadoop Toolkit (Hadoop Summit 2013)
Jeff Magnusson
 
PPTX
Streaming data in the cloud with Confluent and MongoDB Atlas | Robert Waters,...
HostedbyConfluent
 
PPTX
Netflix incloudsmarch8 2011forwiki
Kevin McEntee
 
PDF
R, Spark, Tensorflow, H20.ai Applied to Streaming Analytics
Kai Wähner
 
PDF
Should You Read Kafka as a Stream or in Batch? Should You Even Care? | Ido Na...
HostedbyConfluent
 
PDF
DataOps Automation for a Kafka Streaming Platform (Andrew Stevenson + Spiros ...
HostedbyConfluent
 
PPTX
Getting It Right Exactly Once: Principles for Streaming Architectures
SingleStore
 
PDF
Análisis del roadmap del Elastic Stack
Elasticsearch
 
PPTX
Data Warehousing Patterns for Hadoop
Michelle Ufford
 
PDF
Keynote: Jay Kreps, Confluent | Kafka ♥ Cloud | Kafka Summit 2020
confluent
 
PDF
Hybrid Kafka, Taking Real-time Analytics to the Business (Cody Irwin, Google ...
HostedbyConfluent
 
PDF
Spark at Airbnb
Hao Wang
 
PDF
Using Hazelcast in the Kappa architecture
Oliver Buckley-Salmon
 
PDF
Cornami Accelerates Performance on SPARK: Spark Summit East talk by Paul Master
Spark Summit
 
PDF
Visualizing AutoTrader Traffic in Near Real-Time with Spark Streaming-(Jon Gr...
Spark Summit
 
PDF
Big problems Big Data, simple solutions
Claudio Pontili
 
PDF
Presto: Fast SQL-on-Anything (including Delta Lake, Snowflake, Elasticsearch ...
Databricks
 
PPTX
SQL Server on Google Cloud Platform
Lynn Langit
 
PDF
Winning the On-Demand Economy with Spark and Predictive Analytics
SingleStore
 
AWS Re-Invent 2017 Netflix Keystone SPaaS - Monal Daxini - Abd320 2017
Monal Daxini
 
Watching Pigs Fly with the Netflix Hadoop Toolkit (Hadoop Summit 2013)
Jeff Magnusson
 
Streaming data in the cloud with Confluent and MongoDB Atlas | Robert Waters,...
HostedbyConfluent
 
Netflix incloudsmarch8 2011forwiki
Kevin McEntee
 
R, Spark, Tensorflow, H20.ai Applied to Streaming Analytics
Kai Wähner
 
Should You Read Kafka as a Stream or in Batch? Should You Even Care? | Ido Na...
HostedbyConfluent
 
DataOps Automation for a Kafka Streaming Platform (Andrew Stevenson + Spiros ...
HostedbyConfluent
 
Getting It Right Exactly Once: Principles for Streaming Architectures
SingleStore
 
Análisis del roadmap del Elastic Stack
Elasticsearch
 
Data Warehousing Patterns for Hadoop
Michelle Ufford
 
Keynote: Jay Kreps, Confluent | Kafka ♥ Cloud | Kafka Summit 2020
confluent
 
Hybrid Kafka, Taking Real-time Analytics to the Business (Cody Irwin, Google ...
HostedbyConfluent
 
Spark at Airbnb
Hao Wang
 
Using Hazelcast in the Kappa architecture
Oliver Buckley-Salmon
 
Cornami Accelerates Performance on SPARK: Spark Summit East talk by Paul Master
Spark Summit
 
Visualizing AutoTrader Traffic in Near Real-Time with Spark Streaming-(Jon Gr...
Spark Summit
 
Big problems Big Data, simple solutions
Claudio Pontili
 
Presto: Fast SQL-on-Anything (including Delta Lake, Snowflake, Elasticsearch ...
Databricks
 
SQL Server on Google Cloud Platform
Lynn Langit
 
Winning the On-Demand Economy with Spark and Predictive Analytics
SingleStore
 
Ad

Viewers also liked (20)

PDF
Design in Tech Report 2017
John Maeda
 
PDF
IBM Storage for Analytics, Cognitive and Cloud
Tony Pearson
 
PPTX
The Big Data TV: Data Analytics, Algorithm, and Netflix’s Original Programming
hye-jin-lee
 
PPTX
Netflix-Using analytics to predict hits
Gaurav Dutta
 
PPTX
Culture
Reed Hastings
 
PDF
Netflix - Enabling a Culture of Analytics
Blake Irvine
 
PDF
How to Become a Thought Leader in Your Niche
Leslie Samuel
 
PPT
Netflix Case Study
Kikuyu Daniels
 
DOC
Case Study Netflix
Christina Cecil
 
PDF
What is A Cloud Stack in 2017
Gaurav Roy
 
DOCX
Netflix Case Study
Julien Guitton
 
PDF
Cross-regional Application Deplolyment on AWS - Channy Yun (JAWS Days 2017)
Amazon Web Services Korea
 
PPTX
Big Data Analytics with Hadoop
Philippe Julio
 
PDF
Netflix marketing plan
Evelyne Otto
 
PDF
Europa AI startup scaleups report 2016
Ian Beckett
 
PDF
3 Things Every Sales Team Needs to Be Thinking About in 2017
Drift
 
PDF
Elastic Data Analytics Platform @Datadog
C4Media
 
PPTX
Netflix company presentation
klibanow
 
KEY
The Secrets of Building Realtime Big Data Systems
nathanmarz
 
PPTX
ITI-Presentation-netflix
Angela Chen
 
Design in Tech Report 2017
John Maeda
 
IBM Storage for Analytics, Cognitive and Cloud
Tony Pearson
 
The Big Data TV: Data Analytics, Algorithm, and Netflix’s Original Programming
hye-jin-lee
 
Netflix-Using analytics to predict hits
Gaurav Dutta
 
Culture
Reed Hastings
 
Netflix - Enabling a Culture of Analytics
Blake Irvine
 
How to Become a Thought Leader in Your Niche
Leslie Samuel
 
Netflix Case Study
Kikuyu Daniels
 
Case Study Netflix
Christina Cecil
 
What is A Cloud Stack in 2017
Gaurav Roy
 
Netflix Case Study
Julien Guitton
 
Cross-regional Application Deplolyment on AWS - Channy Yun (JAWS Days 2017)
Amazon Web Services Korea
 
Big Data Analytics with Hadoop
Philippe Julio
 
Netflix marketing plan
Evelyne Otto
 
Europa AI startup scaleups report 2016
Ian Beckett
 
3 Things Every Sales Team Needs to Be Thinking About in 2017
Drift
 
Elastic Data Analytics Platform @Datadog
C4Media
 
Netflix company presentation
klibanow
 
The Secrets of Building Realtime Big Data Systems
nathanmarz
 
ITI-Presentation-netflix
Angela Chen
 
Ad

Similar to Netflix Big Data Paris 2017 (20)

PDF
DATA @ NFLX (Tableau Conference 2014 Presentation)
Blake Irvine
 
PPTX
2013 DATA @ NFLX (Tableau User Group)
Albert Wong
 
PPTX
Creating a Culture of Data @ Facebook - TCCEU13
Andy Kriebel
 
PDF
Building a modern data platform in the cloud. AWS DevDay Nordics
javier ramirez
 
PPTX
Bigdatacooltools
suresh sood
 
PDF
The Evolving Landscape of Data Engineering
Andrei Savu
 
PPTX
Big Data Analytics: Finding diamonds in the rough with Azure
Christos Charmatzis
 
PDF
INF2190_W1_2016_public
Attila Barta
 
PDF
How Celtra Optimizes its Advertising Platform with Databricks
Grega Kespret
 
PDF
Building Data Lakes and Analytics on AWS. IPExpo Manchester.
javier ramirez
 
PPTX
Make your data fly - Building data platform in AWS
Kimmo Kantojärvi
 
PPTX
From raw data to business insights. A modern data lake
javier ramirez
 
PDF
PXL Data Engineering Workshop By Selligent
Jonny Daenen
 
PPTX
Big Data, Big Investment
GGV Capital
 
PPTX
Netflix Data Engineering @ Uber Engineering Meetup
Blake Irvine
 
PDF
Things you need to know about big data
Lantern Institute
 
PPTX
Data-Driven @ Netflix
Michelle Ufford
 
PPTX
2016 Tableau in the Cloud - A Netflix Original (AWS Re:invent)
Albert Wong
 
PDF
EDF2013: Big Data Tutorial: Marko Grobelnik
European Data Forum
 
PPTX
Big data analytics and machine intelligence v5.0
Amr Kamel Deklel
 
DATA @ NFLX (Tableau Conference 2014 Presentation)
Blake Irvine
 
2013 DATA @ NFLX (Tableau User Group)
Albert Wong
 
Creating a Culture of Data @ Facebook - TCCEU13
Andy Kriebel
 
Building a modern data platform in the cloud. AWS DevDay Nordics
javier ramirez
 
Bigdatacooltools
suresh sood
 
The Evolving Landscape of Data Engineering
Andrei Savu
 
Big Data Analytics: Finding diamonds in the rough with Azure
Christos Charmatzis
 
INF2190_W1_2016_public
Attila Barta
 
How Celtra Optimizes its Advertising Platform with Databricks
Grega Kespret
 
Building Data Lakes and Analytics on AWS. IPExpo Manchester.
javier ramirez
 
Make your data fly - Building data platform in AWS
Kimmo Kantojärvi
 
From raw data to business insights. A modern data lake
javier ramirez
 
PXL Data Engineering Workshop By Selligent
Jonny Daenen
 
Big Data, Big Investment
GGV Capital
 
Netflix Data Engineering @ Uber Engineering Meetup
Blake Irvine
 
Things you need to know about big data
Lantern Institute
 
Data-Driven @ Netflix
Michelle Ufford
 
2016 Tableau in the Cloud - A Netflix Original (AWS Re:invent)
Albert Wong
 
EDF2013: Big Data Tutorial: Marko Grobelnik
European Data Forum
 
Big data analytics and machine intelligence v5.0
Amr Kamel Deklel
 

Recently uploaded (20)

PDF
Software Development Methodologies in 2025
KodekX
 
PDF
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
PDF
Advances in Ultra High Voltage (UHV) Transmission and Distribution Systems.pdf
Nabajyoti Banik
 
PPTX
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
PDF
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
PDF
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
PDF
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
PDF
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
PPTX
IT Runs Better with ThousandEyes AI-driven Assurance
ThousandEyes
 
PDF
REPORT: Heating appliances market in Poland 2024
SPIUG
 
PDF
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
codernjn73
 
PDF
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
PPTX
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PDF
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
PDF
Cloud-Migration-Best-Practices-A-Practical-Guide-to-AWS-Azure-and-Google-Clou...
Artjoker Software Development Company
 
PDF
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
PDF
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
PPTX
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
Software Development Methodologies in 2025
KodekX
 
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
Advances in Ultra High Voltage (UHV) Transmission and Distribution Systems.pdf
Nabajyoti Banik
 
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
IT Runs Better with ThousandEyes AI-driven Assurance
ThousandEyes
 
REPORT: Heating appliances market in Poland 2024
SPIUG
 
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
codernjn73
 
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
Cloud-Migration-Best-Practices-A-Practical-Guide-to-AWS-Azure-and-Google-Clou...
Artjoker Software Development Company
 
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
cloud computing vai.pptx for the project
vaibhavdobariyal79
 

Netflix Big Data Paris 2017