SlideShare a Scribd company logo
Deep Learning
@
Scribd
kp
Data Science Salon: Deep Learning as a Product @ Scribd
Who is the talk for?
People interested in our experience using deep learning
Product side of deep learning
Not a technical talk on how to best utilize LSTM
cells/activation functions/networks architectures/GANS/etc
Motivation
Spelling correction to improve the user experience
Deep Learning as Product
Does the user even care?
What can Deep Learning even do?
Image Editing
Object Detection
Boundary Detection
Music Generation
Voice Generation
Translate Languages
This will be fun!
Style Transfer
- Product
- Visual
- Imperfect
- Visualize the errors
Classic Van Gogh Style Transfer Example
Source: http://ideate.xsead.cmu.edu/gallery/projects/art-transitions-style-transfers-by-neural-networks
https://www.theverge.com/2017/3/30/15124466/ai-photo-style-transfer-deep-neural-nets-adobe
Style Transfer
STYLE TRANSFER FTW - PRISMA
Base Foreground Transfer Heisenberg
Can it solve our problem?
If they can translate styles onto images...
Motivation
Spelling correction to improve the user experience
Spelling correction
Spelling correction is solved right? If you have enough data
Google is ok, we are not
They can take a probabilistic approach, good for them :P
We need to get fancier
Let’s get started
1. Read blog posts
2. Read books
3. Read tutorials
4. Watch videos
5. Deep Learn
Frameworks on frameworks
Data Science Salon: Deep Learning as a Product @ Scribd
Which one to pick?
We did a comprehensive framework bakeoff…
AND THE WINNER IS ?!?!
J/K
We picked Keras
- Abstraction layer (easier to get started)
- Familiarity with Python
Life as a Deep Learner
http://everythingbrilliant.co.uk/wp-content/uploads/2017/01/emotional-journey-of-creating-anything-new.jpeg
Architectures
RNN
CNN
Deep/Wide
One more thing...
GANS!!!
GANS eh?
BAD DOG GOOD DOG
Which architecture to use?
1. Read blog posts
2. Read books
3. Read tutorials
4. Watch Videos
5. Deep Learn
TL;DR
Certain problems require certain architectures
For example:
CNN -> Image parsing/Object Detection
Where are we?
What do we have? Keras (framework)
What do we need? Architecture/algorithm
Constraints? Avoid implementing our own from scratch
Can we do even do this?
Let’s code some boilerplate for a basic neural network
Data Science Salon: Deep Learning as a Product @ Scribd
Training
How long does model development take
TBD -> 1-2 weeks
How long to let the model train
7 days
Is more gpu moar better? Not yet...
So many parameters...
1. Read blog posts
2. Read books
3. Read tutorials
4. Watch videos
5. Deep Learn
Details (Hyperparameters)
Activation Units
Sigmoid, relu, prelu (yes, not even joking), lolu (actually joking)
Optimization Functions
Adam, sgd, etc, etc
Dropout Rate
Batch Size
Continues on forever...
Activation
Sigmoid RELU Leaky
RELU/PRELU
Let’s find our algorithm
1. Read blog posts
2. Read books
3. Read tutorials
4. Watch Videos
5. Deep Learn
Sequence to Sequence mapping
TL;DR
Given a sequence of things turn them into another sequence
Spelling Correction
‘Cande’ -> ‘candy’
English -> Spanish language translation
‘Hello’ -> ‘hola’
Nice viz of LSTM cells
Where are we?
1. We have a framework
2. We have an algorithm
3. Let’s find an example
Deep Spelling Example
Blog post on Medium
Github code
90%+ Accuracy!
ALL PROBLEMS SOLVED !!!
WRONG :(
Data Science Salon: Deep Learning as a Product @ Scribd
Data Science Salon: Deep Learning as a Product @ Scribd
Data science blog posts
Not reproducible
Proprietary data sets that are not shared
Incorrect accuracy metrics
Someone please solve this (Academic + Industry partnership
on peer reviewed + reproducible data science algorithms)
PIT OF DESPAIR
http://everythingbrilliant.co.uk/wp-content/uploads/2017/01/emotional-journey-of-creating-anything-new.jpeg
Let’s find a new approach
1. Read blog posts
2. Read books
3. Read tutorials
4. Watch videos
5. Deep Learn
What about seq2seq?
New approach
- Existing libraries/frameworks
- OpenNMT
- Open Source contribution
SUCCESS!!!
What’s edit distance?
Number of letters (operations) to replace to transform a given word into another
kitten -> sitting = edit distance of 3
1. k -> s (substitution)
2. e -> i (substitution)
3. _ -> g (insertion)
SHOW ME THE DISTANCE
https://gist.github.com/mrelich/36b5f37233026e828af6d63f6015554b#file-deep_spell_example-py
Production
Developed on AWS, but need to deploy on internally
We can’t use OpenNMT
Ops :(, they have their reasons but FML
What’s next?
Tensorflow to the rescue!!!
Seq2Seq in Tensorflow
Tensorflow is pain
Tensorflow is a collection of a code not a “library/framework”
Let’s figure out Tensorflow
1. Read blog posts
2. Read books
3. Read tutorials
4. Watch Videos
5. Deep Learn
Outcome
Added a week of development time...BUT
Simpler and Faster algorithm
Basis for production deployment
Data Science Salon: Deep Learning as a Product @ Scribd
Data Science Salon: Deep Learning as a Product @ Scribd
SO CLOSE!
We can properly correct spelling but…
We also have to guess partial words correctly
Query: ‘ub of the centry’
Corrected to: ‘club of the century’ ← Where we are
Correct query: ‘pub of the century’ ← Where we need to go
Dictionary matching FTW
Build a dictionary where the keys are phrases with 1 word dropped.
{ [joy luck]: “joy luck club”, [luck club]: “joy luck club”}
Does the user even care?
YES
Seq2Seq + Dictionary = 90%
How much time does it add?
● Seq2Seq model adds 15ms
● Dictionary lookup adds .2ms
Why do we hard things
Hard Problem (Spelling) -> Knowledge accumulation
New problem -> Query parsing
[‘john grisham pelican brief’] -> [author,author,title,title]
More Projects
Query Parsing (Authors/Titles/Topics/Series/etc)
Content Summarization (Books/Articles/Documents)
Churn Prediction
Document Classification (Study Guides/Court cases/etc)
Data Blog (coming in 2018)
Posts on:
Spelling Correction
Query Tagging
Multi-Armed Bandits
A/B Test System Infrastructure
Work on hard data problems?
Work at Scribd.
Questions
Or we can all leave early!!!
Appendix
https://imgflip.com/i/1woc8x https://imgflip.com/i/1wpxvp https://imgflip.com/i/1wpy1z
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
http://www.dogster.com/wp-content/uploads/2015/05/scolded-dog.jpg
http://bravodog.ca/wp-content/uploads/2016/11/Reward-training-Calling-your-dog-to-come-4.jpeg
https://www.skylinelabs.in/blog/images/tensorflow.jpg
https://deeplearning4j.org/assets/themes/thedocs/img/DL4J-LOGO-2.png
https://raw.githubusercontent.com/dmlc/dmlc.github.io/master/img/logo-m/mxnet2.png
https://valohai.com/static/img/support-logos/theano.svg
http://cs231n.github.io/neural-networks-1/

More Related Content

PPT
Generating Better Design Ideas Through Collaboration
Étienne Garbugli
 
PDF
Machine learning in a nutshell for dummies
Ning Zhou
 
PDF
Infrastructure as code might be literally impossible / Joe Domato (packageclo...
Ontico
 
ODP
2009 Eclipse Con
guest29922
 
ODP
Roundup presentation
mattgrommes
 
PDF
Python overview
Haroon Karim
 
PDF
Scottish Ruby Conference 2014
michaelag1971
 
PPTX
Tensorflow go
Patrick Walker
 
Generating Better Design Ideas Through Collaboration
Étienne Garbugli
 
Machine learning in a nutshell for dummies
Ning Zhou
 
Infrastructure as code might be literally impossible / Joe Domato (packageclo...
Ontico
 
2009 Eclipse Con
guest29922
 
Roundup presentation
mattgrommes
 
Python overview
Haroon Karim
 
Scottish Ruby Conference 2014
michaelag1971
 
Tensorflow go
Patrick Walker
 

Similar to Data Science Salon: Deep Learning as a Product @ Scribd (20)

KEY
Gearman and CodeIgniter
Erik Giberti
 
KEY
Work Queues
ciconf
 
PPT
What does OOP stand for?
Colin Riley
 
PPT
You shouldneverdo
daniil3
 
PDF
Infrastructure as code might be literally impossible
ice799
 
PDF
Bringing Change to Life | YOW 2016 | Melbourne, Brisbane, Sydney - Australia
Bill Scott
 
PPTX
Testing for the deeplearning folks
Vishwas N
 
PPTX
Pair Programming - Be the best pair you can be.
David Morgantini
 
PDF
TDD Walkthrough - Encryption
PeterKha2
 
PPTX
Prototype4Production Presented at FOSSASIA2015 at Singapore
Dhruv Gohil
 
PPTX
Recurrent Neural Networks for Text Analysis
odsc
 
PPTX
Introduction to deep learning
doppenhe
 
PDF
Infrastructure as code might be literally impossible part 2
ice799
 
PPTX
Jonathan Coveney: Why Pig?
mortardata
 
ODP
Is Python still production ready ? Ludovic Gasc
Pôle Systematic Paris-Region
 
PDF
The State of ML for iOS: On the Advent of WWDC 2018 🕯
Meghan Kane
 
PPTX
Introduction to deep learning
Zeynep Su Kurultay
 
PDF
DWX 2013 Nuremberg
Marcel Bruch
 
PDF
Atlassian: The latest and greatest - May/June 2013
Sven Peters
 
PPTX
3 blades webinar dec 12 deploying deep learning models to production
Greg Werner
 
Gearman and CodeIgniter
Erik Giberti
 
Work Queues
ciconf
 
What does OOP stand for?
Colin Riley
 
You shouldneverdo
daniil3
 
Infrastructure as code might be literally impossible
ice799
 
Bringing Change to Life | YOW 2016 | Melbourne, Brisbane, Sydney - Australia
Bill Scott
 
Testing for the deeplearning folks
Vishwas N
 
Pair Programming - Be the best pair you can be.
David Morgantini
 
TDD Walkthrough - Encryption
PeterKha2
 
Prototype4Production Presented at FOSSASIA2015 at Singapore
Dhruv Gohil
 
Recurrent Neural Networks for Text Analysis
odsc
 
Introduction to deep learning
doppenhe
 
Infrastructure as code might be literally impossible part 2
ice799
 
Jonathan Coveney: Why Pig?
mortardata
 
Is Python still production ready ? Ludovic Gasc
Pôle Systematic Paris-Region
 
The State of ML for iOS: On the Advent of WWDC 2018 🕯
Meghan Kane
 
Introduction to deep learning
Zeynep Su Kurultay
 
DWX 2013 Nuremberg
Marcel Bruch
 
Atlassian: The latest and greatest - May/June 2013
Sven Peters
 
3 blades webinar dec 12 deploying deep learning models to production
Greg Werner
 
Ad

More from Formulatedby (20)

PDF
Data Science Salon: An Experiment on Data Science Algorithms Enabled by a Pil...
Formulatedby
 
PDF
Data Science Salon: Are you sure you're an ethical technologist?: Build your ...
Formulatedby
 
PDF
Data Science Salon: In your own words: computing customer similarity from tex...
Formulatedby
 
PPTX
Data Science Salon: nterpretable Predictive Models in the Healthcare Domain
Formulatedby
 
PDF
Data Science Salon: Applications of Embeddings and Deep Learning at Groupon
Formulatedby
 
PDF
Data Science Salon: Kaggle 1st Place in 30 minutes: Putting AutoML to Work wi...
Formulatedby
 
PPTX
Data Science Salon: Smart Cities
Formulatedby
 
PDF
Data Science Salon: Building a Data Driven Product Mindset
Formulatedby
 
PPTX
Data Science Salon: Introduction to Machine Learning - Marketing Use Case
Formulatedby
 
PDF
Data Science Salon: Adopting Machine Learning to Drive Revenue and Market Share
Formulatedby
 
PDF
Data Science Salon: Data visualization and Analysis in the Florida Panthers H...
Formulatedby
 
PDF
Data Science Salon: Machine Learning for Personalized Cancer Vaccines
Formulatedby
 
PDF
Data Science Salon: Building a Data Science Culture
Formulatedby
 
PPTX
Data Science Salon: Digital Transformation: The Data Science Catalyst
Formulatedby
 
PDF
Data Science Salon: Quit Wasting Time – Case Studies in Production Machine Le...
Formulatedby
 
PDF
Data Science Salon: Enabling self-service predictive analytics at Bidtellect
Formulatedby
 
PPTX
Data Science Salon: MCL Clustering of Sparse Graphs
Formulatedby
 
PPTX
Data Science Salon: Applying Machine Learning to Modernize Business Processes
Formulatedby
 
PPTX
Data Science Salon: Building smart AI: How Deep Learning Can Get You Into Dee...
Formulatedby
 
PDF
Data Science Salon: The Age of Co-creation
Formulatedby
 
Data Science Salon: An Experiment on Data Science Algorithms Enabled by a Pil...
Formulatedby
 
Data Science Salon: Are you sure you're an ethical technologist?: Build your ...
Formulatedby
 
Data Science Salon: In your own words: computing customer similarity from tex...
Formulatedby
 
Data Science Salon: nterpretable Predictive Models in the Healthcare Domain
Formulatedby
 
Data Science Salon: Applications of Embeddings and Deep Learning at Groupon
Formulatedby
 
Data Science Salon: Kaggle 1st Place in 30 minutes: Putting AutoML to Work wi...
Formulatedby
 
Data Science Salon: Smart Cities
Formulatedby
 
Data Science Salon: Building a Data Driven Product Mindset
Formulatedby
 
Data Science Salon: Introduction to Machine Learning - Marketing Use Case
Formulatedby
 
Data Science Salon: Adopting Machine Learning to Drive Revenue and Market Share
Formulatedby
 
Data Science Salon: Data visualization and Analysis in the Florida Panthers H...
Formulatedby
 
Data Science Salon: Machine Learning for Personalized Cancer Vaccines
Formulatedby
 
Data Science Salon: Building a Data Science Culture
Formulatedby
 
Data Science Salon: Digital Transformation: The Data Science Catalyst
Formulatedby
 
Data Science Salon: Quit Wasting Time – Case Studies in Production Machine Le...
Formulatedby
 
Data Science Salon: Enabling self-service predictive analytics at Bidtellect
Formulatedby
 
Data Science Salon: MCL Clustering of Sparse Graphs
Formulatedby
 
Data Science Salon: Applying Machine Learning to Modernize Business Processes
Formulatedby
 
Data Science Salon: Building smart AI: How Deep Learning Can Get You Into Dee...
Formulatedby
 
Data Science Salon: The Age of Co-creation
Formulatedby
 
Ad

Recently uploaded (20)

PPTX
Employee Salary Presentation.l based on data science collection of data
barridevakumari2004
 
PPTX
Pipeline Automatic Leak Detection for Water Distribution Systems
Sione Palu
 
PPTX
Data Security Breach: Immediate Action Plan
varmabhuvan266
 
PPTX
INFO8116 - Week 10 - Slides.pptx data analutics
guddipatel10
 
PPTX
Web dev -ppt that helps us understand web technology
shubhragoyal12
 
PDF
Key_Statistical_Techniques_in_Analytics_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PDF
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
PDF
Classifcation using Machine Learning and deep learning
bhaveshagrawal35
 
PDF
An Uncut Conversation With Grok | PDF Document
Mike Hydes
 
PPTX
INFO8116 -Big data architecture and analytics
guddipatel10
 
PDF
blockchain123456789012345678901234567890
tanvikhunt1003
 
PPTX
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
PPTX
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
PPTX
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
PPTX
INFO8116 - Week 10 - Slides.pptx big data architecture
guddipatel10
 
PPTX
Introduction to Biostatistics Presentation.pptx
AtemJoshua
 
PPTX
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
PDF
Research about a FoodFolio app for personalized dietary tracking and health o...
AustinLiamAndres
 
PPTX
Probability systematic sampling methods.pptx
PrakashRajput19
 
PDF
Blue Futuristic Cyber Security Presentation.pdf
tanvikhunt1003
 
Employee Salary Presentation.l based on data science collection of data
barridevakumari2004
 
Pipeline Automatic Leak Detection for Water Distribution Systems
Sione Palu
 
Data Security Breach: Immediate Action Plan
varmabhuvan266
 
INFO8116 - Week 10 - Slides.pptx data analutics
guddipatel10
 
Web dev -ppt that helps us understand web technology
shubhragoyal12
 
Key_Statistical_Techniques_in_Analytics_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
Classifcation using Machine Learning and deep learning
bhaveshagrawal35
 
An Uncut Conversation With Grok | PDF Document
Mike Hydes
 
INFO8116 -Big data architecture and analytics
guddipatel10
 
blockchain123456789012345678901234567890
tanvikhunt1003
 
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
INFO8116 - Week 10 - Slides.pptx big data architecture
guddipatel10
 
Introduction to Biostatistics Presentation.pptx
AtemJoshua
 
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
Research about a FoodFolio app for personalized dietary tracking and health o...
AustinLiamAndres
 
Probability systematic sampling methods.pptx
PrakashRajput19
 
Blue Futuristic Cyber Security Presentation.pdf
tanvikhunt1003
 

Data Science Salon: Deep Learning as a Product @ Scribd

Editor's Notes

  • #18: Path of least resistance
  • #20: MOAR PICTURES
  • #59: 66% + 24%