Deep Learning and Recurrent Neural
Networks in the Enterprise
StampedeCon
St. Louis 2016
Josh Patterson, Skymind
Presenter: Josh Patterson
Past
Research in Swarm Algorithms: Real-time optimization techniques in
mesh sensor networks
TVA / NERC: Smartgrid, Sensor Collection, and Big Data
Cloudera: Principal SA, Working with Fortune 500
Patterson Consulting: Working with Fortune 500 on Big Data, ML
Today
Skymind, Director Field Engineering
josh@skymind.io / @jpatanooga
DL4J Co-creator,
Co-author of the upcoming O’Reilly book
“Deep Learning: A Practitioner’s Approach”
Topics
• What is Deep Learning?
• DL4J
• Recurrent Neural Network Applications
WHAT IS DEEP LEARNING?
Defining Deep Learning
• Higher neuron counts than in previous
generation neural networks
• Different and evolved ways to connect layers
inside neural networks
• More computing power to train
• Automated Feature Learning
Automated Feature Learning
• Deep Learning can be thought of as workflows
for automated feature construction
– From “feature construction” to “feature learning”
• As Yann LeCun says:
– “machines that learn to represent the world”
These are the features learned at each neuron in a Restricted Boltzmann Machine
(RBM).
These features are passed to higher levels of RBMs to learn more complicated things.
Part of the
“7” digit
Unreasonable Effectiveness:
Benchmark Records
1. Text-to-speech synthesis (Fan et al., Microsoft, Interspeech 2014)
2. Language identification (Gonzalez-Dominguez et al., Google, Interspeech 2014)
3. Large vocabulary speech recognition (Sak et al., Google, Interspeech 2014)
4. Prosody contour prediction (Fernandez et al., IBM, Interspeech 2014)
5. Medium vocabulary speech recognition (Geiger et al., Interspeech 2014)
6. English to French translation (Sutskever et al., Google, NIPS 2014)
7. Audio onset detection (Marchi et al., ICASSP 2014)
8. Social signal classification (Brueckner & Schulter, ICASSP 2014)
9. Arabic handwriting recognition (Bluche et al., DAS 2014)
10. TIMIT phoneme recognition (Graves et al., ICASSP 2013)
11. Optical character recognition (Breuel et al., ICDAR 2013)
12. Image caption generation (Vinyals et al., Google, 2014)
13. Video to textual description (Donahue et al., 2014)
14. Syntactic parsing for Natural Language Processing (Vinyals et al., Google, 2014)
15. Photo-real talking heads (Soong and Wang, Microsoft, 2014).
Four Major Architectures
• Deep Belief Networks
• Convolutional Neural Networks
• Recurrent Neural Networks
• Recursive Neural Networks
Quick Usage Guide
• If I have Timeseries or Audio Input
– I should use a Recurrent Neural Network
– Examples: Fraud Detection, Anomaly Detection
• If I have Image input
– I should use a Convolutional Neural Network
• If I have Video input
– I should use a hybrid Convolutional + Recurrent
Architecture!
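The guide above amounts to a lookup from input type to architecture. A minimal sketch in Python follows; the function name `suggest_architecture` and the string keys are hypothetical, chosen for illustration, and are not part of DL4J.

```python
# Hypothetical lookup that mirrors the Quick Usage Guide; not a DL4J API.
GUIDE = {
    "timeseries": "Recurrent Neural Network",
    "audio": "Recurrent Neural Network",
    "image": "Convolutional Neural Network",
    "video": "Hybrid Convolutional + Recurrent Architecture",
}


def suggest_architecture(input_type: str) -> str:
    """Return the architecture the guide suggests for an input type."""
    key = input_type.strip().lower()
    if key not in GUIDE:
        raise ValueError(f"the guide does not cover input type: {input_type!r}")
    return GUIDE[key]


print(suggest_architecture("Video"))
```

Input types the guide does not mention raise an error rather than guessing, since the slide only covers these four cases.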
Convolutional Generated Art
The More Things Change…
• Deep Learning is still trying to answer the
same fundamental questions such as:
– “is this image a face?”
• The difference is Deep Learning makes hard
questions easier to answer with better
architectures and more computing power
– We do this by matching the correct architecture
with the right problem
Building Deep Neural Networks with DL4J
DL4J
• “The Hadoop of Deep Learning”
– Java, Scala, and Python APIs
– Apache License 2.0
• Java implementation
– Parallelization (YARN + Spark)
– GPU support
• Also supports multiple GPUs per host
• Runtime Neutral
– Local
– Hadoop / YARN + Spark
• https://github.com/deeplearning4j/deeplearning4j
DL4J Workflow Toolchain
ETL (DataVec) -> Vectorization (DataVec) -> Modeling (DL4J) -> Evaluation (Arbiter)
Execution Platforms: Spark, Single Machine
ND4J - Linear Algebra Runtime: CPU, GPU
ND4J: The Need for Speed
• JavaCPP (Cython for Java)
– Auto-generates JNI bindings for C++ by parsing classes
– Allows for easy maintenance and deployment of C++
binaries in Java
• CPU backends
– OpenMP (multithreading within native operations)
– OpenBLAS or MKL (BLAS operations)
– SIMD extensions
• GPU backends
– DL4J supports CUDA 7.5 at the moment, and will support
8.0 as soon as it comes out.
– Leverages cuDNN as well
Prepping Data is Time Consuming
http://www.forbes.com/sites/gilpress/2016/03/23/data-preparation-most-time-consuming-least-enjoyable-data-science-task-survey-says/#633ea7f67f75
Preparing Data for Modeling is Hard
DataVec
• DataVec is a tool for machine learning ETL
(Extract, Transform, Load) operations.
– Spark-Enabled and focused on Supporting DL4J
• Also performs vectorization
– Image, CSV, Sequences (timeseries), more
• Open Source, Apache License 2.0
– https://github.com/deeplearning4j/DataVec
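To make "vectorization" concrete, here is a minimal sketch of turning CSV records into numeric feature vectors with one-hot labels. It uses only the Python standard library and is not DataVec's API; the sample rows, the label-in-last-column layout, and the function name `vectorize` are all assumptions made for illustration.

```python
import csv
import io

# Made-up sample data: two numeric features followed by a string label.
RAW_CSV = """5.1,3.5,setosa
6.2,2.9,versicolor
5.9,3.0,virginica
"""


def vectorize(csv_text):
    """Turn CSV rows into (features, one_hot_label) pairs of floats."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    # Assign each distinct label an index, in sorted order for repeatability.
    labels = sorted({row[-1] for row in rows})
    index = {name: i for i, name in enumerate(labels)}
    vectors = []
    for row in rows:
        features = [float(value) for value in row[:-1]]
        one_hot = [0.0] * len(labels)
        one_hot[index[row[-1]]] = 1.0
        vectors.append((features, one_hot))
    return labels, vectors


labels, vectors = vectorize(RAW_CSV)
for features, one_hot in vectors:
    print(features, one_hot)
```

The output pairs are the kind of numeric input a network can train on; a real pipeline would also handle missing values, normalization, and data too large for one machine.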
Using DL4J for RECURRENT NEURAL NETWORK APPLICATIONS
Transactional Data Explosion
• 2,500 exabytes of new information in 2012 with Internet as primary driver
• Digital universe grew by 62% last year to 800K petabytes and will grow to 1.2
“zettabytes” this year
Source: IDC White Paper, sponsored by EMC:
“As the Economy Contracts, the Digital Universe Expands.” May 2009.
Relational
Transactional
(Logs, Sensors)
(You)
NERC Sensor Data Collection
openPDC PMU Data Collection circa 2009
• 120 Sensors
• 30 samples/second
• 4.3B Samples/day
• Housed in Hadoop
Sensor Timeseries Classification with RNNs
• Recurrent Neural Networks have the ability to
model change of input over time
• Older techniques (mostly) do not retain the time
domain
– Hidden Markov Models do…
• but are more limited
• Key Takeaway:
– For timeseries data, RNNs will generally be more accurate
than techniques that ignore the time domain
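The point about modeling change over time can be illustrated with a bare-bones recurrent step. The sketch below is plain Python, not DL4J code; the weights are hand-picked rather than trained, and the function name `rnn_forward` is an assumption for illustration. It feeds the same three values in rising and in falling order and shows that the final hidden state differs, because each step depends on the state left by the previous one.

```python
import math


def rnn_forward(inputs, w_in, w_rec, bias):
    """Run a one-layer recurrent network over a sequence of scalars.

    h_t = tanh(w_in * x_t + w_rec @ h_{t-1} + bias)
    Returns the hidden state after every time step.
    """
    size = len(bias)
    hidden = [0.0] * size
    states = []
    for x in inputs:
        new_hidden = []
        for i in range(size):
            total = w_in[i] * x + bias[i]
            for j in range(size):
                total += w_rec[i][j] * hidden[j]
            new_hidden.append(math.tanh(total))
        hidden = new_hidden
        states.append(hidden)
    return states


# Hand-picked illustrative weights (not trained).
W_IN = [0.5, -0.3]
W_REC = [[0.1, 0.4], [-0.2, 0.3]]
BIAS = [0.0, 0.1]

rising = rnn_forward([0.0, 0.5, 1.0], W_IN, W_REC, BIAS)
falling = rnn_forward([1.0, 0.5, 0.0], W_IN, W_REC, BIAS)
print(rising[-1])
print(falling[-1])
```

A model that treats the three readings as an unordered set could not tell these two sequences apart; the recurrent connection is what preserves the ordering.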
RNN Architectures
• Standard supervised learning
• Image captioning
• Sentiment analysis
• Video captioning, natural language translation
• Part of speech tagging
• Generative mode for text
Anomaly Detection
• Model the normal patterns in the data
• Autoencoders give us the ability to look at
data they have not seen before
– Find anomalous patterns in sequences
– Can also use RNNs for pattern classification
• Interesting Industry Applications
– Telecom
– Financial Services
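One common way to apply the idea above is to flag inputs that a model of normal data reconstructs poorly. The sketch below shows only that thresholding step. As a loudly labeled simplification, it replaces the trained autoencoder with a trivial stand-in (the per-position mean of the normal windows); the data, the 1.5x headroom on the threshold, and all function names are assumptions for illustration.

```python
import math


def fit_normal_profile(normal_windows):
    """Stand-in for a trained autoencoder: the per-position mean of normal data."""
    count = len(normal_windows)
    width = len(normal_windows[0])
    return [sum(w[i] for w in normal_windows) / count for i in range(width)]


def reconstruction_error(window, profile):
    """Euclidean distance between a window and its reconstruction."""
    return math.sqrt(sum((x - p) ** 2 for x, p in zip(window, profile)))


def find_anomalies(windows, profile, threshold):
    """Return indices of windows whose error exceeds the threshold."""
    return [
        i for i, w in enumerate(windows)
        if reconstruction_error(w, profile) > threshold
    ]


# Made-up sensor windows for illustration.
normal = [[1.0, 1.1, 0.9], [1.0, 0.9, 1.1], [1.1, 1.0, 1.0]]
profile = fit_normal_profile(normal)
# Threshold: largest error seen on normal data, with 50% headroom (arbitrary).
threshold = 1.5 * max(reconstruction_error(w, profile) for w in normal)

incoming = [[1.0, 1.0, 1.0], [1.0, 5.0, 1.0], [0.9, 1.1, 1.0]]
print(find_anomalies(incoming, profile, threshold))
```

With a real autoencoder, `profile` would be replaced by the network's reconstruction of each window; the error and threshold logic would stay the same.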
Audio Applications
• Text-to-Speech
• Recognize specific songs / audio
• Enables natural language interfaces
“Google is living a few years in the
future and sending the rest of us
messages”
-- Doug Cutting in 2013
• However
– Most organizations are not built like Google
• (and Jeff Dean does not work at your company…)
• Anyone building Next-Gen infrastructure has
to consider these things
Certified on Two Hadoop Distributions
• Running Spark on Hadoop via YARN gives us
– Sharing cluster resources between heterogeneous
workloads concurrently
– Access to the YARN scheduler's capabilities
– Better control of executors in Spark
– Kerberos support for security
• Certified on CDH 5.4
• Certified on HDP 2.4
– [ Coming later this month ]
Questions?
Thank you for your time and attention
“Deep Learning: A Practitioner’s Approach”
(O’Reilly, October 2016)
Running DL4J Workflows on Spark
• DataVec is built to scale out via Spark RDDs
– RDD<LabeledPoint>
– RDD<DataSet>
• DL4J Uses same MultiLayerConfiguration as
single host version
– Uses SparkDl4jMultiLayer to drive the training on Spark
– Performs Parameter Averaging
spark-submit --class io.skymind.spark.dl4j.datavec.BasicDataVecExample \
  --master yarn --num-executors 1 --properties-file ./spark_extra.props \
  ./Skymind_spark-1.0-SNAPSHOT.jar
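Parameter averaging, mentioned above, can be sketched in a few lines: each worker updates its own copy of the parameters on its share of the data, and the driver averages the copies before the next round. The code below is plain Python, not the SparkDl4jMultiLayer implementation; the gradients, the learning rate, and the function names are made up for illustration.

```python
def average_parameters(worker_params):
    """Element-wise mean of the parameter vectors returned by each worker."""
    if not worker_params:
        raise ValueError("need at least one worker")
    length = len(worker_params[0])
    if any(len(p) != length for p in worker_params):
        raise ValueError("all workers must return the same number of parameters")
    count = len(worker_params)
    return [sum(p[i] for p in worker_params) / count for i in range(length)]


def local_update(params, gradient, learning_rate=0.1):
    """One plain gradient-descent step on a worker's copy of the parameters."""
    return [p - learning_rate * g for p, g in zip(params, gradient)]


# Made-up starting parameters and per-worker gradients for illustration.
shared = [1.0, 2.0]
gradients = [[1.0, 0.0], [3.0, 2.0]]

# Each worker starts from the shared parameters and trains on its own data.
updated = [local_update(shared, g) for g in gradients]
# The driver averages the results and would broadcast them for the next round.
shared = average_parameters(updated)
print(shared)
```

In a distributed setting the list comprehension over `gradients` would be a map over data partitions, and the averaging step is the only point where workers need to synchronize.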
