SlideShare a Scribd company logo
Building Distributed Deep Learning Engine 
Guangdeng Liao, Zhan Zhang and Murtaza Zafer 
SRA-SV | Cloud Research Lab Slide 1
What is Deep Learning 
Deep learning is a set of algorithms that attempt to model high-level 
abstractions in data by using architectures composed of multiple non-linear 
transformations 
Learn hidden features Learning state emission 
prob. 
Learning word vectors 
SRA-SV | Cloud Research Lab Slide 2
Thanks Big Data, Deep Learning is not only research 
Usage Scenario: Speech Recognition, Image processing and NLP 
SRA-SV | Cloud Research Lab Slide 3
Why Samsung needs Deep Learning? 
To make our devices smarter and more intelligent by recognizing 
voice, image and even language 
SRA-SV | Cloud Research Lab Slide 4
How does Deep Learning look like? 
Many more examples (millions to billions parameters ) in Speech 
Recognition, Image Processing and NLP 
Krizhevsky, A., Sutskever, I. and Hinton, G. E. ImageNet Classification with Deep Convolutional Neural Networks 
SRA-SV | Cloud Research Lab Slide 5
Deep Learning is challenging.. 
BIG DATA + BIG MODEL 
Building a distributed deep learning platform for 
Quite new, no mature platform yet 
Samsung R&D 
Hard to design and develop DL algorithms 
SRA-SV | Cloud Research Lab Slide 6
Distributed Deep Learning Platform we are building 
Object 
recognition 
App. 
Speech 
recognition 
…. 
RBM FF DA CNN …. 
Model-parallel 
engine 
Algorithms 
Infrastructure 
I/O …. 
Parameter 
server 
Execution 
engine 
Math 
SRA-SV | Cloud Research Lab Slide 7 
Our focus
Now, Let’s dive deeper and more technically… 
SRA-SV | Cloud Research Lab Slide 8
Model-Parallel Engine (MPE) 
Parallelize a big ML model over Hadoop YARN cluster 
Auto-deployment 
of topology (in-memory) 
SRA-SV | Cloud Research Lab Slide 9 
User 
defined 
model 
Auto-generation 
of model topology 
Auto-partition of 
topology over 
cluster 
c1 
c2 
c3 
Neuron-like 
programming 
Message-based 
communication 
Message-driven 
computation 
-Define nodes 
-Define groups 
-Define connections
MPE’s Architecture 
Container 
Node Manager 
Data Communication: 
• node-level 
• group-level 
Container 
Node manager 
Control comm. based on 
Thrift 
Data comm. based on Netty 
Application Master 
Controller 
Partition and 
deploy topology 
Container 
Node manager 
SRA-SV | Cloud Research Lab Slide 10
How to partition big models 
Vertical Partition Horizontal Partition 
SRA-SV | Cloud Research Lab Slide 11
Execution Engine (Layer-by-Layer Training) 
Can stack different layers and training algorithms 
HDFS/LFS 
HDFS/LFS 
SRA-SV | Cloud Research Lab Slide 12
Model-parallel itself is not scalable enough 
SRA-SV | Cloud Research Lab Slide 13
Deep Learning Infra.: Hybrid of Data-parallelism and Model-parallelism 
…….. Data Chunk 
Data Chunk 
…….. 
Model-parallel Model-parallel 
Parameter 
Server 1 
Parameters coordination 
Parameter 
Server n …….. 
Data-parallelism 
Lots of model instances 
Parameter servers 
help models learn 
each other 
SRA-SV | Cloud Research Lab Slide 14
Distributed Parameter Servers 
Currently we support asynchronous stochastic gradient descent with AdaGrad 
Client Client Client 
Server 1 Server 2 Server 3 
In-memory 
cache/storage 
HBase/HDFS 
In-memory 
cache/storage 
Asyn. communication 
In-memory 
cache/storage 
SRA-SV | Cloud Research Lab Slide 15 
Pull/Push
Deep Learning Algorithms 
Deep Learning Algorithms 
Feed-forward Neural Network 
Restricted Boltzmann 
Machine 
Denoise Auto-encoder 
Deep Belief Network 
More importantly, we can stack them 
layer by layer 
SRA-SV | Cloud Research Lab Slide 16
More Challenging Algorithm: Convolutional Neural Network 
Different convolutional, normalization and pooling layers 
Weight shared and non-shared feature maps 
Feature map is minimum partition unit 
SRA-SV | Cloud Research Lab Slide 17 
第17 页 
Layer: 
Multi-dimensional feature map 
neurons 
Output: 
Dense layer feed-forward 
neurons 
Input: 
e.g. image, spectral map of 
voice data 
Layer: 
Multi-dimensional feature map 
neurons
Sharing some early experiences/lessons 
Infrastructure 
Computation abstraction might be too low level (a lot 
of pros and cons) 
A generic deep learning platform is very challenging 
(like recurrent NN) 
Communication is important 
Methods of partitioning models are important 
High performance mathematical library is useful 
SRA-SV | Cloud Research Lab Slide 18
Sharing some early experiences 
Algorithm/Models 
Models for ASR are relatively small 
Models for image are much larger 
Models for NLP are typical small 
DA seems more efficient than RBM for 
image 
Accelerated SGD or Hessian free 
optimizations need to be explored 
SRA-SV | Cloud Research Lab Slide 19
Usage cases of Deep Learning 
SRA-SV | Cloud Research Lab Slide 20
Image Recognition 
Object models 
Object parts 
SRA-SV | Cloud Research Lab Slide 21 
Edges 
Pixels 
Image 
pixels 
Hand-designed 
Feature Extraction 
(SIFT, HOG etc.) 
Trainable 
Classifier 
Object 
Category 
Featured Learner 
(Convolutional NN is 
popular) 
Learned high level features 
Data 
augmentation 
Central and 
corner crops 
Original 
Image
Speech Recognition 
• DNN is used to replace GMM to learn state 
output probability in HMM. 
• FF and DBN have been used for ASR 
• CNN starts being used to further improve WER 
• Rectified Linear Activation seems better than 
SRA-SV | Cloud Research Lab Slide 22 
Sigmoid 
• Models are relatively small (e.g. 5 layers, 2560 
neurons/hidden layer) 
Li Deng, A Tutorial Survey of Architectures, Algorithms, and Applications for Deep Learning
NLP 
Deep Learning in NLP is quite new 
Learn word vector 
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean, Efficient Estimation of Word Representations in Vector Space 
SRA-SV | Cloud Research Lab Slide 23
NLP 
Based on word vector, map sentences to vector space now 
Sentiment Analysis 
Richard Socher Jeffrey Pennington Eric H. Huang Andrew Y. Ng Christopher D. Manning, Semi-Supervised Recursive Autoencoders 
for Predicting Sentiment Distributions 
SRA-SV | Cloud Research Lab Slide 24
Q&A

More Related Content

What's hot (20)

PDF
Neural Networks and Deep Learning
Asim Jalis
 
PDF
Handwritten Recognition using Deep Learning with R
Poo Kuan Hoong
 
PPTX
Introduction to deep learning
Abhishek Bhandwaldar
 
PDF
Intro To Convolutional Neural Networks
Mark Scully
 
PDF
Convolutional neural network
Yan Xu
 
PDF
Synthetic dialogue generation with Deep Learning
S N
 
PDF
Deep learning in Computer Vision
David Dao
 
PDF
Deep Learning and the state of AI / 2016
Grigory Sapunov
 
PPTX
Introduction to Deep Learning
Oswald Campesato
 
PDF
101: Convolutional Neural Networks
Mad Scientists
 
PDF
AI&BigData Lab. Артем Чернодуб "Распознавание изображений методом Lazy Deep ...
GeeksLab Odessa
 
PDF
Deep Learning for Personalized Search and Recommender Systems
Benjamin Le
 
PPTX
Hadoop Summit 2014 - San Jose - Introduction to Deep Learning on Hadoop
Josh Patterson
 
PDF
Deep Learning: a birds eye view
Roelof Pieters
 
PDF
Deep Learning, an interactive introduction for NLP-ers
Roelof Pieters
 
PPTX
Transfer Learning and Fine-tuning Deep Neural Networks
PyData
 
PDF
Convolutional Neural Network
Junho Cho
 
PPTX
Deep learning intro
beamandrew
 
PDF
Deep Learning & NLP: Graphs to the Rescue!
Roelof Pieters
 
PPTX
Deep learning: the future of recommendations
Balázs Hidasi
 
Neural Networks and Deep Learning
Asim Jalis
 
Handwritten Recognition using Deep Learning with R
Poo Kuan Hoong
 
Introduction to deep learning
Abhishek Bhandwaldar
 
Intro To Convolutional Neural Networks
Mark Scully
 
Convolutional neural network
Yan Xu
 
Synthetic dialogue generation with Deep Learning
S N
 
Deep learning in Computer Vision
David Dao
 
Deep Learning and the state of AI / 2016
Grigory Sapunov
 
Introduction to Deep Learning
Oswald Campesato
 
101: Convolutional Neural Networks
Mad Scientists
 
AI&BigData Lab. Артем Чернодуб "Распознавание изображений методом Lazy Deep ...
GeeksLab Odessa
 
Deep Learning for Personalized Search and Recommender Systems
Benjamin Le
 
Hadoop Summit 2014 - San Jose - Introduction to Deep Learning on Hadoop
Josh Patterson
 
Deep Learning: a birds eye view
Roelof Pieters
 
Deep Learning, an interactive introduction for NLP-ers
Roelof Pieters
 
Transfer Learning and Fine-tuning Deep Neural Networks
PyData
 
Convolutional Neural Network
Junho Cho
 
Deep learning intro
beamandrew
 
Deep Learning & NLP: Graphs to the Rescue!
Roelof Pieters
 
Deep learning: the future of recommendations
Balázs Hidasi
 

Viewers also liked (20)

PPTX
From Narrow AI to Artificial General Intelligence (AGI)
Helgi Páll Helgason, PhD
 
PDF
Deep learning and the systemic challenges of data science initiatives
Balázs Kégl
 
PDF
Deep Learning for Speech Recognition - Vikrant Singh Tomar
WithTheBest
 
PDF
Deep learning for music classification, 2016-05-24
Keunwoo Choi
 
PPTX
온라인 데이터 분석을 통한 선거예측- 김찬우, 조인호
datasciencekorea
 
PDF
Deep Learning - 인공지능 기계학습의 새로운 트랜드 :김인중
datasciencekorea
 
PDF
(Deep) Neural Networks在 NLP 和 Text Mining 总结
君 廖
 
PDF
Entrepreneurial ecosystem markers -slides
Norris Krueger
 
PDF
Tecnologías sociales.
José María
 
DOCX
Turbo promotor
simprro379
 
PDF
INDEPEDENT OUTSOURCING ASSURANCE
Arul Nambi
 
PDF
Alphaworks deck v2
alphaworks
 
PDF
data science in academia and the real world
chris wiggins
 
PPT
Sheena Lowrie, Spaces 2012
TCV Scotland
 
PPTX
Verefranceze
simprro379
 
DOC
Android开发工程师必备
mornone
 
DOC
Evaluation question 1 music video
Greenshaw High School
 
PPTX
Vinsitesafe
simprro379
 
PPTX
Universidad nacional de chimborazo(Nellyta)
UNACH
 
PPTX
Synflex
simprro379
 
From Narrow AI to Artificial General Intelligence (AGI)
Helgi Páll Helgason, PhD
 
Deep learning and the systemic challenges of data science initiatives
Balázs Kégl
 
Deep Learning for Speech Recognition - Vikrant Singh Tomar
WithTheBest
 
Deep learning for music classification, 2016-05-24
Keunwoo Choi
 
온라인 데이터 분석을 통한 선거예측- 김찬우, 조인호
datasciencekorea
 
Deep Learning - 인공지능 기계학습의 새로운 트랜드 :김인중
datasciencekorea
 
(Deep) Neural Networks在 NLP 和 Text Mining 总结
君 廖
 
Entrepreneurial ecosystem markers -slides
Norris Krueger
 
Tecnologías sociales.
José María
 
Turbo promotor
simprro379
 
INDEPEDENT OUTSOURCING ASSURANCE
Arul Nambi
 
Alphaworks deck v2
alphaworks
 
data science in academia and the real world
chris wiggins
 
Sheena Lowrie, Spaces 2012
TCV Scotland
 
Verefranceze
simprro379
 
Android开发工程师必备
mornone
 
Evaluation question 1 music video
Greenshaw High School
 
Vinsitesafe
simprro379
 
Universidad nacional de chimborazo(Nellyta)
UNACH
 
Synflex
simprro379
 
Ad

Similar to Building distributed deep learning engine (20)

PPTX
Data platform at Samsung (Big Learning)
ZhuanzhuanDing
 
PDF
Kaz Sato, Evangelist, Google at MLconf ATL 2016
MLconf
 
PPT
Introduction_to_DEEP_LEARNING ppt 101ppt
sathyanarayanakb1
 
PPT
Introduction_to_DEEP_LEARNING.sfsdafsadfsadfsdafsdppt
NaiduSetti
 
PPT
Introduction_to_DEEP_LEARNING.ppt
SwatiMahale4
 
PDF
Big Data Malaysia - A Primer on Deep Learning
Poo Kuan Hoong
 
PDF
Large Scale Deep Learning with TensorFlow
Jen Aman
 
PDF
Track2 02. machine intelligence at google scale google, kaz sato, staff devel...
양 한빛
 
PDF
Sjug #26 ml is in java but is dl too - ver1.04 - tomasz sikora 2018-03-23
Tomasz Sikora
 
PPTX
Deep learning tutorial (i)
Guan Wang
 
PDF
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
Chris Fregly
 
PPTX
Deep Learning and Recurrent Neural Networks in the Enterprise
Josh Patterson
 
PDF
An Introduction to Deep Learning
Poo Kuan Hoong
 
PDF
Introduction to Deep Learning (NVIDIA)
Rakuten Group, Inc.
 
PDF
Neural Networks, Spark MLlib, Deep Learning
Asim Jalis
 
PPTX
Deep learning tutorial 9/2019
Amr Rashed
 
PPTX
Deep Learning Tutorial
Amr Rashed
 
PPTX
Deep learning
SauravJaiswal17
 
DOCX
Title_ Deep Learning Explained_ What You Should Be Aware of in Data Science a...
nnibedita021
 
PDF
Deep Learning with Hadoop 1st Edition Dipayan Dev
sekashomolax
 
Data platform at Samsung (Big Learning)
ZhuanzhuanDing
 
Kaz Sato, Evangelist, Google at MLconf ATL 2016
MLconf
 
Introduction_to_DEEP_LEARNING ppt 101ppt
sathyanarayanakb1
 
Introduction_to_DEEP_LEARNING.sfsdafsadfsadfsdafsdppt
NaiduSetti
 
Introduction_to_DEEP_LEARNING.ppt
SwatiMahale4
 
Big Data Malaysia - A Primer on Deep Learning
Poo Kuan Hoong
 
Large Scale Deep Learning with TensorFlow
Jen Aman
 
Track2 02. machine intelligence at google scale google, kaz sato, staff devel...
양 한빛
 
Sjug #26 ml is in java but is dl too - ver1.04 - tomasz sikora 2018-03-23
Tomasz Sikora
 
Deep learning tutorial (i)
Guan Wang
 
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
Chris Fregly
 
Deep Learning and Recurrent Neural Networks in the Enterprise
Josh Patterson
 
An Introduction to Deep Learning
Poo Kuan Hoong
 
Introduction to Deep Learning (NVIDIA)
Rakuten Group, Inc.
 
Neural Networks, Spark MLlib, Deep Learning
Asim Jalis
 
Deep learning tutorial 9/2019
Amr Rashed
 
Deep Learning Tutorial
Amr Rashed
 
Deep learning
SauravJaiswal17
 
Title_ Deep Learning Explained_ What You Should Be Aware of in Data Science a...
nnibedita021
 
Deep Learning with Hadoop 1st Edition Dipayan Dev
sekashomolax
 
Ad

Building distributed deep learning engine

  • 1. Building Distributed Deep Learning Engine Guangdeng Liao, Zhan Zhang and Murtaza Zafer SRA-SV | Cloud Research Lab Slide 1
  • 2. What is Deep Learning Deep learning is a set of algorithms that attempt to model high-level abstractions in data by using architectures composed of multiple non-linear transformations Learn hidden features Learning state emission prob. Learning word vectors SRA-SV | Cloud Research Lab Slide 2
  • 3. Thanks Big Data, Deep Learning is not only research Usage Scenario: Speech Recognition, Image processing and NLP SRA-SV | Cloud Research Lab Slide 3
  • 4. Why Samsung needs Deep Learning? To make our devices smarter and more intelligent by recognizing voice, image and even language SRA-SV | Cloud Research Lab Slide 4
  • 5. How does Deep Learning look like? Many more examples (millions to billions parameters ) in Speech Recognition, Image Processing and NLP Krizhevsky, A., Sutskever, I. and Hinton, G. E. ImageNet Classification with Deep Convolutional Neural Networks SRA-SV | Cloud Research Lab Slide 5
  • 6. Deep Learning is challenging.. BIG DATA + BIG MODEL Building a distributed deep learning platform for Quite new, no mature platform yet Samsung R&D Hard to design and develop DL algorithms SRA-SV | Cloud Research Lab Slide 6
  • 7. Distributed Deep Learning Platform we are building Object recognition App. Speech recognition …. RBM FF DA CNN …. Model-parallel engine Algorithms Infrastructure I/O …. Parameter server Execution engine Math SRA-SV | Cloud Research Lab Slide 7 Our focus
  • 8. Now, Let’s dive deeper and more technically… SRA-SV | Cloud Research Lab Slide 8
  • 9. Model-Parallel Engine (MPE) Parallelize a big ML model over Hadoop YARN cluster Auto-deployment of topology (in-memory) SRA-SV | Cloud Research Lab Slide 9 User defined model Auto-generation of model topology Auto-partition of topology over cluster c1 c2 c3 Neuron-like programming Message-based communication Message-driven computation -Define nodes -Define groups -Define connections
  • 10. MPE’s Architecture Container Node Manager Data Communication: • node-level • group-level Container Node manager Control comm. based on Thrift Data comm. based on Netty Application Master Controller Partition and deploy topology Container Node manager SRA-SV | Cloud Research Lab Slide 10
  • 11. How to partition big models Vertical Partition Horizontal Partition SRA-SV | Cloud Research Lab Slide 11
  • 12. Execution Engine (Layer-by-Layer Training) Can stack different layers and training algorithms HDFS/LFS HDFS/LFS SRA-SV | Cloud Research Lab Slide 12
  • 13. Model-parallel itself is not scalable enough SRA-SV | Cloud Research Lab Slide 13
  • 14. Deep Learning Infra.: Hybrid of Data-parallelism and Model-parallelism …….. Data Chunk Data Chunk …….. Model-parallel Model-parallel Parameter Server 1 Parameters coordination Parameter Server n …….. Data-parallelism Lots of model instances Parameter servers help models learn each other SRA-SV | Cloud Research Lab Slide 14
  • 15. Distributed Parameter Servers Currently we support asynchronous stochastic gradient descent with AdaGrad Client Client Client Server 1 Server 2 Server 3 In-memory cache/storage HBase/HDFS In-memory cache/storage Asyn. communication In-memory cache/storage SRA-SV | Cloud Research Lab Slide 15 Pull/Push
  • 16. Deep Learning Algorithms Deep Learning Algorithms Feed-forward Neural Network Restricted Boltzmann Machine Denoise Auto-encoder Deep Belief Network More importantly, we can stack them layer by layer SRA-SV | Cloud Research Lab Slide 16
  • 17. More Challenging Algorithm: Convolutional Neural Network Different convolutional, normalization and pooling layers Weight shared and non-shared feature maps Feature map is minimum partition unit SRA-SV | Cloud Research Lab Slide 17 第17 页 Layer: Multi-dimensional feature map neurons Output: Dense layer feed-forward neurons Input: e.g. image, spectral map of voice data Layer: Multi-dimensional feature map neurons
  • 18. Sharing some early experiences/lessons Infrastructure Computation abstraction might be too low level (a lot of pros and cons) A generic deep learning platform is very challenging (like recurrent NN) Communication is important Methods of partitioning models are important High performance mathematical library is useful SRA-SV | Cloud Research Lab Slide 18
  • 19. Sharing some early experiences Algorithm/Models Models for ASR are relatively small Models for image are much larger Models for NLP are typical small DA seems more efficient than RBM for image Accelerated SGD or Hessian free optimizations need to be explored SRA-SV | Cloud Research Lab Slide 19
  • 20. Usage cases of Deep Learning SRA-SV | Cloud Research Lab Slide 20
  • 21. Image Recognition Object models Object parts SRA-SV | Cloud Research Lab Slide 21 Edges Pixels Image pixels Hand-designed Feature Extraction (SIFT, HOG etc.) Trainable Classifier Object Category Featured Learner (Convolutional NN is popular) Learned high level features Data augmentation Central and corner crops Original Image
  • 22. Speech Recognition • DNN is used to replace GMM to learn state output probability in HMM. • FF and DBN have been used for ASR • CNN starts being used to further improve WER • Rectified Linear Activation seems better than SRA-SV | Cloud Research Lab Slide 22 Sigmoid • Models are relatively small (e.g. 5 layers, 2560 neurons/hidden layer) Li Deng, A Tutorial Survey of Architectures, Algorithms, and Applications for Deep Learning
  • 23. NLP Deep Learning in NLP is quite new Learn word vector Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean, Efficient Estimation of Word Representations in Vector Space SRA-SV | Cloud Research Lab Slide 23
  • 24. NLP Based on word vector, map sentences to vector space now Sentiment Analysis Richard Socher Jeffrey Pennington Eric H. Huang Andrew Y. Ng Christopher D. Manning, Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions SRA-SV | Cloud Research Lab Slide 24
  • 25. Q&A

Editor's Notes

  • #17: Plan to validate Stanford’s alg. For ASR??