SlideShare a Scribd company logo
OCR PROCESSING WITH
DEEP LEARNING: APPLY TO
VIETNAMESE DOCUMENTS
VIET-TRUNG TRAN, ANH PHI NGUYEN, KHUYEN NGUYEN
OUTLINE
• OCR overview
• History
• Pipelining
• Deep learning for OCR
• Motivation
• Connectionist temporal classification (CTC) network
• LSTM + CTC for sequence recognition
WHAT IS OCR
• Optical character recognition (optical character reader) (OCR) is the
mechanical or electronic conversion of images of typed, handwritten or
printed text into machine-encoded text
OCR TYPES
• Optical Character Recognition (OCR)
• Targets typewritten text, one character at a time
• Optical Word Recognition (OWR)
• Typewritten text, one word at a time
• Intelligent Character Recognition (ICR)
• Handwritten print script, one character at a time
• Intelligent Word Recognition (IWR)
• Handwritten, one word at a time
HISTORY OF OCR: TESSERACT OCR ENGINE
TIMELINE
TESSERACT SYSTEM ARCHITECTURE
ARCHITECTURE [CONT’D]
ADAPTIVE THRESHOLDING
PAGE LAYOUT ANALYSIS
Smith, Ray. "Hybrid page layout analysis via tab-stop
detection." Document Analysis and Recognition, 2009. ICDAR'09. 10th
International Conference on. IEEE, 2009.
IMAGE LEVEL PAGE LAYOUT ANALYSIS
• Using the morphological processing from Leptonica
• http://www.slideshare.net/versae/javier-de-larosacs9883-5912825
CONNECTED COMPONENT ANALYSIS
COLUMN FINDING
BLOCK FINDING
TESSERACT WORD RECOGNIZER
http://www.slideshare.net/temsolin/2-architecture-anddatastructures
FEATURES AND WORD CLASSIFIER
Classical character classification
SEGMENTATION GRAPH
CHAR SEGMENTATION, LANGUAGE MODEL AND
BEAM SEARCH
OCR CHALLENGES
1. Fonts specifics
Never overcome their ability to understand a limited numbers of fonts and page
formats
2. Character bounding boxes
3. Extracting features unreliable
4. Slow performance
TESSERACT TUTORIAL @ DAS 2014
RECENT IMPROVEMENTS
1. Multilanguages
2. Full layout analysis
3. Table detection
4. Equation detection
5. Better language models
6. Hand-written text
LSTM FOR TEXT RECOGNITION
MOTIVATION
• Segmentation is difficult for cursive or unconstrained text
• R. Smith, “History of the Tesseract OCR engine: what worked and
what didn’t ,” in DRR XX, San Francisco, USA, Feb. 2013.
• there was not a single method proposed for OCR, that can achieve
very low error rates without using aforementioned sophisticated
post-processing techniques.
RESEARCH BREAKTHROUGH
A. Graves, M. Liwicki, S. Fernandez, Bertolami, H. Bunke, and J.
Schmidhuber, “A Novel Connectionist System for Unconstrained
Handwriting Recognition,” IEEE Trans. on Pattern Analysis and Machine
Intelligence, vol. 31, no. 5, pp. 855–868, May 2008.
TEXT LINE NORMALIZATION
TEXT LINE RECOGNITION
MOTIVATION
• Real-world sequence learning task
• OCR (Optical character recognition)
• ASR (Automatic speech recognition)
• Requires
• prediction of sequences of labels from noisy, unsegmented input data
• Recurrent neural networks (RNN) can be used for sequence learning, but
ask for
• pre-segmented training data
• post-processing to transform outputs into label sequences
CONNECTIONIST TEMPORAL CLASSIFICATION
(CTC)
• Graves, Alex, et al. "Connectionist temporal classification: labelling
unsegmented sequence data with recurrent neural
networks." Proceedings of the 23rd international conference on Machine
learning. ACM, 2006.
• WHAT CTC IS ALL ABOUT?
•a novel method for training RNNs to label
unsegmented sequences directly
THE SPEECH RECOGNITION PROBLEM
OCR processing with deep learning: Apply to Vietnamese documents
OCR processing with deep learning: Apply to Vietnamese documents
OCR processing with deep learning: Apply to Vietnamese documents
OCR processing with deep learning: Apply to Vietnamese documents
OCR processing with deep learning: Apply to Vietnamese documents
DYNAMIC TIME WRAPERING
• Because the length of y might differ from (often longer than) l, so the
inference of l from y is actually a dynamic time warping problem.
OCR processing with deep learning: Apply to Vietnamese documents
OCR processing with deep learning: Apply to Vietnamese documents
CONNECTIONIST TEMPORAL CLASSIFICATION
• o transform the network outputs into a conditional probability
distribution over label sequences
• A CTC network has a softmax output layer with one more unit than there
are labels in L
• activations of the first |L| units are interpreted as the probabilities of observing the
corresponding labels at particular times
• activation of the extra unit is the probability of observing a ‘blank’, or no label
PREFIX SEARCH DECODING ON THE LABEL
ALPHABET X,Y
OCR processing with deep learning: Apply to Vietnamese documents
LONG SHORT-TERM MEMORY (LSTM)
• One type of RNN networks
• RNN vanishing gradient problem
• influence of a given input on the hidden layer, and therefore on the network output,
either decays or blows up exponentially as it cycles around the network’s recurrent
connections
• LSTM is designed to address vanishing gradient problem
• An LSTM hidden layer consists of recurrently connected subnets, called
memory blocks
• Each block contains a set of internal units, or cells, whose activation is
controlled by three multiplicative gates: the input gate, forget gate and
output gate
LSTM MEMORY BLOCK
FORGET GATE
INPUT GATE
OUTPUT GATE
LSTM -> CTC OUTPUT LAYER: OCR
DEMO TIME: OCR FOR VIETNAMESE DOCUMENTS
Thank you!
REFERENCES - CREDITS
• https://github.com/yiwangbaidu/notes/blob/master/CTC/CTC.pdf
• http://colah.github.io/posts/2015-08-Understanding-LSTMs/
• Ray Smith. Everything you always wanted to know about
Tesseract. Tesseract tutorial @ DAS 2014

More Related Content

What's hot (20)

PDF
Spring Framework - AOP
Dzmitry Naskou
 
PPTX
TypeScript Overview
Aniruddha Chakrabarti
 
PDF
Big Query Basics
Ido Green
 
PPTX
Functional programming with Java 8
LivePerson
 
PDF
JavaScript Interview Questions and Answers | Full Stack Web Development Train...
Edureka!
 
PPTX
Token Authentication in ASP.NET Core
Stormpath
 
PPTX
RedisConf18 - Techniques for Synchronizing In-Memory Caches with Redis
Redis Labs
 
PDF
Mastering the MongoDB Shell
MongoDB
 
PPTX
C# Async Await
Simplilearn
 
PDF
Solid NodeJS with TypeScript, Jest & NestJS
Rafael Casuso Romate
 
PDF
Web Services (SOAP, WSDL, UDDI)
Peter R. Egli
 
DOC
Typescript Basics
Manikandan [M M K]
 
PPTX
Introduction to Spring Boot
Purbarun Chakrabarti
 
PPTX
Angular
sridhiya
 
PPTX
Jenkins for java world
Ashok Kumar
 
PDF
FixMatch:simplifying semi supervised learning with consistency and confidence
LEE HOSEONG
 
PDF
Semantic Versioning
Drupalize.Me
 
PDF
Nodejs presentation
Arvind Devaraj
 
PDF
Data Modeling for MongoDB
MongoDB
 
Spring Framework - AOP
Dzmitry Naskou
 
TypeScript Overview
Aniruddha Chakrabarti
 
Big Query Basics
Ido Green
 
Functional programming with Java 8
LivePerson
 
JavaScript Interview Questions and Answers | Full Stack Web Development Train...
Edureka!
 
Token Authentication in ASP.NET Core
Stormpath
 
RedisConf18 - Techniques for Synchronizing In-Memory Caches with Redis
Redis Labs
 
Mastering the MongoDB Shell
MongoDB
 
C# Async Await
Simplilearn
 
Solid NodeJS with TypeScript, Jest & NestJS
Rafael Casuso Romate
 
Web Services (SOAP, WSDL, UDDI)
Peter R. Egli
 
Typescript Basics
Manikandan [M M K]
 
Introduction to Spring Boot
Purbarun Chakrabarti
 
Angular
sridhiya
 
Jenkins for java world
Ashok Kumar
 
FixMatch:simplifying semi supervised learning with consistency and confidence
LEE HOSEONG
 
Semantic Versioning
Drupalize.Me
 
Nodejs presentation
Arvind Devaraj
 
Data Modeling for MongoDB
MongoDB
 

Similar to OCR processing with deep learning: Apply to Vietnamese documents (20)

PPTX
Intelligent Handwriting Recognition_MIL_presentation_v3_final
Suhas Pillai
 
PDF
Evgen Terpil "OCR in the Wild World of Social Media"
Fwdays
 
PDF
Optical Character Recognition Using Python
YogeshIJTSRD
 
PDF
Real Time Sign Language Recognition Using Deep Learning
IRJET Journal
 
PDF
IRJET- Image to Text Conversion using Tesseract
IRJET Journal
 
PDF
IRJET- Automatic Data Collection from Forms using Optical Character Recognition
IRJET Journal
 
PDF
Handwritten Text Recognition and Translation with Audio
IRJET Journal
 
PDF
Deep Learning in Text Recognition and Text Detection : A Review
IRJET Journal
 
PPTX
OCR Presentation hjhPresentation 23.pptx
SupriyaGhosh51
 
PDF
IRJET- Offline Transcription using AI
IRJET Journal
 
PPTX
sahuPPT.pptx
KartikDutta10
 
PDF
IRJET- Optical Character Recognition using Neural Networks by Classification ...
IRJET Journal
 
PDF
DIGEST PODCAST
IRJET Journal
 
PDF
Text Recognition using Convolutional Neural Network: A Review
IRJET Journal
 
PPTX
ICFHR'18-DataAug.pptx
KartikDutta10
 
PDF
Text and Object Recognition using Deep Learning for Visually Impaired People
ijtsrd
 
PDF
Optical Recognition of Handwritten Text
IRJET Journal
 
PPTX
TOPIC__MODELING_IN_NLP__& __EasyOCR.pptx
ebraheem943946
 
PDF
IRJET- Wearable AI Device for Blind
IRJET Journal
 
PDF
Handwritten Text Recognition and Digital Text Conversion
ijtsrd
 
Intelligent Handwriting Recognition_MIL_presentation_v3_final
Suhas Pillai
 
Evgen Terpil "OCR in the Wild World of Social Media"
Fwdays
 
Optical Character Recognition Using Python
YogeshIJTSRD
 
Real Time Sign Language Recognition Using Deep Learning
IRJET Journal
 
IRJET- Image to Text Conversion using Tesseract
IRJET Journal
 
IRJET- Automatic Data Collection from Forms using Optical Character Recognition
IRJET Journal
 
Handwritten Text Recognition and Translation with Audio
IRJET Journal
 
Deep Learning in Text Recognition and Text Detection : A Review
IRJET Journal
 
OCR Presentation hjhPresentation 23.pptx
SupriyaGhosh51
 
IRJET- Offline Transcription using AI
IRJET Journal
 
sahuPPT.pptx
KartikDutta10
 
IRJET- Optical Character Recognition using Neural Networks by Classification ...
IRJET Journal
 
DIGEST PODCAST
IRJET Journal
 
Text Recognition using Convolutional Neural Network: A Review
IRJET Journal
 
ICFHR'18-DataAug.pptx
KartikDutta10
 
Text and Object Recognition using Deep Learning for Visually Impaired People
ijtsrd
 
Optical Recognition of Handwritten Text
IRJET Journal
 
TOPIC__MODELING_IN_NLP__& __EasyOCR.pptx
ebraheem943946
 
IRJET- Wearable AI Device for Blind
IRJET Journal
 
Handwritten Text Recognition and Digital Text Conversion
ijtsrd
 
Ad

More from Viet-Trung TRAN (20)

PDF
Bắt đầu tìm hiểu về dữ liệu lớn như thế nào - 2017
Viet-Trung TRAN
 
PDF
Dynamo: Amazon’s Highly Available Key-value Store
Viet-Trung TRAN
 
PDF
Pregel: Hệ thống xử lý đồ thị lớn
Viet-Trung TRAN
 
PDF
Mapreduce simplified-data-processing
Viet-Trung TRAN
 
PDF
Tìm kiếm needle trong Haystack: Hệ thống lưu trữ ảnh của Facebook
Viet-Trung TRAN
 
PPTX
giasan.vn real-estate analytics: a Vietnam case study
Viet-Trung TRAN
 
PDF
Giasan.vn @rstars
Viet-Trung TRAN
 
PDF
A Vietnamese Language Model Based on Recurrent Neural Network
Viet-Trung TRAN
 
PDF
A Vietnamese Language Model Based on Recurrent Neural Network
Viet-Trung TRAN
 
PPTX
Large-Scale Geographically Weighted Regression on Spark
Viet-Trung TRAN
 
PDF
Recent progress on distributing deep learning
Viet-Trung TRAN
 
PDF
success factors for project proposals
Viet-Trung TRAN
 
PDF
GPSinsights poster
Viet-Trung TRAN
 
PDF
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...
Viet-Trung TRAN
 
PDF
Deep learning for nlp
Viet-Trung TRAN
 
PDF
Introduction to BigData @TCTK2015
Viet-Trung TRAN
 
PDF
From neural networks to deep learning
Viet-Trung TRAN
 
PDF
From decision trees to random forests
Viet-Trung TRAN
 
PPTX
Recommender systems: Content-based and collaborative filtering
Viet-Trung TRAN
 
PPTX
3 - Finding similar items
Viet-Trung TRAN
 
Bắt đầu tìm hiểu về dữ liệu lớn như thế nào - 2017
Viet-Trung TRAN
 
Dynamo: Amazon’s Highly Available Key-value Store
Viet-Trung TRAN
 
Pregel: Hệ thống xử lý đồ thị lớn
Viet-Trung TRAN
 
Mapreduce simplified-data-processing
Viet-Trung TRAN
 
Tìm kiếm needle trong Haystack: Hệ thống lưu trữ ảnh của Facebook
Viet-Trung TRAN
 
giasan.vn real-estate analytics: a Vietnam case study
Viet-Trung TRAN
 
Giasan.vn @rstars
Viet-Trung TRAN
 
A Vietnamese Language Model Based on Recurrent Neural Network
Viet-Trung TRAN
 
A Vietnamese Language Model Based on Recurrent Neural Network
Viet-Trung TRAN
 
Large-Scale Geographically Weighted Regression on Spark
Viet-Trung TRAN
 
Recent progress on distributing deep learning
Viet-Trung TRAN
 
success factors for project proposals
Viet-Trung TRAN
 
GPSinsights poster
Viet-Trung TRAN
 
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...
Viet-Trung TRAN
 
Deep learning for nlp
Viet-Trung TRAN
 
Introduction to BigData @TCTK2015
Viet-Trung TRAN
 
From neural networks to deep learning
Viet-Trung TRAN
 
From decision trees to random forests
Viet-Trung TRAN
 
Recommender systems: Content-based and collaborative filtering
Viet-Trung TRAN
 
3 - Finding similar items
Viet-Trung TRAN
 
Ad

Recently uploaded (20)

PDF
Complete JavaScript Notes: From Basics to Advanced Concepts.pdf
haydendavispro
 
PDF
Timothy Rottach - Ramp up on AI Use Cases, from Vector Search to AI Agents wi...
AWS Chicago
 
PDF
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
PPTX
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
PDF
Blockchain Transactions Explained For Everyone
CIFDAQ
 
PDF
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
PDF
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
PDF
Python basic programing language for automation
DanialHabibi2
 
PPTX
OpenID AuthZEN - Analyst Briefing July 2025
David Brossard
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PDF
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
PDF
SWEBOK Guide and Software Services Engineering Education
Hironori Washizaki
 
PDF
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
PDF
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
PDF
Chris Elwell Woburn, MA - Passionate About IT Innovation
Chris Elwell Woburn, MA
 
PDF
Log-Based Anomaly Detection: Enhancing System Reliability with Machine Learning
Mohammed BEKKOUCHE
 
PPTX
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
PPTX
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
PDF
July Patch Tuesday
Ivanti
 
Complete JavaScript Notes: From Basics to Advanced Concepts.pdf
haydendavispro
 
Timothy Rottach - Ramp up on AI Use Cases, from Vector Search to AI Agents wi...
AWS Chicago
 
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
Blockchain Transactions Explained For Everyone
CIFDAQ
 
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
Python basic programing language for automation
DanialHabibi2
 
OpenID AuthZEN - Analyst Briefing July 2025
David Brossard
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
SWEBOK Guide and Software Services Engineering Education
Hironori Washizaki
 
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
Chris Elwell Woburn, MA - Passionate About IT Innovation
Chris Elwell Woburn, MA
 
Log-Based Anomaly Detection: Enhancing System Reliability with Machine Learning
Mohammed BEKKOUCHE
 
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
July Patch Tuesday
Ivanti
 

OCR processing with deep learning: Apply to Vietnamese documents

  • 1. OCR PROCESSING WITH DEEP LEARNING: APPLY TO VIETNAMESE DOCUMENTS VIET-TRUNG TRAN, ANH PHI NGUYEN, KHUYEN NGUYEN
  • 2. OUTLINE • OCR overview • History • Pipelining • Deep learning for OCR • Motivation • Connectionist temporal classification (CTC) network • LSTM + CTC for sequence recognition
  • 3. WHAT IS OCR • Optical character recognition (optical character reader) (OCR) is the mechanical or electronic conversion of images of typed, handwritten or printed text into machine-encoded text
  • 4. OCR TYPES • Optical Character Recognition (OCR) • Targets typewritten text, one character at a time • Optical Word Recognition (OWR) • Typewritten text, one word at a time • Intelligent Character Recognition (ICR) • Handwritten print script, one character at a time • Intelligent Word Recognition (IWR) • Handwritten, one word at a time
  • 5. HISTORY OF OCR: TESSERACT OCR ENGINE TIMELINE
  • 9. PAGE LAYOUT ANALYSIS Smith, Ray. "Hybrid page layout analysis via tab-stop detection." Document Analysis and Recognition, 2009. ICDAR'09. 10th International Conference on. IEEE, 2009.
  • 10. IMAGE LEVEL PAGE LAYOUT ANALYSIS • Using the morphological processing from Leptonica • http://www.slideshare.net/versae/javier-de-larosacs9883-5912825
  • 15. FEATURES AND WORD CLASSIFIER Classical character classification
  • 17. CHAR SEGMENTATION, LANGUAGE MODEL AND BEAM SEARCH
  • 18. OCR CHALLENGES 1. Fonts specifics Never overcome their ability to understand a limited numbers of fonts and page formats 2. Character bounding boxes 3. Extracting features unreliable 4. Slow performance
  • 20. RECENT IMPROVEMENTS 1. Multilanguages 2. Full layout analysis 3. Table detection 4. Equation detection 5. Better language models 6. Hand-written text
  • 21. LSTM FOR TEXT RECOGNITION
  • 22. MOTIVATION • Segmentation is difficult for cursive or unconstrained text • R. Smith, “History of the Tesseract OCR engine: what worked and what didn’t ,” in DRR XX, San Francisco, USA, Feb. 2013. • there was not a single method proposed for OCR, that can achieve very low error rates without using aforementioned sophisticated post-processing techniques.
  • 23. RESEARCH BREAKTHROUGH A. Graves, M. Liwicki, S. Fernandez, Bertolami, H. Bunke, and J. Schmidhuber, “A Novel Connectionist System for Unconstrained Handwriting Recognition,” IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 31, no. 5, pp. 855–868, May 2008.
  • 26. MOTIVATION • Real-world sequence learning task • OCR (Optical character recognition) • ASR (Automatic speech recognition) • Requires • prediction of sequences of labels from noisy, unsegmented input data • Recurrent neural networks (RNN) can be used for sequence learning, but ask for • pre-segmented training data • post-processing to transform outputs into label sequences
  • 27. CONNECTIONIST TEMPORAL CLASSIFICATION (CTC) • Graves, Alex, et al. "Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks." Proceedings of the 23rd international conference on Machine learning. ACM, 2006. • WHAT CTC IS ALL ABOUT? •a novel method for training RNNs to label unsegmented sequences directly
  • 34. DYNAMIC TIME WRAPERING • Because the length of y might differ from (often longer than) l, so the inference of l from y is actually a dynamic time warping problem.
  • 37. CONNECTIONIST TEMPORAL CLASSIFICATION • o transform the network outputs into a conditional probability distribution over label sequences • A CTC network has a softmax output layer with one more unit than there are labels in L • activations of the first |L| units are interpreted as the probabilities of observing the corresponding labels at particular times • activation of the extra unit is the probability of observing a ‘blank’, or no label
  • 38. PREFIX SEARCH DECODING ON THE LABEL ALPHABET X,Y
  • 40. LONG SHORT-TERM MEMORY (LSTM) • One type of RNN networks • RNN vanishing gradient problem • influence of a given input on the hidden layer, and therefore on the network output, either decays or blows up exponentially as it cycles around the network’s recurrent connections • LSTM is designed to address vanishing gradient problem • An LSTM hidden layer consists of recurrently connected subnets, called memory blocks • Each block contains a set of internal units, or cells, whose activation is controlled by three multiplicative gates: the input gate, forget gate and output gate
  • 45. LSTM -> CTC OUTPUT LAYER: OCR
  • 46. DEMO TIME: OCR FOR VIETNAMESE DOCUMENTS Thank you!
  • 47. REFERENCES - CREDITS • https://github.com/yiwangbaidu/notes/blob/master/CTC/CTC.pdf • http://colah.github.io/posts/2015-08-Understanding-LSTMs/ • Ray Smith. Everything you always wanted to know about Tesseract. Tesseract tutorial @ DAS 2014

Editor's Notes

  • #8: http://www.slideshare.net/100002968637682/seminar-p2
  • #9: http://www.slideshare.net/DocuFi/improve-ocr-accuracy-with-cleanup?qid=a439a1d4-ad96-4fb3-b376-9b301e58674b&v=default&b=&from_search=13
  • #18: http://www.slideshare.net/temsolin/6-char-segmentation