Automated Cheque Processing of
Indian Bank Cheques
Story of Cheque Transactions
Bengal Bank,
established in 1784,
was the first bank to
introduce cheque
system in India.
Majority the
cheques processed
are hand written
In 2017, 405 million
cheques were used
for payments and to
acquire cash across
the UK
The Problem Begins…
Manual evaluation
Human errors
Workforce
Expensive
Proposed Solution
Automation
Recognition of handwritten
information (digits)
Reduce manual effort
Reduce delays
Validations
What More?
Webservice Integration
Digitization for auditing
Faster operations
Parallel Processing
How is it done?
Automated Handwritten Digit
Recognition
Fragmentation and
Identification
Neural Networks
Validation
What did we use?
 MNIST Dataset
 Python
 Image processing libraries (PIL)
 Computer Vision libraries (OpenCV)
 Keras with Tensorflow Backend
 CNN
Preprocessing
 Extract Amount ROI
 Identifying ROI using
pattern Matching
 Convert to greyscale (invert)
and intensify
 Extract Date ROI
 Segment Date ROI and Amount ROI, to get
individual digit images
 Resize the digit images to 28 x 28
Algorithms
 Random Forest
 Pros:
 Used for uneven dataset
and missing values
 Cons:
 Tendency to Overfit
 Neural Networks
 Pros:
 Fault tolerance, parallel
processing
 Cons:
 Blackbox solution
Architecture
Scanned
Images
Image
Processor
ROI Extractor
Digit
Segmenter
Digit Identifier
using Model
Combiner
Business
Validator
Code Concepts
 Image Processing
 Resize
 Convert to greyscale (Invert)
 Intensify
 ROI Extraction
 Sectioning the area
 Pattern Matching
Code Concepts
 Digit Segmentation
 Find Contours
 Find bounding boxes and extract
 Pad digit with black background
 Digit Identification
 Load model and identify the digit
Code Concepts
 Combiner
 Get date string and convert to datetime
 Get Amount as string and convert to int
 Business Validation
 Read Validation CSV and Compare Values
 Print Results
amtPredicted dtPredicted fname CorrectAmtPrediction CorrectDtPrediction Lessthan90days validCheque humanValNeeded
567 29/11/18cheque_3_noisy.jpg TRUE TRUE TRUE TRUE 0
29 19/07/15
cheque_4_amountStarted0.jp
g TRUE TRUE FALSE FALSE 0
10000 11/02/19cheque_5_comma.jpg TRUE TRUE TRUE TRUE 0
1234 10/10/18cheque_2.jpg TRUE TRUE FALSE FALSE 0
127 22/05/17hdfc_joinedNumbers.jpg FALSE FALSE FALSE FALSE 1
890 01/12/18cheque_1.jpg TRUE TRUE TRUE TRUE 0
Code Concepts
 Model
 Image Data Generator
Results
 ROI Extraction accuracy
IOU / Jaccard Index
 MNIST accuracy
Model training
Overall accuracy 95-98%
Digit level misclassifications
Confusion Matrix
Classification Report
Production Plan
 Use more than one algorithm to predict
digits
 Provide as a web service
 Provide more configurable options for
generalization
 Use rule based predictions for Date
Challenges
 Joined Digits
 Model accuracy for Digits like 0 and 1
 Orientation of Image
 Image Quality and Noise
 Special symbols
 Decimal values
Automated cheque recognition

More Related Content

PDF
Explainable AI - making ML and DL models more interpretable
PDF
Iib v10 performance problem determination examples
PDF
Jonathan Ronen - Variational Autoencoders tutorial
PPT
Technical and Product Debt Management
PPSX
Vertex Shader Tricks by Bill Bilodeau - AMD at GDC14
PDF
Financial Performance KPI Dashboard Showing Gross Profit Margin Sales Growth ...
PDF
A Comparison of Loss Function on Deep Embedding
PPTX
Forts and Fights Scaling Performance on Unreal Engine*
Explainable AI - making ML and DL models more interpretable
Iib v10 performance problem determination examples
Jonathan Ronen - Variational Autoencoders tutorial
Technical and Product Debt Management
Vertex Shader Tricks by Bill Bilodeau - AMD at GDC14
Financial Performance KPI Dashboard Showing Gross Profit Margin Sales Growth ...
A Comparison of Loss Function on Deep Embedding
Forts and Fights Scaling Performance on Unreal Engine*

What's hot (15)

PDF
Fuzzy Matching on Apache Spark with Jennifer Shin
PDF
선형 최소 자승 최적화
PPTX
Digital Business - Accenture
PDF
Cloud Training Powerpoint Presentation Slides
PDF
Bindless Deferred Decals in The Surge 2
PPTX
[TGDF 2019] Mali GPU Architecture and Mobile Studio
PPT
It Service Management Implementation Overview
PDF
Procurement Purchase Orders And Cost Reduction Dashboard
PDF
Near Real-Time Netflix Recommendations Using Apache Spark Streaming with Nit...
PDF
The Technology of Uncharted: Drake’s Fortune
PPTX
Domain storytelling-one size fit all process
PDF
Road Map - ITIL Implemetation
PPTX
eScience SHAP talk
PPTX
How to Deploy & Integrate Oracle EPM Cloud Profitability and Cost Management ...
PPTX
IT Service Management Overview
Fuzzy Matching on Apache Spark with Jennifer Shin
선형 최소 자승 최적화
Digital Business - Accenture
Cloud Training Powerpoint Presentation Slides
Bindless Deferred Decals in The Surge 2
[TGDF 2019] Mali GPU Architecture and Mobile Studio
It Service Management Implementation Overview
Procurement Purchase Orders And Cost Reduction Dashboard
Near Real-Time Netflix Recommendations Using Apache Spark Streaming with Nit...
The Technology of Uncharted: Drake’s Fortune
Domain storytelling-one size fit all process
Road Map - ITIL Implemetation
eScience SHAP talk
How to Deploy & Integrate Oracle EPM Cloud Profitability and Cost Management ...
IT Service Management Overview
Ad

Similar to Automated cheque recognition (20)

PPTX
big-data-anallytics.pptx
PPT
Cdp Blog
PDF
Fraud prevention is better with TigerGraph inside
PDF
Review of Various Image Processing Techniques for Currency Note Authentication
PDF
IRJET - Fake Currency Detection using CNN
PPTX
Comparative Study of ML Techniques for RealTime Credit Card Fraud Detection S...
PDF
fundamentalsofeventdrivenmicroservices11728489736099.pdf
PDF
A Review of deep learning techniques in detection of anomaly incredit card tr...
PDF
Explainable AI
PDF
IRJET- Credit Card Fraud Detection using Machine Learning
PPTX
Credit card fraud detection using python machine learning
PDF
Understanding IDP: Data Validation and Feedback Loop
PDF
Credit card fraud detection through machine learning
PDF
A Comparative Study for Credit Card Fraud Detection System using Machine Lear...
PPTX
Untitled presentation on support system for Btech
PPT
50409621003 fingerprint recognition system-ppt
PDF
Automation Tool Development to Improve Machine Results using Data Analysis
PDF
Credit Card Fraud Detection Using Machine Learning & Data Science
PDF
Credit Card Fraud Detection Using Machine Learning & Data Science
PPTX
artificialintelligencedata driven analytics23.pptx
big-data-anallytics.pptx
Cdp Blog
Fraud prevention is better with TigerGraph inside
Review of Various Image Processing Techniques for Currency Note Authentication
IRJET - Fake Currency Detection using CNN
Comparative Study of ML Techniques for RealTime Credit Card Fraud Detection S...
fundamentalsofeventdrivenmicroservices11728489736099.pdf
A Review of deep learning techniques in detection of anomaly incredit card tr...
Explainable AI
IRJET- Credit Card Fraud Detection using Machine Learning
Credit card fraud detection using python machine learning
Understanding IDP: Data Validation and Feedback Loop
Credit card fraud detection through machine learning
A Comparative Study for Credit Card Fraud Detection System using Machine Lear...
Untitled presentation on support system for Btech
50409621003 fingerprint recognition system-ppt
Automation Tool Development to Improve Machine Results using Data Analysis
Credit Card Fraud Detection Using Machine Learning & Data Science
Credit Card Fraud Detection Using Machine Learning & Data Science
artificialintelligencedata driven analytics23.pptx
Ad

Recently uploaded (20)

PPT
Classification methods in data analytics.ppt
PDF
Buddhism presentation about world religion
PDF
General category merit rank list for neet pg
PPTX
lung disease detection using transfer learning approach.pptx
PPTX
Chapter security of computer_8_v8.1.pptx
PDF
9 FinOps Tools That Simplify Cloud Cost Reporting.pdf
PDF
Teal Blue Futuristic Metaverse Presentation.pdf
PPTX
1.Introduction to orthodonti hhhgghhcs.pptx
PPTX
inbound2857676998455010149.pptxmmmmmmmmm
PPTX
PPT for Diseases (1)-2, types of diseases.pptx
PPTX
Bussiness Plan S Group of college 2020-23 Final
PPTX
Reinforcement learning in artificial intelligence and deep learning
PPTX
ch20 Database System Architecture by Rizvee
PDF
Mcdonald's : a half century growth . pdf
PPTX
Sheep Seg. Marketing Plan_C2 2025 (1).pptx
PDF
Introduction to Database Systems Lec # 1
PPTX
research framework and review of related literature chapter 2
PDF
newhireacademy couselaunchedwith pri.pdf
PDF
PPT nikita containers of the company use
PPTX
DIGITAL DESIGN AND.pptx hhhhhhhhhhhhhhhhh
Classification methods in data analytics.ppt
Buddhism presentation about world religion
General category merit rank list for neet pg
lung disease detection using transfer learning approach.pptx
Chapter security of computer_8_v8.1.pptx
9 FinOps Tools That Simplify Cloud Cost Reporting.pdf
Teal Blue Futuristic Metaverse Presentation.pdf
1.Introduction to orthodonti hhhgghhcs.pptx
inbound2857676998455010149.pptxmmmmmmmmm
PPT for Diseases (1)-2, types of diseases.pptx
Bussiness Plan S Group of college 2020-23 Final
Reinforcement learning in artificial intelligence and deep learning
ch20 Database System Architecture by Rizvee
Mcdonald's : a half century growth . pdf
Sheep Seg. Marketing Plan_C2 2025 (1).pptx
Introduction to Database Systems Lec # 1
research framework and review of related literature chapter 2
newhireacademy couselaunchedwith pri.pdf
PPT nikita containers of the company use
DIGITAL DESIGN AND.pptx hhhhhhhhhhhhhhhhh

Automated cheque recognition

  • 1. Automated Cheque Processing of Indian Bank Cheques
  • 2. Story of Cheque Transactions Bengal Bank, established in 1784, was the first bank to introduce cheque system in India. Majority the cheques processed are hand written In 2017, 405 million cheques were used for payments and to acquire cash across the UK
  • 3. The Problem Begins… Manual evaluation Human errors Workforce Expensive
  • 4. Proposed Solution Automation Recognition of handwritten information (digits) Reduce manual effort Reduce delays Validations
  • 5. What More? Webservice Integration Digitization for auditing Faster operations Parallel Processing
  • 6. How is it done? Automated Handwritten Digit Recognition Fragmentation and Identification Neural Networks Validation
  • 7. What did we use?  MNIST Dataset  Python  Image processing libraries (PIL)  Computer Vision libraries (OpenCV)  Keras with Tensorflow Backend  CNN
  • 8. Preprocessing  Extract Amount ROI  Identifying ROI using pattern Matching  Convert to greyscale (invert) and intensify  Extract Date ROI  Segment Date ROI and Amount ROI, to get individual digit images  Resize the digit images to 28 x 28
  • 9. Algorithms  Random Forest  Pros:  Used for uneven dataset and missing values  Cons:  Tendency to Overfit  Neural Networks  Pros:  Fault tolerance, parallel processing  Cons:  Blackbox solution
  • 11. Code Concepts  Image Processing  Resize  Convert to greyscale (Invert)  Intensify  ROI Extraction  Sectioning the area  Pattern Matching
  • 12. Code Concepts  Digit Segmentation  Find Contours  Find bounding boxes and extract  Pad digit with black background  Digit Identification  Load model and identify the digit
  • 13. Code Concepts  Combiner  Get date string and convert to datetime  Get Amount as string and convert to int  Business Validation  Read Validation CSV and Compare Values  Print Results amtPredicted dtPredicted fname CorrectAmtPrediction CorrectDtPrediction Lessthan90days validCheque humanValNeeded 567 29/11/18cheque_3_noisy.jpg TRUE TRUE TRUE TRUE 0 29 19/07/15 cheque_4_amountStarted0.jp g TRUE TRUE FALSE FALSE 0 10000 11/02/19cheque_5_comma.jpg TRUE TRUE TRUE TRUE 0 1234 10/10/18cheque_2.jpg TRUE TRUE FALSE FALSE 0 127 22/05/17hdfc_joinedNumbers.jpg FALSE FALSE FALSE FALSE 1 890 01/12/18cheque_1.jpg TRUE TRUE TRUE TRUE 0
  • 14. Code Concepts  Model  Image Data Generator
  • 15. Results  ROI Extraction accuracy IOU / Jaccard Index  MNIST accuracy Model training Overall accuracy 95-98% Digit level misclassifications Confusion Matrix Classification Report
  • 16. Production Plan  Use more than one algorithm to predict digits  Provide as a web service  Provide more configurable options for generalization  Use rule based predictions for Date
  • 17. Challenges  Joined Digits  Model accuracy for Digits like 0 and 1  Orientation of Image  Image Quality and Noise  Special symbols  Decimal values