SlideShare a Scribd company logo
International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163
Issue 02, Volume 6 (February 2019) www.ijirae.com
_________________________________________________________________________________________________
IJIRAE: Impact Factor Value – Mendeley (Elsevier Indexed); Citefactor 1.9 (2017); SJIF: Innospace, Morocco
(2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35
IJIRAE © 2014- 19, All Rights Reserved Page–55
OPTICAL CHARACTER RECOGNITION USING RBFNN
R.Seenu
Department of Computer Science and Engineering,
Annamalai University, Tamilnadu, India
Mca_seenu@hotmail.com;
Manuscript History
Number: IJIRAE/RS/Vol.06/Issue02/FBAE10087
Received: 26, January 2019
Final Correction: 07, February 2019
Final Accepted: 22, February 2019
Published: February 2019
Citation: Seenu (2019). OPTICAL CHARACTER RECOGNITION USING RBFNN. IJIRAE::International Journal of
Innovative Research in Advanced Engineering, Volume VI, 55-59. doi: 10.26562/IJIRAE.2019.FBAE10087
Editor: Dr.A.Arul L.S, Chief Editor, IJIRAE, AM Publications, India
Copyright: ©2019 This is an open access article distributed under the terms of the Creative Commons Attribution
License, Which Permits unrestricted use, distribution, and reproduction in any medium, provided the original author
and source are credited
Abstract— Optical character recognition (OCR) is process of classification of optical patterns contained in a
digital image. The process of OCR Recognition involves several steps including pre-processing, segmentation,
feature extraction, classification. Pre-processing is for done the basic operation on input image like noise
reduction which remove the noisy signal from image. Segmentation stage for segment the given image into line by
line and segment each character from segmented line. Future extraction calculates the characteristics of character.
A Radial Basis Function Neural Network (RBFNN) is used to classification contains the database and does the
comparison.
Keywords— Optical character recognition (OCR); Pre-processing; Segmentation; Feature Extraction; RBFNN;
I. INTRODUCTION
OCR has gained increasing attention in both academic research and in industry. OCR technology enables us to
convert different types of documents such as scanned paper documents, pdf files or images captured by a digital
camera into editable and searchable data. OCR systems have become one of the most successful applications of
technology in pattern recognition and artificial intelligence fields [1]. Though many commercial systems for
performing OCR exist for a wide variety of applications, the avail-able machines are still not able to compete with
human reading capabilities with desired accuracy levels. Conversion of handwritten characters is important for
making several important documents related to our history, such as manuscripts into machine editable form, so
that, it can be easily accessed and pres independent work is going on in Optical Character Recognition that is the
processing of printed/computer generated document, handwritten and manually created document processing i.e.
handwritten character recognition. External factors like pressure, speed of writing have any influence in case of
offline system but they have great impact on online system. Again, offline or online system canbe applied on
optical character Fig 1.
Fig 1. (a) Optical character (b) Handwritten character.
Character recognition is an art of detecting segmenting and identifying characters from image [2]. More precisely,
character recognition is a process of detecting and recognizing characters from input image and converts it into
American Standard Code for Information Interchange (ASCII) or other equivalent machine editable form [3].
International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163
Issue 02, Volume 6 (February 2019) www.ijirae.com
_________________________________________________________________________________________________
IJIRAE: Impact Factor Value – Mendeley (Elsevier Indexed); Citefactor 1.9 (2017); SJIF: Innospace, Morocco
(2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35
IJIRAE © 2014- 19, All Rights Reserved Page–56
Character recognition is generally known as optical character recognition because it uses optical means to acquire
the characters. Optical character recognition has huge applications including conversion of any handwritten
document into structural text form, automatic number plate recognition, reading aid for blind, bank cheques etc.
Character is the basic building block of any language that is used to build different structure of a language [4].
Character recognition is a process which associates with a symbolic meaning with objects (letters, symbols &
numbers) drawn on an image. OCR contributes immensely to the achievement of automation process and
improves the interface between man and machine in numerous applications. OCR deals with the problem of
recognizing optically processed characters. Optical recognition is performed offline after the writing or printing
has been completed whereas the online recognition is achieved where computer recognizes the characters as they
are drawn. Both hand printed and printed characters may be recognized but the performance is directly
dependent upon the quality of input documents. The more constrained the input is, better is the performance of
OCR system. But when it comes to totally unconstrained hand-writing performance of OCR machines is still
questionable. The Fig. 2 shows the schematic representation of different areas of character recognition.
Fig. 2. The different areas of character recognition
II. IMPLEMENTATION
A typical OCR system consists of several components as shown in Fig. 3. The first step is to digitize analog
document using an optical scanner. When regions containing text are located each symbol is extracted through
segmentation process [6]. The extracted symbols are pre-processed, eliminating noise to facilitate feature
extraction. The identity of each symbol is found by comparing extracted features with descriptions of symbol
classes obtained through a previous learning phase. Finally contextual information is used to reconstruct words
and numbers of the original text.
Fig. 3 The components of an OCR system for Proposed System
International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163
Issue 02, Volume 6 (February 2019) www.ijirae.com
_________________________________________________________________________________________________
IJIRAE: Impact Factor Value – Mendeley (Elsevier Indexed); Citefactor 1.9 (2017); SJIF: Innospace, Morocco
(2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35
IJIRAE © 2014- 19, All Rights Reserved Page–57
A. Input Scanned Written Text
The proposed OCR starts with image acquisition process that takes an input of a digital image by using a digital
camera or scanner. The first component in OCR is optical scanning. Through scanning process digital image of
original document is captured. In OCR optical scanners are used which consist of transport mechanism and
sensing device that converts light intensity into grey levels. Printed documents consist of black print on white
background. When performing OCR multilevel image is converted into bi-level black and white image.
B. Pre-processing
The raw data depending on the data acquisition type is subjected to a number of preliminary processing steps to
make it usable in the descriptive stages of character analysis. The image resulting from scanning process may
contain certain amount of noise. Depending on the scanner resolution and the inherent thresholding, the
characters may be smeared or broken. Pre-processing of the character image is much significant for maintain the
novelty of the image for classification. Here in the pre-processing phase we have performed filtering of image
using median filter. Subsequently we make normalize and performed skeleton of images and extraction of
boundary edge pixel points [7].
C. Segmentation
The pre-processing stage yields a clean character image in the sense that a sufficient amount of shape information,
high compression, and low noise on a normalized image is obtained. The next OCR component is segmentation.
Here the character image is segmented into its subcomponents. Segmentation is important because the extent one
can reach in separation of the various lines in the characters directly affects the recognition rate. Internal
segmentation is used here which isolates lines and curves in the cursively written characters.
D. Feature Extraction
The objective of feature extraction is to capture essential characteristics of symbols. Feature extraction is accepted
as one of the most difficult problems of pattern recognition. The most straight forward way of describing
character is by actual raster image. Another approach is to extract certain features that characterize symbols but
leaves the unimportant attributes. Feature extraction techniques are used to evaluate the uniqueness of each
character image by which they differs from the rest character images [8].
E. Classification
Classification is one the vital stage of the any recognition model. After evaluating the desired key feature values
we process these vector to classifier separately and noted down the overall recognition accuracy. Radial Basis
Function Neural Network (RBFNN) has been used as a classifier. There are three layers in RBFNN network namely:
an input layer, a hidden layer and an output layer. In this paper, a number of samples of two English characters
have been taken.
Fig. 4: RBFNN Architecture
International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163
Issue 02, Volume 6 (February 2019) www.ijirae.com
_________________________________________________________________________________________________
IJIRAE: Impact Factor Value – Mendeley (Elsevier Indexed); Citefactor 1.9 (2017); SJIF: Innospace, Morocco
(2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35
IJIRAE © 2014- 19, All Rights Reserved Page–58
Therefore, there are only two outputs at output layer. The outputs of output layer represents whether the
character has been correctly recognized or not. The structure of multi-input and multi-output RBF neural network
is represented by Fig. 4.The parameters of an RBF type neural network consist of the centers spread the basic
functions at the hidden layer nodes and the synaptic weights of the output layer nodes. The RBF centers are also
points in the input space. It would be ideal to have them at each distinct point on the input space, but for any
realistic problem, only a few input points from all available points are selected using clustering [5].
III.RESULTS AND DISCUSSIONS
In feature extraction we have proposed an algorithm for evaluation of feature vector for our implementation of
recognition system we have placed the image into a 3×3 grid into image. All the operations were performed over
skeletonized image of handwritten characters. We have drawn the horizontal and vertical symmetric projection
angel to the nearest pixel of the image. From which we have calculated the horizontal and vertical Euclidean
distance for the same nearest pixel from centroid of each zone. Then we have calculated the mean Euclidean
distance as well as the mean angular values of the zones with respect to the midpoint of symmetry axes. This is
considered as the key feature values of our proposed system. The model generates two set of parameter for each
row symmetry axis and column symmetry axis. Where one parameter show mean value of Ecludian distance of
every symmetry axis pixel position to centroid of each zone and other parameter shows the angle between the
points of the symmetry axis to the centroid of each zone the respective image. Thereafter we obtained four
parameter of each image.
One category is appended for recognized and the other category is not recognized for the written character. By
using the feature values with appended value RBFNN training is carried out. For testing the feature extraction is
done on different used in the training set. The RBFN is trained by adaptively updating the free parameters, i.e.
center and width of the basis function, and the weight between the hidden and output neurons of the network. To
select an optimal RBFN model, the number of neurons in the hidden layer was varied from 2 to 30, and the
learning rate was varied between 0.05 and 0.5. The initial basis function centers were chosen randomly from the
input space, and the initial weight values were chosen randomly between ±0.9. Normalized datasets were used for
the training, testing, and validation of the RBFN model. The best network was found to be one having 26 basis
functions with a learning rate of 0.9 and 0.05 for center and weight respectively. The prediction errors of the
validation patterns are larger because these patterns are outside the training space. The Fig. 5 shows the
comparison of various means in RBFNN.
Fig: 5 Comparison graph for various means in RBFNN
Experimental result shows that this directional group values technique requires less training time. In this paper,
recognition accuracy, training time and classification time obtained by implementing RBFNN better accuracy with
requires less training time and classification time. Implementing directional group values with RBFNN yields
recognition accuracy around 88% accuracy achieved It is also analyzed during experiment that only 50 iterations
are sufficient for training the network using this technique.
International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163
Issue 02, Volume 6 (February 2019) www.ijirae.com
_________________________________________________________________________________________________
IJIRAE: Impact Factor Value – Mendeley (Elsevier Indexed); Citefactor 1.9 (2017); SJIF: Innospace, Morocco
(2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35
IJIRAE © 2014- 19, All Rights Reserved Page–59
IV. CONCLUSIONS
The system recognized the written character. It is currently the state of the art approach for categorization. In order
to the process of OCR Recognition involved several steps including pre-processing, segmentation, feature extraction,
classification is done the basic operation on input image like noise reduction which remove the noisy signal from
image. Segmentation stage for segment the given image into line by line and segment each character from
segmented line. Future extraction calculates the characteristics of character. A Radial Basis Function Neural
Network (RBFNN) is used to recognize rate of 88% accuracy.
REFERENCES
1. Bunke, H., Wang, P. S. P. (Editors), Handbook of Character Recognition and Document Image Analysis, World
Scientific, 1997.
2. Kai Ding, Zhibin Liu, LianwenJin, Xinghua Zhu, A Comparative study of GABOR feature and gradient feature for
handwritten 17hinese character recognition, International Conference on Wavelet Analysis and Pattern
Recognition, pp. 1182-1186, Beijing, China, 2-4 Nov. 2007.
3. Pranob K Charles, V.Harish, M.Swathi, CH. Deepthi, "A Review on the Various Techniques used for Optical
Character Recognition", International Journal of Engineering Research and Applications, Vol. 2, Issue 1, pp.
659-662, Jan-Feb 2012.
4. M. F. Kader and K. Deb, “Neural network based English alphanumeric character recognition”, International
Journal of Computer Science, Engineering and Applications (IJCSEA)Vol.2, No.4, August 2012
5. D.Tjondronegoro, Y.Chen, and B.Pham, “The power of play break for automatic detection and browsing of self
consumable sport video highlights”, In Proceedings of the ACM Workshop on Multimedia Information Retrieval,
pp. 267-274, 2004.
6. Saleem Pasha, M.C.Padma” Handwritten Kannada Character Recognition using Wavelet Transform and
Structural Features” International Conference on Emerging Research in Electronics, Computer Science and
Technology,pp 346- 351, IEEE 2015
7. Kalyan S Dash , N.B. Puhan and Ganapati Panda , “BESAC: Binary External Symmetry Axis Constellation for
unconstrained handwritten character recognition” Pattern Recognition Letters, June 25, 2016.
8. Krupa Dholakia “A Survey on Handwritten Character Recognition Techniques For Various Indian Language”
International Journal Of Computer Application (0975-8887) Volume 115-No. 1, April 2015

More Related Content

What's hot (20)

PDF
IRJET- Photo Optical Character Recognition Model
IRJET Journal
 
PDF
Optical Character Recognition from Text Image
Editor IJCATR
 
PPTX
Optical Character Recognition (OCR) based Retrieval
Biniam Asnake
 
PPTX
Optical Character Recognition
Durjoy Saha
 
PDF
Ocr 1
Manoj Nanduri
 
PDF
Optical Character Recognition (OCR) System
iosrjce
 
PPT
optical character recognition system
Vijay Apurva
 
PDF
Comparative Analysis of PSO and GA in Geom-Statistical Character Features Sel...
IJERA Editor
 
PDF
Ijartes v1-i2-001
IJARTES
 
PDF
Co4201605611
IJERA Editor
 
PDF
A Review of Optical Character Recognition System for Recognition of Printed Text
iosrjce
 
PDF
Character Recognition (Devanagari Script)
IJERA Editor
 
PDF
IRJET- Gesture Recognition for Indian Sign Language using HOG and SVM
IRJET Journal
 
PDF
Optical Character Recognition System for Urdu (Naskh Font)Using Pattern Match...
CSCJournals
 
PDF
Design and Description of Feature Extraction Algorithm for Old English Font
IRJET Journal
 
PDF
OFFLINE SIGNATURE VERIFICATION SYSTEM FOR BANK CHEQUES USING ZERNIKE MOMENTS,...
ijaia
 
DOCX
Opticalcharacter recognition
Shobhit Saxena
 
PDF
ARABIC ONLINE HANDWRITING RECOGNITION USING NEURAL NETWORK
ijaia
 
PDF
IRJET- Cheque Bounce Detection System using Image Processing
IRJET Journal
 
PDF
Handwritten Text Recognition and Digital Text Conversion
ijtsrd
 
IRJET- Photo Optical Character Recognition Model
IRJET Journal
 
Optical Character Recognition from Text Image
Editor IJCATR
 
Optical Character Recognition (OCR) based Retrieval
Biniam Asnake
 
Optical Character Recognition
Durjoy Saha
 
Optical Character Recognition (OCR) System
iosrjce
 
optical character recognition system
Vijay Apurva
 
Comparative Analysis of PSO and GA in Geom-Statistical Character Features Sel...
IJERA Editor
 
Ijartes v1-i2-001
IJARTES
 
Co4201605611
IJERA Editor
 
A Review of Optical Character Recognition System for Recognition of Printed Text
iosrjce
 
Character Recognition (Devanagari Script)
IJERA Editor
 
IRJET- Gesture Recognition for Indian Sign Language using HOG and SVM
IRJET Journal
 
Optical Character Recognition System for Urdu (Naskh Font)Using Pattern Match...
CSCJournals
 
Design and Description of Feature Extraction Algorithm for Old English Font
IRJET Journal
 
OFFLINE SIGNATURE VERIFICATION SYSTEM FOR BANK CHEQUES USING ZERNIKE MOMENTS,...
ijaia
 
Opticalcharacter recognition
Shobhit Saxena
 
ARABIC ONLINE HANDWRITING RECOGNITION USING NEURAL NETWORK
ijaia
 
IRJET- Cheque Bounce Detection System using Image Processing
IRJET Journal
 
Handwritten Text Recognition and Digital Text Conversion
ijtsrd
 

Similar to OPTICAL CHARACTER RECOGNITION USING RBFNN (20)

DOCX
Optical character recognition IEEE Paper Study
Er. Ashish Pandey
 
PDF
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
IRJET Journal
 
PPTX
OCR Presentation hjhPresentation 23.pptx
SupriyaGhosh51
 
PDF
Number plate recognition using ocr technique
eSAT Journals
 
PDF
IRJET- Intelligent Character Recognition of Handwritten Characters
IRJET Journal
 
PDF
IRJET-Optical Character Recognition using ANN
IRJET Journal
 
PDF
Volume 2-issue-6-2009-2015
Editor IJARCET
 
PDF
Volume 2-issue-6-2009-2015
Editor IJARCET
 
PDF
Optical character recognition an encompassing review
eSAT Journals
 
PDF
D017222226
IOSR Journals
 
PDF
O45018291
IJERA Editor
 
PDF
Optical Character Recognition Using Python
YogeshIJTSRD
 
PDF
IRJET- Wearable AI Device for Blind
IRJET Journal
 
PDF
IRJET- Offline Transcription using AI
IRJET Journal
 
PDF
IRJET- Intelligent Character Recognition of Handwritten Characters using ...
IRJET Journal
 
PDF
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
ijcsitcejournal
 
PDF
­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...
IRJET Journal
 
PDF
IRJET- Information Retrieval & Text Analytics using Artificial Intelligence
IRJET Journal
 
PDF
LICENSE NUMBER PLATE RECOGNITION SYSTEM USING ANDROID APP
Aditya Mishra
 
PDF
Optical Character Recognition deep learning .pdf
navenjeyamurugan
 
Optical character recognition IEEE Paper Study
Er. Ashish Pandey
 
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
IRJET Journal
 
OCR Presentation hjhPresentation 23.pptx
SupriyaGhosh51
 
Number plate recognition using ocr technique
eSAT Journals
 
IRJET- Intelligent Character Recognition of Handwritten Characters
IRJET Journal
 
IRJET-Optical Character Recognition using ANN
IRJET Journal
 
Volume 2-issue-6-2009-2015
Editor IJARCET
 
Volume 2-issue-6-2009-2015
Editor IJARCET
 
Optical character recognition an encompassing review
eSAT Journals
 
D017222226
IOSR Journals
 
O45018291
IJERA Editor
 
Optical Character Recognition Using Python
YogeshIJTSRD
 
IRJET- Wearable AI Device for Blind
IRJET Journal
 
IRJET- Offline Transcription using AI
IRJET Journal
 
IRJET- Intelligent Character Recognition of Handwritten Characters using ...
IRJET Journal
 
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
ijcsitcejournal
 
­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...
IRJET Journal
 
IRJET- Information Retrieval & Text Analytics using Artificial Intelligence
IRJET Journal
 
LICENSE NUMBER PLATE RECOGNITION SYSTEM USING ANDROID APP
Aditya Mishra
 
Optical Character Recognition deep learning .pdf
navenjeyamurugan
 
Ad

More from AM Publications (20)

PDF
DEVELOPMENT OF TODDLER FAMILY CADRE TRAINING BASED ON ANDROID APPLICATIONS IN...
AM Publications
 
PDF
TESTING OF COMPOSITE ON DROP-WEIGHT IMPACT TESTING AND DAMAGE IDENTIFICATION ...
AM Publications
 
PDF
THE USE OF FRACTAL GEOMETRY IN TILING MOTIF DESIGN
AM Publications
 
PDF
TWO-DIMENSIONAL INVERSION FINITE ELEMENT MODELING OF MAGNETOTELLURIC DATA: CA...
AM Publications
 
PDF
USING THE GENETIC ALGORITHM TO OPTIMIZE LASER WELDING PARAMETERS FOR MARTENSI...
AM Publications
 
PDF
ANALYSIS AND DESIGN E-MARKETPLACE FOR MICRO, SMALL AND MEDIUM ENTERPRISES
AM Publications
 
PDF
REMOTE SENSING AND GEOGRAPHIC INFORMATION SYSTEMS
AM Publications
 
PDF
EVALUATE THE STRAIN ENERGY ERROR FOR THE LASER WELD BY THE H-REFINEMENT OF TH...
AM Publications
 
PDF
HMM APPLICATION IN ISOLATED WORD SPEECH RECOGNITION
AM Publications
 
PDF
PEDESTRIAN DETECTION IN LOW RESOLUTION VIDEOS USING A MULTI-FRAME HOG-BASED D...
AM Publications
 
PDF
INTELLIGENT BLIND STICK
AM Publications
 
PDF
EFFECT OF SILICON - RUBBER (SR) SHEETS AS AN ALTERNATIVE FILTER ON HIGH AND L...
AM Publications
 
PDF
UTILIZATION OF IMMUNIZATION SERVICES AMONG CHILDREN UNDER FIVE YEARS OF AGE I...
AM Publications
 
PDF
REPRESENTATION OF THE BLOCK DATA ENCRYPTION ALGORITHM IN AN ANALYTICAL FORM F...
AM Publications
 
PDF
DETECTION OF MOVING OBJECT
AM Publications
 
PDF
SIMULATION OF ATMOSPHERIC POLLUTANTS DISPERSION IN AN URBAN ENVIRONMENT
AM Publications
 
PDF
PREPARATION AND EVALUATION OF WOOL KERATIN BASED CHITOSAN NANOFIBERS FOR AIR ...
AM Publications
 
PDF
ANALYSIS ON LOAD BALANCING ALGORITHMS IMPLEMENTATION ON CLOUD COMPUTING ENVIR...
AM Publications
 
PDF
A MODEL BASED APPROACH FOR IMPLEMENTING WLAN SECURITY
AM Publications
 
PDF
DATA MINING WITH CLUSTERING ON BIG DATA FOR SHOPPING MALL’S DATASET
AM Publications
 
DEVELOPMENT OF TODDLER FAMILY CADRE TRAINING BASED ON ANDROID APPLICATIONS IN...
AM Publications
 
TESTING OF COMPOSITE ON DROP-WEIGHT IMPACT TESTING AND DAMAGE IDENTIFICATION ...
AM Publications
 
THE USE OF FRACTAL GEOMETRY IN TILING MOTIF DESIGN
AM Publications
 
TWO-DIMENSIONAL INVERSION FINITE ELEMENT MODELING OF MAGNETOTELLURIC DATA: CA...
AM Publications
 
USING THE GENETIC ALGORITHM TO OPTIMIZE LASER WELDING PARAMETERS FOR MARTENSI...
AM Publications
 
ANALYSIS AND DESIGN E-MARKETPLACE FOR MICRO, SMALL AND MEDIUM ENTERPRISES
AM Publications
 
REMOTE SENSING AND GEOGRAPHIC INFORMATION SYSTEMS
AM Publications
 
EVALUATE THE STRAIN ENERGY ERROR FOR THE LASER WELD BY THE H-REFINEMENT OF TH...
AM Publications
 
HMM APPLICATION IN ISOLATED WORD SPEECH RECOGNITION
AM Publications
 
PEDESTRIAN DETECTION IN LOW RESOLUTION VIDEOS USING A MULTI-FRAME HOG-BASED D...
AM Publications
 
INTELLIGENT BLIND STICK
AM Publications
 
EFFECT OF SILICON - RUBBER (SR) SHEETS AS AN ALTERNATIVE FILTER ON HIGH AND L...
AM Publications
 
UTILIZATION OF IMMUNIZATION SERVICES AMONG CHILDREN UNDER FIVE YEARS OF AGE I...
AM Publications
 
REPRESENTATION OF THE BLOCK DATA ENCRYPTION ALGORITHM IN AN ANALYTICAL FORM F...
AM Publications
 
DETECTION OF MOVING OBJECT
AM Publications
 
SIMULATION OF ATMOSPHERIC POLLUTANTS DISPERSION IN AN URBAN ENVIRONMENT
AM Publications
 
PREPARATION AND EVALUATION OF WOOL KERATIN BASED CHITOSAN NANOFIBERS FOR AIR ...
AM Publications
 
ANALYSIS ON LOAD BALANCING ALGORITHMS IMPLEMENTATION ON CLOUD COMPUTING ENVIR...
AM Publications
 
A MODEL BASED APPROACH FOR IMPLEMENTING WLAN SECURITY
AM Publications
 
DATA MINING WITH CLUSTERING ON BIG DATA FOR SHOPPING MALL’S DATASET
AM Publications
 
Ad

Recently uploaded (20)

DOC
MRRS Strength and Durability of Concrete
CivilMythili
 
PDF
AI TECHNIQUES FOR IDENTIFYING ALTERATIONS IN THE HUMAN GUT MICROBIOME IN MULT...
vidyalalltv1
 
PPTX
MobileComputingMANET2023 MobileComputingMANET2023.pptx
masterfake98765
 
PDF
Water Industry Process Automation & Control Monthly July 2025
Water Industry Process Automation & Control
 
PPTX
Heart Bleed Bug - A case study (Course: Cryptography and Network Security)
Adri Jovin
 
PPTX
What is Shot Peening | Shot Peening is a Surface Treatment Process
Vibra Finish
 
DOCX
CS-802 (A) BDH Lab manual IPS Academy Indore
thegodhimself05
 
PPTX
Shinkawa Proposal to meet Vibration API670.pptx
AchmadBashori2
 
PPTX
Solar Thermal Energy System Seminar.pptx
Gpc Purapuza
 
PPTX
The Role of Information Technology in Environmental Protectio....pptx
nallamillisriram
 
PPTX
Hashing Introduction , hash functions and techniques
sailajam21
 
PPTX
Worm gear strength and wear calculation as per standard VB Bhandari Databook.
shahveer210504
 
PPTX
Mechanical Design of shell and tube heat exchangers as per ASME Sec VIII Divi...
shahveer210504
 
PPTX
artificial intelligence applications in Geomatics
NawrasShatnawi1
 
PPT
Carmon_Remote Sensing GIS by Mahesh kumar
DhananjayM6
 
PPTX
原版一样(Acadia毕业证书)加拿大阿卡迪亚大学毕业证办理方法
Taqyea
 
PDF
MAD Unit - 2 Activity and Fragment Management in Android (Diploma IT)
JappanMavani
 
PDF
GTU Civil Engineering All Semester Syllabus.pdf
Vimal Bhojani
 
PPTX
VITEEE 2026 Exam Details , Important Dates
SonaliSingh127098
 
PPTX
Introduction to Design of Machine Elements
PradeepKumarS27
 
MRRS Strength and Durability of Concrete
CivilMythili
 
AI TECHNIQUES FOR IDENTIFYING ALTERATIONS IN THE HUMAN GUT MICROBIOME IN MULT...
vidyalalltv1
 
MobileComputingMANET2023 MobileComputingMANET2023.pptx
masterfake98765
 
Water Industry Process Automation & Control Monthly July 2025
Water Industry Process Automation & Control
 
Heart Bleed Bug - A case study (Course: Cryptography and Network Security)
Adri Jovin
 
What is Shot Peening | Shot Peening is a Surface Treatment Process
Vibra Finish
 
CS-802 (A) BDH Lab manual IPS Academy Indore
thegodhimself05
 
Shinkawa Proposal to meet Vibration API670.pptx
AchmadBashori2
 
Solar Thermal Energy System Seminar.pptx
Gpc Purapuza
 
The Role of Information Technology in Environmental Protectio....pptx
nallamillisriram
 
Hashing Introduction , hash functions and techniques
sailajam21
 
Worm gear strength and wear calculation as per standard VB Bhandari Databook.
shahveer210504
 
Mechanical Design of shell and tube heat exchangers as per ASME Sec VIII Divi...
shahveer210504
 
artificial intelligence applications in Geomatics
NawrasShatnawi1
 
Carmon_Remote Sensing GIS by Mahesh kumar
DhananjayM6
 
原版一样(Acadia毕业证书)加拿大阿卡迪亚大学毕业证办理方法
Taqyea
 
MAD Unit - 2 Activity and Fragment Management in Android (Diploma IT)
JappanMavani
 
GTU Civil Engineering All Semester Syllabus.pdf
Vimal Bhojani
 
VITEEE 2026 Exam Details , Important Dates
SonaliSingh127098
 
Introduction to Design of Machine Elements
PradeepKumarS27
 

OPTICAL CHARACTER RECOGNITION USING RBFNN

  • 1. International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163 Issue 02, Volume 6 (February 2019) www.ijirae.com _________________________________________________________________________________________________ IJIRAE: Impact Factor Value – Mendeley (Elsevier Indexed); Citefactor 1.9 (2017); SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35 IJIRAE © 2014- 19, All Rights Reserved Page–55 OPTICAL CHARACTER RECOGNITION USING RBFNN R.Seenu Department of Computer Science and Engineering, Annamalai University, Tamilnadu, India [email protected]; Manuscript History Number: IJIRAE/RS/Vol.06/Issue02/FBAE10087 Received: 26, January 2019 Final Correction: 07, February 2019 Final Accepted: 22, February 2019 Published: February 2019 Citation: Seenu (2019). OPTICAL CHARACTER RECOGNITION USING RBFNN. IJIRAE::International Journal of Innovative Research in Advanced Engineering, Volume VI, 55-59. doi: 10.26562/IJIRAE.2019.FBAE10087 Editor: Dr.A.Arul L.S, Chief Editor, IJIRAE, AM Publications, India Copyright: ©2019 This is an open access article distributed under the terms of the Creative Commons Attribution License, Which Permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited Abstract— Optical character recognition (OCR) is process of classification of optical patterns contained in a digital image. The process of OCR Recognition involves several steps including pre-processing, segmentation, feature extraction, classification. Pre-processing is for done the basic operation on input image like noise reduction which remove the noisy signal from image. Segmentation stage for segment the given image into line by line and segment each character from segmented line. Future extraction calculates the characteristics of character. A Radial Basis Function Neural Network (RBFNN) is used to classification contains the database and does the comparison. Keywords— Optical character recognition (OCR); Pre-processing; Segmentation; Feature Extraction; RBFNN; I. INTRODUCTION OCR has gained increasing attention in both academic research and in industry. OCR technology enables us to convert different types of documents such as scanned paper documents, pdf files or images captured by a digital camera into editable and searchable data. OCR systems have become one of the most successful applications of technology in pattern recognition and artificial intelligence fields [1]. Though many commercial systems for performing OCR exist for a wide variety of applications, the avail-able machines are still not able to compete with human reading capabilities with desired accuracy levels. Conversion of handwritten characters is important for making several important documents related to our history, such as manuscripts into machine editable form, so that, it can be easily accessed and pres independent work is going on in Optical Character Recognition that is the processing of printed/computer generated document, handwritten and manually created document processing i.e. handwritten character recognition. External factors like pressure, speed of writing have any influence in case of offline system but they have great impact on online system. Again, offline or online system canbe applied on optical character Fig 1. Fig 1. (a) Optical character (b) Handwritten character. Character recognition is an art of detecting segmenting and identifying characters from image [2]. More precisely, character recognition is a process of detecting and recognizing characters from input image and converts it into American Standard Code for Information Interchange (ASCII) or other equivalent machine editable form [3].
  • 2. International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163 Issue 02, Volume 6 (February 2019) www.ijirae.com _________________________________________________________________________________________________ IJIRAE: Impact Factor Value – Mendeley (Elsevier Indexed); Citefactor 1.9 (2017); SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35 IJIRAE © 2014- 19, All Rights Reserved Page–56 Character recognition is generally known as optical character recognition because it uses optical means to acquire the characters. Optical character recognition has huge applications including conversion of any handwritten document into structural text form, automatic number plate recognition, reading aid for blind, bank cheques etc. Character is the basic building block of any language that is used to build different structure of a language [4]. Character recognition is a process which associates with a symbolic meaning with objects (letters, symbols & numbers) drawn on an image. OCR contributes immensely to the achievement of automation process and improves the interface between man and machine in numerous applications. OCR deals with the problem of recognizing optically processed characters. Optical recognition is performed offline after the writing or printing has been completed whereas the online recognition is achieved where computer recognizes the characters as they are drawn. Both hand printed and printed characters may be recognized but the performance is directly dependent upon the quality of input documents. The more constrained the input is, better is the performance of OCR system. But when it comes to totally unconstrained hand-writing performance of OCR machines is still questionable. The Fig. 2 shows the schematic representation of different areas of character recognition. Fig. 2. The different areas of character recognition II. IMPLEMENTATION A typical OCR system consists of several components as shown in Fig. 3. The first step is to digitize analog document using an optical scanner. When regions containing text are located each symbol is extracted through segmentation process [6]. The extracted symbols are pre-processed, eliminating noise to facilitate feature extraction. The identity of each symbol is found by comparing extracted features with descriptions of symbol classes obtained through a previous learning phase. Finally contextual information is used to reconstruct words and numbers of the original text. Fig. 3 The components of an OCR system for Proposed System
  • 3. International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163 Issue 02, Volume 6 (February 2019) www.ijirae.com _________________________________________________________________________________________________ IJIRAE: Impact Factor Value – Mendeley (Elsevier Indexed); Citefactor 1.9 (2017); SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35 IJIRAE © 2014- 19, All Rights Reserved Page–57 A. Input Scanned Written Text The proposed OCR starts with image acquisition process that takes an input of a digital image by using a digital camera or scanner. The first component in OCR is optical scanning. Through scanning process digital image of original document is captured. In OCR optical scanners are used which consist of transport mechanism and sensing device that converts light intensity into grey levels. Printed documents consist of black print on white background. When performing OCR multilevel image is converted into bi-level black and white image. B. Pre-processing The raw data depending on the data acquisition type is subjected to a number of preliminary processing steps to make it usable in the descriptive stages of character analysis. The image resulting from scanning process may contain certain amount of noise. Depending on the scanner resolution and the inherent thresholding, the characters may be smeared or broken. Pre-processing of the character image is much significant for maintain the novelty of the image for classification. Here in the pre-processing phase we have performed filtering of image using median filter. Subsequently we make normalize and performed skeleton of images and extraction of boundary edge pixel points [7]. C. Segmentation The pre-processing stage yields a clean character image in the sense that a sufficient amount of shape information, high compression, and low noise on a normalized image is obtained. The next OCR component is segmentation. Here the character image is segmented into its subcomponents. Segmentation is important because the extent one can reach in separation of the various lines in the characters directly affects the recognition rate. Internal segmentation is used here which isolates lines and curves in the cursively written characters. D. Feature Extraction The objective of feature extraction is to capture essential characteristics of symbols. Feature extraction is accepted as one of the most difficult problems of pattern recognition. The most straight forward way of describing character is by actual raster image. Another approach is to extract certain features that characterize symbols but leaves the unimportant attributes. Feature extraction techniques are used to evaluate the uniqueness of each character image by which they differs from the rest character images [8]. E. Classification Classification is one the vital stage of the any recognition model. After evaluating the desired key feature values we process these vector to classifier separately and noted down the overall recognition accuracy. Radial Basis Function Neural Network (RBFNN) has been used as a classifier. There are three layers in RBFNN network namely: an input layer, a hidden layer and an output layer. In this paper, a number of samples of two English characters have been taken. Fig. 4: RBFNN Architecture
  • 4. International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163 Issue 02, Volume 6 (February 2019) www.ijirae.com _________________________________________________________________________________________________ IJIRAE: Impact Factor Value – Mendeley (Elsevier Indexed); Citefactor 1.9 (2017); SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35 IJIRAE © 2014- 19, All Rights Reserved Page–58 Therefore, there are only two outputs at output layer. The outputs of output layer represents whether the character has been correctly recognized or not. The structure of multi-input and multi-output RBF neural network is represented by Fig. 4.The parameters of an RBF type neural network consist of the centers spread the basic functions at the hidden layer nodes and the synaptic weights of the output layer nodes. The RBF centers are also points in the input space. It would be ideal to have them at each distinct point on the input space, but for any realistic problem, only a few input points from all available points are selected using clustering [5]. III.RESULTS AND DISCUSSIONS In feature extraction we have proposed an algorithm for evaluation of feature vector for our implementation of recognition system we have placed the image into a 3×3 grid into image. All the operations were performed over skeletonized image of handwritten characters. We have drawn the horizontal and vertical symmetric projection angel to the nearest pixel of the image. From which we have calculated the horizontal and vertical Euclidean distance for the same nearest pixel from centroid of each zone. Then we have calculated the mean Euclidean distance as well as the mean angular values of the zones with respect to the midpoint of symmetry axes. This is considered as the key feature values of our proposed system. The model generates two set of parameter for each row symmetry axis and column symmetry axis. Where one parameter show mean value of Ecludian distance of every symmetry axis pixel position to centroid of each zone and other parameter shows the angle between the points of the symmetry axis to the centroid of each zone the respective image. Thereafter we obtained four parameter of each image. One category is appended for recognized and the other category is not recognized for the written character. By using the feature values with appended value RBFNN training is carried out. For testing the feature extraction is done on different used in the training set. The RBFN is trained by adaptively updating the free parameters, i.e. center and width of the basis function, and the weight between the hidden and output neurons of the network. To select an optimal RBFN model, the number of neurons in the hidden layer was varied from 2 to 30, and the learning rate was varied between 0.05 and 0.5. The initial basis function centers were chosen randomly from the input space, and the initial weight values were chosen randomly between ±0.9. Normalized datasets were used for the training, testing, and validation of the RBFN model. The best network was found to be one having 26 basis functions with a learning rate of 0.9 and 0.05 for center and weight respectively. The prediction errors of the validation patterns are larger because these patterns are outside the training space. The Fig. 5 shows the comparison of various means in RBFNN. Fig: 5 Comparison graph for various means in RBFNN Experimental result shows that this directional group values technique requires less training time. In this paper, recognition accuracy, training time and classification time obtained by implementing RBFNN better accuracy with requires less training time and classification time. Implementing directional group values with RBFNN yields recognition accuracy around 88% accuracy achieved It is also analyzed during experiment that only 50 iterations are sufficient for training the network using this technique.
  • 5. International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163 Issue 02, Volume 6 (February 2019) www.ijirae.com _________________________________________________________________________________________________ IJIRAE: Impact Factor Value – Mendeley (Elsevier Indexed); Citefactor 1.9 (2017); SJIF: Innospace, Morocco (2016): 3.916 | PIF: 2.469 | Jour Info: 4.085 | ISRAJIF (2017): 4.011 | Indexcopernicus: (ICV 2016): 64.35 IJIRAE © 2014- 19, All Rights Reserved Page–59 IV. CONCLUSIONS The system recognized the written character. It is currently the state of the art approach for categorization. In order to the process of OCR Recognition involved several steps including pre-processing, segmentation, feature extraction, classification is done the basic operation on input image like noise reduction which remove the noisy signal from image. Segmentation stage for segment the given image into line by line and segment each character from segmented line. Future extraction calculates the characteristics of character. A Radial Basis Function Neural Network (RBFNN) is used to recognize rate of 88% accuracy. REFERENCES 1. Bunke, H., Wang, P. S. P. (Editors), Handbook of Character Recognition and Document Image Analysis, World Scientific, 1997. 2. Kai Ding, Zhibin Liu, LianwenJin, Xinghua Zhu, A Comparative study of GABOR feature and gradient feature for handwritten 17hinese character recognition, International Conference on Wavelet Analysis and Pattern Recognition, pp. 1182-1186, Beijing, China, 2-4 Nov. 2007. 3. Pranob K Charles, V.Harish, M.Swathi, CH. Deepthi, "A Review on the Various Techniques used for Optical Character Recognition", International Journal of Engineering Research and Applications, Vol. 2, Issue 1, pp. 659-662, Jan-Feb 2012. 4. M. F. Kader and K. Deb, “Neural network based English alphanumeric character recognition”, International Journal of Computer Science, Engineering and Applications (IJCSEA)Vol.2, No.4, August 2012 5. D.Tjondronegoro, Y.Chen, and B.Pham, “The power of play break for automatic detection and browsing of self consumable sport video highlights”, In Proceedings of the ACM Workshop on Multimedia Information Retrieval, pp. 267-274, 2004. 6. Saleem Pasha, M.C.Padma” Handwritten Kannada Character Recognition using Wavelet Transform and Structural Features” International Conference on Emerging Research in Electronics, Computer Science and Technology,pp 346- 351, IEEE 2015 7. Kalyan S Dash , N.B. Puhan and Ganapati Panda , “BESAC: Binary External Symmetry Axis Constellation for unconstrained handwritten character recognition” Pattern Recognition Letters, June 25, 2016. 8. Krupa Dholakia “A Survey on Handwritten Character Recognition Techniques For Various Indian Language” International Journal Of Computer Application (0975-8887) Volume 115-No. 1, April 2015