SlideShare a Scribd company logo
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 3304
Text Extraction from Text Based Image Using Android
Amar G. Waghade1, Anuja V. Zopate2, Ankita G. Titare3, Suraj A. Shelke4
1 Professor, Dept. Of CSE Engineering, DES’sCOET Dhamangaon Railway, Maharashtra, India
2 ,3,4 Student, Dept. Of CSE Engineering, DES’sCOET Dhamangaon Railway, Maharashtra, India
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract – In world the smartphones are everywhere and
everyone uses smartphones for their day to day life. The
smartphones have been developed for various purposes like
capturing images, record videos, surf the internet and etc.
With advancement of technology, it is possible to apply some
techniques to perform text detection and translation. In this
study, we have tried to integrate the TesseractOCRengineand
the Google Vision library and develop an application on
android platform that allows user to capture the imagesusing
camera and extract the text from it. This application
recognizes the text that is captured by a mobile phone camera
and displays back the recognized text on to the screen. To
develop this application we have used the Optical Character
Recognition, OCR engine, and we develop our own opensource
Android application. This application has come up with the
solutions for the problems of retyping any hard copy of any
documents, scanning the documents in the scanner then use
the computer oriented software to recognize the text in the
file, or guessing the text and typing it. It aims to develop a
piece of software that runs on smartphones and can beusedto
identify and recognize text of any text based image using the
phone’s camera. In case of a finding, In order to achieve fast
processing times, the project has to deal with the low
computing power of smartphones. For this purpose, different
approaches to object detection, such as concepts from the
machine learning domain, and their implementation on
mobile platforms will be analysed.
Key Words: Android, OCR, Text translator, Text to
speech.
1. INTRODUCTION
In society, mobile phones over a wide variety of
functionalities that are not always related to calling or
sending messages. Those functionalities include web
browsing, playing games or music, banking, taking photos
and so much more. The Text Extraction from Text Based
Image is an Android application that aims to allow the user
to extract the text from the image and after extracting if user
want to translate it into another language he will also
translate into other language and also to hear a text
contained in a picture that has been taken with a mobile
phone. It is an application meant to help those who cannot
read a text they encounter, like non-native speakers, the
visually impaired and the blind people, estimated at 285
millions in 2010 by the World Health Organization. This
project is mainly related to image processing to recognize
characters in an image.
Considering text element, image can be classified as: (i)
document image and (ii) scene text image. Currently
available OCR algorithms have been developed to deal with
document images. OCR algorithm expects the input to be
black and white image and relatively clean and well
structured. In these conditions, the direct application of an
OCR engine on a scene text image would result in an almost
certain failure. Therefore, the first task while designing
application using text information in natural scene image is
to transform the image in a way that current OCR engines
are able to manage. Although there exist a lot of research
activities in this field, scene text extraction is still remained
as a challenging problem, mainly due to two issues: different
variety of text patterns like fonts, colours, sizes, orientations
and presence of background outliers similar to text
characters, such as windows, bricks and character-like
texture.
In this project, an android platform based tourist translator
application is developed, which is able to extract the text
from scene image and translate the same from one language
to other. Extraction is performed using stroke width
transform (SWT) approach and connected component
analysis. Extracted texts are recognized using Google’sopen
source optical character recognition (OCR) engine
‘Tesseract’ and translated to a target languageusingGoogle’s
translation.
1.1 OCR
OCR technology is allows the conversion of image which is
scanned of printed character into text or any other
information that user want using android mobile. OCR
technology usesthree phases first is Scanning of documents
as optical images. Next is Recognition which involves
converting those images to character streams representing
letters of recognized words and the final element used to
accessing or storing the text which are already converted.
Converted text is nothing but the extracted text. When, the
user begins by capturing an image using mobile camera
containing text .To convert extracted text into any language
text synthesizer is used.
Tesseract is an open source engine developed by HP labs
between years1985 to 1995 and then handedovertoGoogle
Inc. in 2006. Tesseract combined with the Leptonica Image
Processing library which can read a wide variety of image
formats and convert them to text in over 60 languages. It
works well on all computer operating system as well as
Android and iPhone mobile platform. Due to popularity of
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 3305
Tesseract being open source engine, there are a lot of
academic experiments and OCR software developments
conducted successfully. Based on study conducted between
OCRAD, GOCR and Tesseract, found out that the Tesseract
outperform other open source engines.
2. DESIGN AND IMPLEMENTSTION
Fig -1:Block Diagram
2.1 Text Feature Filtering
Text feature extraction that extracts text information is an
extraction to represent a text message, it is the basis of a
large number of text processing. The basic unit of thefeature
is called text features. Selecting a set of features from some
effective ways to reduce the dimension of feature space, the
purpose of this process is called feature extraction. During
feature extraction, uncorrelated or superfluousfeatureswill
be deleted. As a method of data preprocessing of learning
algorithm, feature extraction can better improve the
accuracy of learning algorithm and shorten the time.
Selection from the document part can reflecttheinformation
on the content words, and the calculation of weight is called
the text feature extraction. Common methodsof text feature
extraction include filtration, fusion, mapping, and clustering
method.
2.2 Text-based retrieval
The basic level of multimedia retrieval on the basisoftextual
information is keyword searching. Numerousmethodshave
been proposed in solving documents indexing and retrieval
tasksbased on only text content with noisy data,forexample
the output of an OCR system. These methods can improve
the retrieval performance on top of simple word matching
using fuzzy logic, confusion information for charactersanda
bi-gram model, finite state machine, similarity distance
measure, and OCR error modeling.
2.3 Google API's
We used the API's in this project like Google translator for
translating languages and text to speech engine for
converting the text retrieved into the audio. Google APIs is a
set of application programming interfaces(APIs) developed
by Google which allow communication with Google Services
and their integration to other services. Google translator
used in this project for translating the present language into
any other language that user wants. Google Translate can
translate multiple forms of text and media, including text,
speech, images, sites, or real-time video, from one language
to another.
Fig -2: Example of Extracting Text
3. CONCLUSION
Text extraction is recent research area in the field of
Computer Vision. It is challenging problem in the
information processing field mainly due to, different variety
of text patterns like fonts, colors, sizes, orientations; and
presence of background outliers similar to text characters,
such as windows, bricks. In this research work, considering
few of challenges, real time application named as demo OCR
is designed and developed. Extraction is performed using
stroke width transform and connected component based
approach & using the Tesseract tool using the google vision
API. Proposed application assiststhe tourists, while theyare
roaming in foreign countries. The performance of system is
tested based on extraction rate. With proposed application
almost all the text in horizontal orientation extracted
correctly, whereas performance of real time images varies
with lighting condition and camera resolution.
Proposed android application can be further extended to
deal with any target and source language for translation. It
can be further modified to deal with text having vertical or
arbitrary orientation.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 3306
3. SCREENSHOTS
Snapshot 1: StartActivity
Snapshot 2: MainActivity
Snapshot 3: Image to Text
Snapshot 4: Output
REFERENCES
[1] K. Jung, K. I. Kim, and A. K. Jain, “Text information
extraction in images and video: A survey,” Pattern
Recognit., vol. 37, no. 5, pp. 977–997, 2004.
[2] X. Zhao, K.-H. Lin, Y. Fu, Y. Hu, Y. Liu, and T. S. Huang,
“Text from corners: A novel approach to detect text and
caption in videos,” IEEE Trans. Image Process., vol. 20,
no. 3, pp. 790–799, Mar. 2011.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 3307
[3] W. Kim and C. Kim, “A new approach for overlay text
detection and extraction from complex video scene,”
IEEE Trans. Image Process., vol. 18, no. 2, pp. 401–411,
Feb. 2009.
[4] Jing Zhang and R. Kasturi, “A noval text detectionsystem
based on character and link energies” in IEEE Trans.
Image Processing, Vol. 23, No. 9, Sep 2014.
[5] Y.-F. Pan, X. Huo, and C.-L. Liu, “A hybrid approach to
detect and localize texts in natural scene images,” IEEE
Trans. Image Process., vol. 20, no. 3, pp. 800–813, Mar.
2010.
[6] Z. Tu, X. Chen, A. L. Yuille, and S.-C. Zhu, “Image parsing:
Unifying segmentation, detection, and recognition,” Int.
J. Comput. Vis., vol. 63, no. 2, pp. 113– 140, 2005.
[7] C. Yi and Y. Tian, “Text string detection from natural
scenes by structure-based partition andgrouping,”IEEE
Trans. Image Process., vol. 20, no. 9, pp.2594–2605,Sep.
2011.

More Related Content

What's hot (20)

PDF
Volume 2-issue-6-2009-2015
Editor IJARCET
 
PDF
Optical Character Recognition Using Python
YogeshIJTSRD
 
PDF
OCR Text Extraction
Dr. Amarjeet Singh
 
PPTX
Optical Character Recognition( OCR )
Karan Panjwani
 
PDF
A NOVEL APPROACH FOR WORD RETRIEVAL FROM DEVANAGARI DOCUMENT IMAGES
ijnlc
 
PPT
Tesseract OCR Engine - OpenFest 2009
Svetlin Nakov
 
DOCX
Optical character recognition IEEE Paper Study
Er. Ashish Pandey
 
PPTX
OCR (Optical Character Recognition)
IstiaqueBinIslam
 
PDF
IRJET- A Novel Approach – Automatic paper evaluation system
IRJET Journal
 
PPTX
Optical Character Recognition
Durjoy Saha
 
PDF
Optical Character Recognition (OCR) System
iosrjce
 
PDF
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
ijcsitcejournal
 
PPTX
Optical Character Recognition (OCR) based Retrieval
Biniam Asnake
 
PPTX
OCR speech using Labview
Bharat Thakur
 
PDF
IRJET - Storytelling App for Children with Hearing Impairment using Natur...
IRJET Journal
 
PPTX
Offline Omni Font Arabic Optical Text Recognition System using Prolog Classif...
GBM
 
DOC
Ocr abstract
Punya Prakash
 
PPT
optical character recognition system
Vijay Apurva
 
PDF
Harendra singh ai_consulting_portfolio_2021
Harendra Singh
 
Volume 2-issue-6-2009-2015
Editor IJARCET
 
Optical Character Recognition Using Python
YogeshIJTSRD
 
OCR Text Extraction
Dr. Amarjeet Singh
 
Optical Character Recognition( OCR )
Karan Panjwani
 
A NOVEL APPROACH FOR WORD RETRIEVAL FROM DEVANAGARI DOCUMENT IMAGES
ijnlc
 
Tesseract OCR Engine - OpenFest 2009
Svetlin Nakov
 
Optical character recognition IEEE Paper Study
Er. Ashish Pandey
 
OCR (Optical Character Recognition)
IstiaqueBinIslam
 
IRJET- A Novel Approach – Automatic paper evaluation system
IRJET Journal
 
Optical Character Recognition
Durjoy Saha
 
Optical Character Recognition (OCR) System
iosrjce
 
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
ijcsitcejournal
 
Optical Character Recognition (OCR) based Retrieval
Biniam Asnake
 
OCR speech using Labview
Bharat Thakur
 
IRJET - Storytelling App for Children with Hearing Impairment using Natur...
IRJET Journal
 
Offline Omni Font Arabic Optical Text Recognition System using Prolog Classif...
GBM
 
Ocr abstract
Punya Prakash
 
optical character recognition system
Vijay Apurva
 
Harendra singh ai_consulting_portfolio_2021
Harendra Singh
 

Similar to IRJET- Text Extraction from Text Based Image using Android (20)

PDF
A SMART LANGUAGE TRANSLATION TECHNIQUE USING OCR
IRJET Journal
 
PDF
IRJET - Optical Character Recognition and Translation
IRJET Journal
 
PPTX
OCR.pptx
ALLIPURAMLAVANYA21K9
 
PDF
Audio computing Image to Text Synthesizer - A Cutting-Edge Content Generator ...
IRJET Journal
 
PPTX
Team-98 research paper presentation.pptx
dipakshukla158
 
PPSX
Mob ocr
Vivek Bharadwaj
 
PDF
Text Recognition Using Tesseract OCR Facilitating Multilingualism: A Review
IRJET Journal
 
PPTX
mobile camera based text detection
vallabh potadar
 
PDF
Text Recognition, Object Detection and Language Translation App
IRJET Journal
 
PDF
IRJET-MText Extraction from Images using Convolutional Neural Network
IRJET Journal
 
PDF
IRJET- Text Line Detection in Camera Caputerd Images using Matlab GUI
IRJET Journal
 
PDF
IRJET - Number/Text Translate from Image
IRJET Journal
 
PDF
Handwritten Text Recognition and Digital Text Conversion
ijtsrd
 
PPTX
Text extraction from natural scene image, a survey
SOYEON KIM
 
PPTX
Word Detection & Translation from image on an android device
Ritwik Kumar
 
PDF
Text Detection and Recognition with Speech Output for Visually Challenged Per...
IJERA Editor
 
PDF
E1803012329
IOSR Journals
 
PDF
IMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNING
IRJET Journal
 
PPTX
What is OCR Technology and How to Extract Text from Any Image for Free
TwisterTools
 
PDF
Smart Assistant for Blind Humans using Rashberry PI
ijtsrd
 
A SMART LANGUAGE TRANSLATION TECHNIQUE USING OCR
IRJET Journal
 
IRJET - Optical Character Recognition and Translation
IRJET Journal
 
Audio computing Image to Text Synthesizer - A Cutting-Edge Content Generator ...
IRJET Journal
 
Team-98 research paper presentation.pptx
dipakshukla158
 
Text Recognition Using Tesseract OCR Facilitating Multilingualism: A Review
IRJET Journal
 
mobile camera based text detection
vallabh potadar
 
Text Recognition, Object Detection and Language Translation App
IRJET Journal
 
IRJET-MText Extraction from Images using Convolutional Neural Network
IRJET Journal
 
IRJET- Text Line Detection in Camera Caputerd Images using Matlab GUI
IRJET Journal
 
IRJET - Number/Text Translate from Image
IRJET Journal
 
Handwritten Text Recognition and Digital Text Conversion
ijtsrd
 
Text extraction from natural scene image, a survey
SOYEON KIM
 
Word Detection & Translation from image on an android device
Ritwik Kumar
 
Text Detection and Recognition with Speech Output for Visually Challenged Per...
IJERA Editor
 
E1803012329
IOSR Journals
 
IMAGE TO TEXT TO SPEECH CONVERSION USING MACHINE LEARNING
IRJET Journal
 
What is OCR Technology and How to Extract Text from Any Image for Free
TwisterTools
 
Smart Assistant for Blind Humans using Rashberry PI
ijtsrd
 
Ad

More from IRJET Journal (20)

PDF
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
IRJET Journal
 
PDF
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
IRJET Journal
 
PDF
Kiona – A Smart Society Automation Project
IRJET Journal
 
PDF
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
IRJET Journal
 
PDF
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
IRJET Journal
 
PDF
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
IRJET Journal
 
PDF
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
IRJET Journal
 
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
IRJET Journal
 
PDF
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
IRJET Journal
 
PDF
BRAIN TUMOUR DETECTION AND CLASSIFICATION
IRJET Journal
 
PDF
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
IRJET Journal
 
PDF
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
IRJET Journal
 
PDF
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
IRJET Journal
 
PDF
Breast Cancer Detection using Computer Vision
IRJET Journal
 
PDF
Auto-Charging E-Vehicle with its battery Management.
IRJET Journal
 
PDF
Analysis of high energy charge particle in the Heliosphere
IRJET Journal
 
PDF
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
IRJET Journal
 
PDF
Auto-Charging E-Vehicle with its battery Management.
IRJET Journal
 
PDF
Analysis of high energy charge particle in the Heliosphere
IRJET Journal
 
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
IRJET Journal
 
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
IRJET Journal
 
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
IRJET Journal
 
Kiona – A Smart Society Automation Project
IRJET Journal
 
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
IRJET Journal
 
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
IRJET Journal
 
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
IRJET Journal
 
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
IRJET Journal
 
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
IRJET Journal
 
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
IRJET Journal
 
BRAIN TUMOUR DETECTION AND CLASSIFICATION
IRJET Journal
 
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
IRJET Journal
 
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
IRJET Journal
 
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
IRJET Journal
 
Breast Cancer Detection using Computer Vision
IRJET Journal
 
Auto-Charging E-Vehicle with its battery Management.
IRJET Journal
 
Analysis of high energy charge particle in the Heliosphere
IRJET Journal
 
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
IRJET Journal
 
Auto-Charging E-Vehicle with its battery Management.
IRJET Journal
 
Analysis of high energy charge particle in the Heliosphere
IRJET Journal
 
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
IRJET Journal
 
Ad

Recently uploaded (20)

PDF
UNIT-4-FEEDBACK AMPLIFIERS AND OSCILLATORS (1).pdf
Sridhar191373
 
PPT
Oxygen Co2 Transport in the Lungs(Exchange og gases)
SUNDERLINSHIBUD
 
PDF
Set Relation Function Practice session 24.05.2025.pdf
DrStephenStrange4
 
PDF
POWER PLANT ENGINEERING (R17A0326).pdf..
haneefachosa123
 
PDF
Zilliz Cloud Demo for performance and scale
Zilliz
 
PPTX
Innowell Capability B0425 - Commercial Buildings.pptx
regobertroza
 
PDF
International Journal of Information Technology Convergence and services (IJI...
ijitcsjournal4
 
PPTX
原版一样(Acadia毕业证书)加拿大阿卡迪亚大学毕业证办理方法
Taqyea
 
PPTX
Benefits_^0_Challigi😙🏡💐8fenges[1].pptx
akghostmaker
 
DOCX
8th International Conference on Electrical Engineering (ELEN 2025)
elelijjournal653
 
PPTX
site survey architecture student B.arch.
sri02032006
 
PDF
PORTFOLIO Golam Kibria Khan — architect with a passion for thoughtful design...
MasumKhan59
 
PPTX
Hashing Introduction , hash functions and techniques
sailajam21
 
PPTX
EC3551-Transmission lines Demo class .pptx
Mahalakshmiprasannag
 
PDF
MAD Unit - 2 Activity and Fragment Management in Android (Diploma IT)
JappanMavani
 
PPTX
UNIT DAA PPT cover all topics 2021 regulation
archu26
 
PDF
Unified_Cloud_Comm_Presentation anil singh ppt
anilsingh298751
 
PDF
GTU Civil Engineering All Semester Syllabus.pdf
Vimal Bhojani
 
PDF
monopile foundation seminar topic for civil engineering students
Ahina5
 
PPTX
artificial intelligence applications in Geomatics
NawrasShatnawi1
 
UNIT-4-FEEDBACK AMPLIFIERS AND OSCILLATORS (1).pdf
Sridhar191373
 
Oxygen Co2 Transport in the Lungs(Exchange og gases)
SUNDERLINSHIBUD
 
Set Relation Function Practice session 24.05.2025.pdf
DrStephenStrange4
 
POWER PLANT ENGINEERING (R17A0326).pdf..
haneefachosa123
 
Zilliz Cloud Demo for performance and scale
Zilliz
 
Innowell Capability B0425 - Commercial Buildings.pptx
regobertroza
 
International Journal of Information Technology Convergence and services (IJI...
ijitcsjournal4
 
原版一样(Acadia毕业证书)加拿大阿卡迪亚大学毕业证办理方法
Taqyea
 
Benefits_^0_Challigi😙🏡💐8fenges[1].pptx
akghostmaker
 
8th International Conference on Electrical Engineering (ELEN 2025)
elelijjournal653
 
site survey architecture student B.arch.
sri02032006
 
PORTFOLIO Golam Kibria Khan — architect with a passion for thoughtful design...
MasumKhan59
 
Hashing Introduction , hash functions and techniques
sailajam21
 
EC3551-Transmission lines Demo class .pptx
Mahalakshmiprasannag
 
MAD Unit - 2 Activity and Fragment Management in Android (Diploma IT)
JappanMavani
 
UNIT DAA PPT cover all topics 2021 regulation
archu26
 
Unified_Cloud_Comm_Presentation anil singh ppt
anilsingh298751
 
GTU Civil Engineering All Semester Syllabus.pdf
Vimal Bhojani
 
monopile foundation seminar topic for civil engineering students
Ahina5
 
artificial intelligence applications in Geomatics
NawrasShatnawi1
 

IRJET- Text Extraction from Text Based Image using Android

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 3304 Text Extraction from Text Based Image Using Android Amar G. Waghade1, Anuja V. Zopate2, Ankita G. Titare3, Suraj A. Shelke4 1 Professor, Dept. Of CSE Engineering, DES’sCOET Dhamangaon Railway, Maharashtra, India 2 ,3,4 Student, Dept. Of CSE Engineering, DES’sCOET Dhamangaon Railway, Maharashtra, India ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract – In world the smartphones are everywhere and everyone uses smartphones for their day to day life. The smartphones have been developed for various purposes like capturing images, record videos, surf the internet and etc. With advancement of technology, it is possible to apply some techniques to perform text detection and translation. In this study, we have tried to integrate the TesseractOCRengineand the Google Vision library and develop an application on android platform that allows user to capture the imagesusing camera and extract the text from it. This application recognizes the text that is captured by a mobile phone camera and displays back the recognized text on to the screen. To develop this application we have used the Optical Character Recognition, OCR engine, and we develop our own opensource Android application. This application has come up with the solutions for the problems of retyping any hard copy of any documents, scanning the documents in the scanner then use the computer oriented software to recognize the text in the file, or guessing the text and typing it. It aims to develop a piece of software that runs on smartphones and can beusedto identify and recognize text of any text based image using the phone’s camera. In case of a finding, In order to achieve fast processing times, the project has to deal with the low computing power of smartphones. For this purpose, different approaches to object detection, such as concepts from the machine learning domain, and their implementation on mobile platforms will be analysed. Key Words: Android, OCR, Text translator, Text to speech. 1. INTRODUCTION In society, mobile phones over a wide variety of functionalities that are not always related to calling or sending messages. Those functionalities include web browsing, playing games or music, banking, taking photos and so much more. The Text Extraction from Text Based Image is an Android application that aims to allow the user to extract the text from the image and after extracting if user want to translate it into another language he will also translate into other language and also to hear a text contained in a picture that has been taken with a mobile phone. It is an application meant to help those who cannot read a text they encounter, like non-native speakers, the visually impaired and the blind people, estimated at 285 millions in 2010 by the World Health Organization. This project is mainly related to image processing to recognize characters in an image. Considering text element, image can be classified as: (i) document image and (ii) scene text image. Currently available OCR algorithms have been developed to deal with document images. OCR algorithm expects the input to be black and white image and relatively clean and well structured. In these conditions, the direct application of an OCR engine on a scene text image would result in an almost certain failure. Therefore, the first task while designing application using text information in natural scene image is to transform the image in a way that current OCR engines are able to manage. Although there exist a lot of research activities in this field, scene text extraction is still remained as a challenging problem, mainly due to two issues: different variety of text patterns like fonts, colours, sizes, orientations and presence of background outliers similar to text characters, such as windows, bricks and character-like texture. In this project, an android platform based tourist translator application is developed, which is able to extract the text from scene image and translate the same from one language to other. Extraction is performed using stroke width transform (SWT) approach and connected component analysis. Extracted texts are recognized using Google’sopen source optical character recognition (OCR) engine ‘Tesseract’ and translated to a target languageusingGoogle’s translation. 1.1 OCR OCR technology is allows the conversion of image which is scanned of printed character into text or any other information that user want using android mobile. OCR technology usesthree phases first is Scanning of documents as optical images. Next is Recognition which involves converting those images to character streams representing letters of recognized words and the final element used to accessing or storing the text which are already converted. Converted text is nothing but the extracted text. When, the user begins by capturing an image using mobile camera containing text .To convert extracted text into any language text synthesizer is used. Tesseract is an open source engine developed by HP labs between years1985 to 1995 and then handedovertoGoogle Inc. in 2006. Tesseract combined with the Leptonica Image Processing library which can read a wide variety of image formats and convert them to text in over 60 languages. It works well on all computer operating system as well as Android and iPhone mobile platform. Due to popularity of
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 3305 Tesseract being open source engine, there are a lot of academic experiments and OCR software developments conducted successfully. Based on study conducted between OCRAD, GOCR and Tesseract, found out that the Tesseract outperform other open source engines. 2. DESIGN AND IMPLEMENTSTION Fig -1:Block Diagram 2.1 Text Feature Filtering Text feature extraction that extracts text information is an extraction to represent a text message, it is the basis of a large number of text processing. The basic unit of thefeature is called text features. Selecting a set of features from some effective ways to reduce the dimension of feature space, the purpose of this process is called feature extraction. During feature extraction, uncorrelated or superfluousfeatureswill be deleted. As a method of data preprocessing of learning algorithm, feature extraction can better improve the accuracy of learning algorithm and shorten the time. Selection from the document part can reflecttheinformation on the content words, and the calculation of weight is called the text feature extraction. Common methodsof text feature extraction include filtration, fusion, mapping, and clustering method. 2.2 Text-based retrieval The basic level of multimedia retrieval on the basisoftextual information is keyword searching. Numerousmethodshave been proposed in solving documents indexing and retrieval tasksbased on only text content with noisy data,forexample the output of an OCR system. These methods can improve the retrieval performance on top of simple word matching using fuzzy logic, confusion information for charactersanda bi-gram model, finite state machine, similarity distance measure, and OCR error modeling. 2.3 Google API's We used the API's in this project like Google translator for translating languages and text to speech engine for converting the text retrieved into the audio. Google APIs is a set of application programming interfaces(APIs) developed by Google which allow communication with Google Services and their integration to other services. Google translator used in this project for translating the present language into any other language that user wants. Google Translate can translate multiple forms of text and media, including text, speech, images, sites, or real-time video, from one language to another. Fig -2: Example of Extracting Text 3. CONCLUSION Text extraction is recent research area in the field of Computer Vision. It is challenging problem in the information processing field mainly due to, different variety of text patterns like fonts, colors, sizes, orientations; and presence of background outliers similar to text characters, such as windows, bricks. In this research work, considering few of challenges, real time application named as demo OCR is designed and developed. Extraction is performed using stroke width transform and connected component based approach & using the Tesseract tool using the google vision API. Proposed application assiststhe tourists, while theyare roaming in foreign countries. The performance of system is tested based on extraction rate. With proposed application almost all the text in horizontal orientation extracted correctly, whereas performance of real time images varies with lighting condition and camera resolution. Proposed android application can be further extended to deal with any target and source language for translation. It can be further modified to deal with text having vertical or arbitrary orientation.
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 3306 3. SCREENSHOTS Snapshot 1: StartActivity Snapshot 2: MainActivity Snapshot 3: Image to Text Snapshot 4: Output REFERENCES [1] K. Jung, K. I. Kim, and A. K. Jain, “Text information extraction in images and video: A survey,” Pattern Recognit., vol. 37, no. 5, pp. 977–997, 2004. [2] X. Zhao, K.-H. Lin, Y. Fu, Y. Hu, Y. Liu, and T. S. Huang, “Text from corners: A novel approach to detect text and caption in videos,” IEEE Trans. Image Process., vol. 20, no. 3, pp. 790–799, Mar. 2011.
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 3307 [3] W. Kim and C. Kim, “A new approach for overlay text detection and extraction from complex video scene,” IEEE Trans. Image Process., vol. 18, no. 2, pp. 401–411, Feb. 2009. [4] Jing Zhang and R. Kasturi, “A noval text detectionsystem based on character and link energies” in IEEE Trans. Image Processing, Vol. 23, No. 9, Sep 2014. [5] Y.-F. Pan, X. Huo, and C.-L. Liu, “A hybrid approach to detect and localize texts in natural scene images,” IEEE Trans. Image Process., vol. 20, no. 3, pp. 800–813, Mar. 2010. [6] Z. Tu, X. Chen, A. L. Yuille, and S.-C. Zhu, “Image parsing: Unifying segmentation, detection, and recognition,” Int. J. Comput. Vis., vol. 63, no. 2, pp. 113– 140, 2005. [7] C. Yi and Y. Tian, “Text string detection from natural scenes by structure-based partition andgrouping,”IEEE Trans. Image Process., vol. 20, no. 9, pp.2594–2605,Sep. 2011.