Text to Speech Conversion using Optical character Recognition for Visually Impaired Persons

International Journal of Computer Trends and Technology (IJCTT)          
© 2015 by IJCTT Journal
Volume-29 Number-2
Year of Publication : 2015
Authors : Prince Saini, Rajesh Mehra


Prince Saini, Rajesh Mehra "Text to Speech Conversion using Optical character Recognition for Visually Impaired Persons". International Journal of Computer Trends and Technology (IJCTT) V29(2):97-102, November 2015. ISSN:2231-2803. www.ijcttjournal.org. Published by Seventh Sense Research Group.

Abstract -
The inability to read visually affects a person's life to a great extent. Although various devices have been designed for visually impaired persons, the development of text reading devices is still at an early stage. A text recognition system that is user friendly and easily available, and that automatically converts text to voice, is therefore needed. In this paper, Optical Character Recognition (OCR) is used to develop a cost-effective, user-friendly text to speech conversion system in MATLAB. The target text is captured through an optical component and recognized using OCR, and the developed design has been tested successfully. The paper aims to design a system, implementable in hardware, that obtains text from an image and then generates a speech signal from that text. Such a text to speech conversion device will be of great help to people with visual impairment.

Keywords -
OCR, segmentation, character extraction, graphical user interface (GUI), concatenative synthesis.