Application of Deep Convolutional Neural Networks to Telugu Scripts for Optical Character Recognition

© 2023 by IJCTT Journal
Volume-71 Issue-1
Year of Publication : 2023
Authors : M. Gnaneswari, T. Chaitanya Kumar
DOI :  10.14445/22312803/IJCTT-V71I1P108

How to Cite?

M. Gnaneswari, T. Chaitanya Kumar, "Application of Deep Convolutional Neural Networks to Telugu Scripts for Optical Character Recognition," International Journal of Computer Trends and Technology, vol. 71, no. 1, pp. 50-55, 2023. Crossref, https://doi.org/10.14445/22312803/IJCTT-V71I1P108

Abstract
This research examines optical character recognition (OCR) for Telugu script. Telugu is a Dravidian language spoken in India. OCR for English is widely used, and a plethora of smartphone apps is available. Telugu is significantly more complex because of the number of output classes that can be formed and the inter-class diversity. In addition, there are no good Telugu OCR systems. We employed a Deep Convolutional Neural Network (DCNN) model for Telugu character recognition because of its success in other tasks, such as segmentation, object identification, and character recognition. Several machine learning algorithms, such as AdaBoost, Support Vector Machine (SVM), XGBoost, and Decision Tree (DT), are considered as baselines for evaluating the performance of the proposed DCNN. In experiments on the IEEE Telugu Character Dataset, the proposed DCNN yielded promising results, demonstrating its usefulness compared with these traditional techniques.

Keywords
Text Recognition, Telugu Script, Optical Character Recognition (OCR), Deep Learning, Convolutional Neural Network (CNN), Machine Learning.
