Handwritten Nastaleeq Script Recognition with BLSTM-CTC and ANFIS method

International Journal of Computer Trends and Technology (IJCTT)          
 
© 2014 by IJCTT Journal
Volume-11 Number-3
Year of Publication : 2014
Authors : Rinku Patel, Mitesh Thakkar
DOI : 10.14445/22312803/IJCTT-V11P128

MLA

Rinku Patel, Mitesh Thakkar. R. "Handwritten Nastaleeq Script Recognition with BLSTM-CTC and ANFIS method". International Journal of Computer Trends and Technology (IJCTT) V11(3):131-136, May 2014. ISSN:2231-2803. www.ijcttjournal.org. Published by Seventh Sense Research Group.

Abstract -
Recurrent neural networks (RNNs) have been successfully applied to the recognition of cursive handwritten documents in both English and Arabic scripts. The ability of RNNs to model context in sequence data such as speech and text makes them a suitable candidate for developing OCR systems for printed Nastaleeq script, for which no OCR system is available to date. In this work, we present the results of applying an RNN to printed Urdu text in Nastaleeq script. A Bidirectional Long Short-Term Memory (BLSTM) architecture with a Connectionist Temporal Classification (CTC) output layer was employed to recognize printed Urdu text. The proposed method uses a multidimensional BLSTM together with the Adaptive Neuro-Fuzzy Inference System (ANFIS) method for character recognition. The ANFIS approach learns its rules and membership functions from data. ANFIS is an adaptive network, that is, a network of nodes and directional links that learns a relationship between inputs and outputs. The recognition error rate is 5.4%. These results were obtained on the synthetically generated UPTI dataset, which contains artificially degraded images that reflect some real-world scanning artifacts, along with clean images. A comparison with a shape-matching-based method is also presented.
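
The two building blocks named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the network configuration, the membership function shapes, or how ANFIS is coupled to the BLSTM. The function names `ctc_best_path_decode` and `anfis_forward`, the choice of Gaussian membership functions, the first-order Sugeno rule form, and the blank index 0 are all assumptions made for illustration.

```python
import math


def ctc_best_path_decode(frame_scores, blank=0):
    """Greedy (best-path) CTC decoding.

    Picks the highest-scoring label in each frame, merges consecutive
    repeats, then drops the blank label. `blank=0` is an assumption.
    """
    decoded = []
    previous = None
    for scores in frame_scores:
        label = max(range(len(scores)), key=lambda k: scores[k])
        if label != previous and label != blank:
            decoded.append(label)
        previous = label
    return decoded


def gaussian(x, center, width):
    """Gaussian membership function (assumed shape, not from the paper)."""
    return math.exp(-0.5 * ((x - center) / width) ** 2)


def anfis_forward(x1, x2, mfs1, mfs2, consequents):
    """Forward pass of a two-input, first-order Sugeno ANFIS.

    mfs1, mfs2: lists of (center, width) pairs, one per fuzzy set.
    consequents: one (p, q, r) tuple per rule, ordered with the sets of
    x1 as the outer loop; each rule outputs p*x1 + q*x2 + r.
    Returns the network output and the normalized firing strengths.
    """
    # Layers 1-2: membership degrees, combined per rule by product.
    firing = [gaussian(x1, c1, w1) * gaussian(x2, c2, w2)
              for (c1, w1) in mfs1 for (c2, w2) in mfs2]
    # Layer 3: normalize the firing strengths.
    total = sum(firing)
    normalized = [f / total for f in firing]
    # Layers 4-5: weighted sum of the linear rule outputs.
    output = sum(n * (p * x1 + q * x2 + r)
                 for n, (p, q, r) in zip(normalized, consequents))
    return output, normalized


if __name__ == "__main__":
    # Per-frame scores over 3 labels (0 = blank), hypothetical values.
    frames = [[0.9, 0.05, 0.05], [0.1, 0.8, 0.1], [0.1, 0.8, 0.1],
              [0.9, 0.05, 0.05], [0.1, 0.8, 0.1], [0.1, 0.1, 0.8]]
    print(ctc_best_path_decode(frames))

    mfs = [(0.0, 1.0), (1.0, 1.0)]
    rules = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0),
             (0.5, 0.5, 0.0), (0.0, 0.0, 1.0)]
    print(anfis_forward(0.3, 0.7, mfs, mfs, rules))
```

In training, the parameters that ANFIS learns from data are the membership function centers and widths and the rule coefficients `(p, q, r)`; the sketch above shows only how a fixed set of such parameters maps inputs to an output.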

Keywords
Urdu character, RNN, BLSTM, ANFIS, CTC