Abstract
Character recognition has been widely used since its inception in applications that involve processing scanned or camera-captured documents. Languages are written in many different scripts, which can broadly be divided into cursive and non-cursive scripts. Recurrent neural networks have been shown to achieve state-of-the-art results for optical character recognition (OCR). We present a thorough investigation of the performance of recurrent neural networks (RNNs) on cursive and non-cursive scripts. We employ bidirectional long short-term memory (BLSTM) networks, which are a variant of the standard RNN. The output layer of the architecture used in our investigation is a special layer called connectionist temporal classification (CTC), which performs the sequence alignment. The CTC layer takes the LSTM activations as input and aligns the target labels with the inputs. The results obtained at the character level for both cursive Urdu and non-cursive English scripts are significant and suggest that the BLSTM technique is potentially more useful than existing OCR algorithms.
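To make the role of the CTC output layer more concrete, the following is a minimal sketch of greedy (best-path) CTC decoding: take the most active label at each time step, merge consecutive repeats, and drop the blank label. The abstract does not state which decoding scheme was used, so this is an illustration only. The function name `ctc_greedy_decode`, the blank index 0, and the toy activations and alphabet are all assumptions made for this example, not details from the article.

```python
from typing import List, Optional, Sequence

BLANK = 0  # assumed index of the CTC blank label (hypothetical choice)


def ctc_greedy_decode(
    activations: Sequence[Sequence[float]], blank: int = BLANK
) -> List[int]:
    """Best-path decoding: argmax per time step, merge repeats, drop blanks."""
    # Most active label at each time step.
    best_path = [
        max(range(len(frame)), key=frame.__getitem__) for frame in activations
    ]
    decoded: List[int] = []
    previous: Optional[int] = None
    for label in best_path:
        # Emit a label only when it differs from the previous frame
        # and is not the blank symbol.
        if label != previous and label != blank:
            decoded.append(label)
        previous = label
    return decoded


# Toy activations: 6 time steps, 3 labels (0 = blank, 1 = "a", 2 = "b").
frames = [
    [0.10, 0.80, 0.10],  # a
    [0.10, 0.70, 0.20],  # a again (merged with the previous frame)
    [0.90, 0.05, 0.05],  # blank separates the two "a" characters
    [0.20, 0.70, 0.10],  # a (a new character, because a blank came before)
    [0.10, 0.10, 0.80],  # b
    [0.80, 0.10, 0.10],  # blank
]
alphabet = {1: "a", 2: "b"}  # hypothetical label-to-character map
text = "".join(alphabet[i] for i in ctc_greedy_decode(frames))
print(text)
```

The blank label is what lets the decoder tell a repeated character apart from one character spread over several frames. During training, CTC does not pick a single path like this but sums over all label paths that collapse to the target sequence.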
Cite this article
Ahmed, S.B., Naz, S., Razzak, M.I. et al. Evaluation of cursive and non-cursive scripts using recurrent neural networks. Neural Comput & Applic 27, 603–613 (2016). https://doi.org/10.1007/s00521-015-1881-4
DOI: https://doi.org/10.1007/s00521-015-1881-4