Devanagari OCR using a recognition driven segmentation framework and stochastic language models

Suryaprakash Kompalli¹,
Srirangaraj Setlur¹ &
Venu Govindaraju¹

279 Accesses
18 Citations
3 Altmetric
Explore all metrics

Abstract

This paper describes a novel recognition driven segmentation methodology for Devanagari Optical Character Recognition. Prior approaches have used sequential rules to segment characters followed by template matching for classification. Our method uses a graph representation to segment characters. This method allows us to segment horizontally or vertically overlapping characters as well as those connected along non-linear boundaries into finer primitive components. The components are then processed by a classifier and the classifier score is used to determine if the components need to be further segmented. Multiple hypotheses are obtained for each composite character by considering all possible combinations of the classifier results for the primitive components. Word recognition is performed by designing a stochastic finite state automaton (SFSA) that takes into account both classifier scores as well as character frequencies. A novel feature of our approach is that we use sub-character primitive components in the classification stage in order to reduce the number of classes whereas we use an n-gram language model based on the linguistic character units for word recognition.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price includes VAT (United Kingdom)

Instant access to the full article PDF.

Institutional subscriptions

Segmentation-free optical character recognition for printed Urdu text

Article Open access 06 September 2017

Optical Character Recognition Systems

A Region Based Design of Deterministic Finite State Automata for Online Recognition of Teeline Shorthand Language Alphabet

References

Baker P., Hardie A., McEnery T., Xiao R., Bontcheva K., Cunningham H., Gaizauskas R., Hamza O., Maynard D., Tablan V., Ursu C., Jayaram B., Leisher M.: Corpus linguistics and south asian languages: corpus creation and tool development. Lit. Linguist. Comput. 19(4), 509–524 (2004)
Article Google Scholar
Bansal V., Sinha R.: Integrating knowledge sources in Devanagari text recognition. IEEE Trans. Syst. Man Cybern. A 30(4), 500–505 (2000)
Article Google Scholar
Bansal V., Sinha R.: Partitioning and searching dictionary for correction of optically-read devanagari character strings. Int. J. Doc. Anal. Recognit. 4(4), 269–280 (2002)
Article Google Scholar
Bansal V., Sinha R.: Segmentation of touching and fused Devanagari characters. Pattern Recognit. 35, 875–893 (2002)
Article MATH Google Scholar
Bansal V., Sinha R.: Segmentation of touching and fused Devanagari characters. Pattern Recognit. 35, 875–893 (2002)
Article MATH Google Scholar
Bishop C.M.: Neural Networks for Pattern Recognition. Oxford University Press, New York (1996)
MATH Google Scholar
Bouchaffra D., Govindaraju V., Srihari S.N.: Postprocessing of recognized strings using nonstationary markovian models. IEEE Trans. Pattern Anal. Mach. Intell. 21(10), 990–999 (1999)
Article Google Scholar
Casey R., Lecolinet E.: A survey of methods and strategies in character segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 18, 690–706 (1996)
Article Google Scholar
Chaudhuri, B., Pal, U.: An OCR system to read two Indian language scripts: Bangla and Devanagari. In: Proceedings of the 4th International Conference on Document Analysis and Recognition, pp. 1011–1015 (1997)
Christopher M., Hinrich S.: Foundations of Statistical Natural Language Processing. MIT Press, Cambridge (1999)
MATH Google Scholar
Daniels P.T., Bright W.: The World’s Writing Systems. Oxford University Press, New York (1996)
Google Scholar
Ding, X., Wen, D., Peng, L., Liu, C.: Document digitization technology and its application for digital library in china. In: Proceedings of the 1st International Workshop on Document Image Analysis for Libraries (DIAL 2004), pp. 46–53 (2004)
Duda R.O., Hart P.E., Stork D.G.: Pattern Classification, 2nd edn. Wiley, New York (2000)
Google Scholar
Favata J., Srikantan G.: A multiple feature/resolution approach to handprinted digit and character recognition. Int. J. Imaging Syst. Technol. 7, 304–311 (1996)
Article Google Scholar
Forcada, M.: Corpus-based stochastic finite-state predictive text entry for reduced keyboards: application to catalan. In: Procesamiento del Lenguaje Natural, pp. 65–70 (2001)
Garain U., Chaudhuri B.: Segmentation of touching characters in printed devnagari and bangla scripts using fuzzy multi- factorial analysis. IEEE Trans. Syst. Man. Cybern. C 32(4), 449–459 (2002)
Article Google Scholar
Govindaraju, V., Khedekar, S., Kompalli, S., Farooq, F., Setlur, S., Vemulapati, R.: Tools for enabling digital access to multilingual indic documents. In: Proceedings of the 1st International Workshop on Document Image Analysis for Libraries (DIAL 2004), pp. 122–133 (2004)
Hirsimaki, T., Creutz, M., Siivola, V., Mikko, K.: Morphologically motivated language models in speech recognition. In: Proceedings of International and Interdisciplinary Conference on Adaptive Knowledge Representation and Reasoning, pp. 121–126 (2005)
Hull J.J., Srihari S.N.: Experiments in text recognition with binary n-grams and viterbi algorithms. IEEE Trans. Pattern Anal. Mach. Intell. 4(5), 520–530 (1982)
Article Google Scholar
The cedar-ilt data set. http://www.cedar.buffalo.edu/ilt/
Juan C.A., Enrique V.: Efficient error-correcting viterbi parsing. IEEE Trans. Pattern Anal. Mach. Intell. 20, 1109–1116 (1998)
Article Google Scholar
Juan, C.P.-C., Juan, C.A., Rafael, L.: Stochastic error-correcting parsing for OCR post-processing. In: Proceedings of the 15th International Conference on Pattern Recognition, vol. 4, pp. 405–408 (2000)
Kim G., Govindaraju V., Srihari S.N.: An architecture for handwritten text recognition systems. IJDAR 2, 37–44 (1999)
Article Google Scholar
Kompalli, S., Nayak, S., Setlur, S., Govindaraju, V.: Challenges in ocr of devanagari documents. In: Proceedings of the 8th International Conference on Document Analysis and Recognition, pp. 327–333 (2005)
Kompalli, S., Setlur, S., Govindaraju, V.: Design and comparison of segmentation driven and recognition driven Devanagari ocr. In: Proceedings of the 2nd International Conference on Document Image Analysis for Libraries, pp. 96–102 (2006)
Kompalli, S., Setlur, S., Govindaraju, V., Vemulapati, R.: Creation of data resources and design of an evaluation test bed for Devanagari script recognition. In: Proceedings of the 13th International Workshop on Research Issues on Data Engineering: Multi-lingual Information Management, pp. 55–61 (2003)
Kukich K.: Techniques for automatically correcting words in text. ACM Comput. Surv. 24(4), 377–439 (1992)
Article Google Scholar
Kunihio, F., Imagawa, T., Ashida, E.: Character recognition with selective attention. In: Proceedings of the International Joint Conference on Neural Networks, vol. 1, pp. 593–598 (1991)
Lee S.-W., Lee D.-J., Park H.-S.: A new methodology for gray-scale character segmentation and recognition. IEEE Trans. Pattern Anal. Mach. Intell. 18, 1045–1050 (1996)
Article Google Scholar
Ma H., Doermann D.: Adaptive hindi OCR using generalized hausdorff image comparison. ACM Trans. Asian Lang. Inf. Process. 26(2), 198–213 (2003)
Google Scholar
Mitchell T.M.: Machine Learning. McGraw-Hill, New York (1997)
MATH Google Scholar
Mori S., Suen C.Y., Yamamoto K.: Historical review of OCR research and development. Proc. IEEE 80, 1029–1058 (1992)
Article Google Scholar
Ohala, M.: Aspects of Hindi Phonology. Motilal Banarasidas, Delhi (1983). ISBN: 0895811162.
Rocha J., Pavlidis T.: Character recognition without segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 17, 903–909 (1995)
Article Google Scholar
Rosenfeld R.: A maximum entropy approach to adaptive statistical language modeling. Comput. Speech Lang. 10, 187–228 (1996)
Article Google Scholar
Sinha R.: Plang: a picture language schema for a class of pictures. Pattern Recognit. 16(4), 373–383 (1983)
Article Google Scholar
Sinha R.: Rule based contextual post-processing for devanagari text recognition. Pattern Recognit. 20, 475–485 (1987)
Article Google Scholar
Sinha R., Mahabala H.: Machine recognition of Devnagari script. IEEE Trans. Syst. Man Cybern. 9, 435–441 (1979)
Article MATH MathSciNet Google Scholar
Sinha R., Prasada B., Houle G., Sabourin M.: Hybrid contextural text recognition with string matching. IEEE Trans. Pattern Anal. Mach. Intell. 15, 915–925 (1993)
Article Google Scholar
Slavik, P., Govindaraju, V.: An overview of run-length encoding of handwritten word images. Technical report, SUNY, Buffalo (2000)
Song J., Li Z., Lyu M., Cai S.: Recognition of merged characters based on forepart prediction, necessity-sufficiency matching, and character-adaptive masking. IEEE Trans. Syst. Man Cybern. B 35, 2–11 (2005)
Article Google Scholar
Sonka M., Hlavac V., Boyle R.: Image Processing, Analysis and Machine Vision, 2nd edn. Brooks-Cole, Belmont (1999)
Google Scholar
Woo, K.J., George, T.R.: Automated labeling in document images. In: Proceedings of the SPIE, Document Recognition and Retrieval VIII, vol. 4307, pp. 111–122, January 2001
Wu Y., Ianakiev K.G., Govindaraju V.: Improved k-nearest neighbor classification. Pattern Recognit. 35, 2311–2318 (2002)
Article MATH Google Scholar
Xue, H.: Stochastic Modeling of High-Level Structures in Handwriting Recognition. PhD thesis, University at Buffalo, The State University of New York (2002)
Yu B., Jain A.: A generic system for form dropout. IEEE Trans. Pattern Anal. Mach. Intell. 18, 1127–1134 (1996)
Article Google Scholar
Zheng, J., Ding, X., Wu, Y.: Recognizing on-line handwritten chinese character via farg matching. In: Proceedings of the 4th International Conference on Document Analysis and Recognition, vol. 2, pp. 621–624, August 1997

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, University at Buffalo, State University of New York, Buffalo, USA
Suryaprakash Kompalli, Srirangaraj Setlur & Venu Govindaraju

Authors

Suryaprakash Kompalli
View author publications
You can also search for this author in PubMed Google Scholar
Srirangaraj Setlur
View author publications
You can also search for this author in PubMed Google Scholar
Venu Govindaraju
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Srirangaraj Setlur.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kompalli, S., Setlur, S. & Govindaraju, V. Devanagari OCR using a recognition driven segmentation framework and stochastic language models. IJDAR 12, 123–138 (2009). https://doi.org/10.1007/s10032-009-0086-8

Download citation

Received: 11 January 2008
Revised: 25 March 2009
Accepted: 13 April 2009
Published: 19 May 2009
Issue Date: July 2009
DOI: https://doi.org/10.1007/s10032-009-0086-8

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Segmentation-free optical character recognition for printed Urdu text

Optical Character Recognition Systems

A Region Based Design of Deterministic Finite State Automata for Online Recognition of Teeline Shorthand Language Alphabet

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Devanagari OCR using a recognition driven segmentation framework and stochastic language models

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Segmentation-free optical character recognition for printed Urdu text

Optical Character Recognition Systems

A Region Based Design of Deterministic Finite State Automata for Online Recognition of Teeline Shorthand Language Alphabet

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation