[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

An HMM-Based Approach for Off-Line Unconstrained Handwritten Word Modeling and Recognition

Published: 01 August 1999 Publication History

Abstract

This paper describes a hidden Markov model-based approach designed to recognize off-line unconstrained handwritten words for large vocabularies. After preprocessing, a word image is segmented into letters or pseudoletters and represented by two feature sequences of equal length, each consisting of an alternating sequence of shape-symbols and segmentation-symbols, which are both explicitly modeled. The word model is made up of the concatenation of appropriate letter models consisting of elementary HMMs and an HMM-based interpolation technique is used to optimally combine the two feature sets. Two rejection mechanisms are considered depending on whether or not the word image is guaranteed to belong to the lexicon. Experiments carried out on real-life data show that the proposed approach can be successfully used for handwritten word recognition.

References

[1]
M. Gilloux and M. Leroux, “Recognition of Cursive Script Amounts on Postal Cheques,” Proc. Fifth U.S. Postal Service Advanced Technology Conf., pp. 545-556, 1992.
[2]
S. Knerr O. Baret D. Price and J.C. Simon, “The A2iA Recognition System for Handwritten Checks,” Proc. Document Analysis Systems, pp. 431-494, 1996.
[3]
C.Y. Suen L. Lam D. Guillevic N.W. Strathy M. Cheriet J.N. Said and R. Fan, “Bank Check Processing System,” Int'l J. Imaging Systems and Technology, vol. 7, pp. 392-403, 1996.
[4]
A.M. Gillies, “Cursive Word Recognition Using Hidden Markov Models,” Proc. Fifth U.S. Postal Service Advanced Technology Conf., pp. 557-562, 1992.
[5]
M. Gilloux M. Leroux and J.M. Bertille, “Strategies for Cursive Script Recognition Using Hidden Markov Models,” Machine Vision and Applications, vol. 8, no. 4, pp. 197-205, 1995.
[6]
M.Y. Chen A. Kundu and S.N. Srihari, “Variable Duration Hidden Markov Model and Morphological Segmentation for Handwritten Word Recognition,” IEEE Trans. Image Processing, vol. 4, no. 12, pp. 1,675-1,687, Dec. 1995.
[7]
R.M. Bozinovic and S.N. Srihari, “Off-Line Cursive Word Recognition,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 11, no.1, pp. 68-83, Jan. 1989.
[8]
F. Kimura M. Shridhar and Z. Chen, “Improvements of a Lexicon Directed Algorithm for Recognition of Unconstrained Handwritten Words,” Proc. Int'l Conf. Document Analysis and Recognition, pp. 18-22, Oct. 1993.
[9]
J. Picone, “Continuous Speech Recognition Using Hidden Markov Models,” IEEE Signal Processing Magazine, pp. 26-41, July 1990.
[10]
L. Bahl F. Jelinek and R. Mercer, “A Maximum Likelihood Approach to Speech Recognition,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 5, no. 2, pp. 179-190, Mar. 1983.
[11]
L.R. Rabiner, “A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition,” Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
[12]
M. Magdi and P. Gader, “Handwritten Word Recognition Using Segmentation-Free Hidden Markov Modeling and Segmentation-Based Dynamic Programming Techniques,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 18, no. 5, pp. 548-554, May 1996.
[13]
F. Jelinek R.L. Mercer and S. Roukos, “Principles of Lexical Language Modeling for Speech Recognition,” Advances in Speech Signal Processing, S. Furui and M.M. Sondhi, eds., pp. 651-699, 1992.
[14]
L. Yang B.K. Widjaja and R. Prasad, “Application of Hidden Markov Models for Signature Verification,” Pattern Recognition, vol. 28, no. 2, pp. 161-170, Feb. 1995.
[15]
R.M. Gray, “Vector Quantization,” IEEE Signal Processing Magazine, pp. 4-29, Apr. 1984.
[16]
L.A. Liporace, “Maximum Likelihood Estimation for Multivariate Observation of Markov Sources,” IEEE Trans. Information Theory, vol. 28, no. 5, pp. 729-734, Sept. 1982.
[17]
X.D. Huang and M.A. Jack, “Semi-Continuous Hidden Markov Models for Speech Signals,” Computer Speech and Language, vol. 3, pp. 239-251, 1989.
[18]
T.K. Moon, “The Expectation-Maximization Algorithm,” IEEE Signal Processing Magazine, pp. 47-60, Nov. 1996.
[19]
G.D. Forney, “The Viterbi Algorithm,” Proc. IEEE, vol. 61, no. 3, pp. 268-278, Mar. 1973.
[20]
A. El-Yacoubi J.M. Bertille and M. Gilloux, “Towards a More Effective Handwritten Word Recognition System,” Proc. Fourth Int'l Workshop Frontiers in Handwriting Recognition, pp. 378-385, Dec. 1994.
[21]
W. Cho S.W. Lee and J.H. Kim, “Modeling and Recognition of Cursive Words with Hidden Markov Models,” Pattern Recognition, vol. 28, no. 12, pp. 1,941-1,953, Dec. 1995.
[22]
R. Nag K.H. Wong and F. Fallside, “Script Recognition Using Hidden Markov Models,” IEEE Int'l Conf. Acoustics Signal and Speech Processing, pp. 2,071-2,074, 1986.
[23]
S. Bercu and G. Lorette, “On-Line Handwritten Word Recognition: An Approach based on Hidden Markov Models,” Proc. Third Int'l Workshop Frontiers in Handwriting Recognition, pp. 385-390, May 1993.
[24]
J.Y. Ha S.C. Oh and J.H. Kim, “Recognition of Unconstrained Handwritten English Words With Character and Ligature Modeling,” Int'l J. Pattern Recognition and Artificial Intelligence, vol. 9, no. 3, pp. 535-556, June 1995.
[25]
H. Bunke M. Roth and E.G. Schukat-Talamazzini, “Off-Line Cursive Handwriting Recognition Using Hidden Markov Models,” Pattern Recognition, vol. 28, no. 9, pp. 1,399-1,413, Sept. 1995.
[26]
E.G. Schukat-Talamazzini and H. Niemann, “Das ISADORA-System—Ein Akustischphonetisches Netzwerk zur Automatischen Spracherkennung,” Musterekennung, 13. DAGM Symp., pp. 251-258, 1991.
[27]
M.Y. Chen A. Kundu and J. Zhou, “Off-Line Handwritten Word Recognition Using a Hidden Markov Model Type Stochastic Network,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 16, no. 5, pp. 481-496, May 1994.
[28]
A. El-Yacoubi R. Sabourin M. Gilloux and C.Y. Suen, “Improved Model Architecture and Training Phase in an Off-Line HMM-Based Word Recognition System,” Int'l Conf. Pattern Recognition, pp. 16-20, Aug. 1998.

Cited By

View all
  • (2023)Faster DAN: Multi-target Queries with Document Positional Encoding for End-to-End Handwritten Document RecognitionDocument Analysis and Recognition - ICDAR 202310.1007/978-3-031-41685-9_12(182-199)Online publication date: 21-Aug-2023
  • (2023)A Comprehensive Handwritten Paragraph Text Recognition System: LexiconNetDocument Analysis and Recognition – ICDAR 2023 Workshops10.1007/978-3-031-41501-2_16(226-241)Online publication date: 21-Aug-2023
  • (2021)An image database of handwritten Bangla words with automatic benchmarking facilities for character segmentation algorithmsNeural Computing and Applications10.1007/s00521-020-04981-w33:1(449-468)Online publication date: 1-Jan-2021
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image IEEE Transactions on Pattern Analysis and Machine Intelligence
IEEE Transactions on Pattern Analysis and Machine Intelligence  Volume 21, Issue 8
August 1999
145 pages
ISSN:0162-8828
Issue’s Table of Contents

Publisher

IEEE Computer Society

United States

Publication History

Published: 01 August 1999

Author Tags

  1. Handwriting modeling
  2. feature extraction
  3. hidden Markov models
  4. preprocessing
  5. rejection.
  6. segmentation
  7. word recognition

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 24 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Faster DAN: Multi-target Queries with Document Positional Encoding for End-to-End Handwritten Document RecognitionDocument Analysis and Recognition - ICDAR 202310.1007/978-3-031-41685-9_12(182-199)Online publication date: 21-Aug-2023
  • (2023)A Comprehensive Handwritten Paragraph Text Recognition System: LexiconNetDocument Analysis and Recognition – ICDAR 2023 Workshops10.1007/978-3-031-41501-2_16(226-241)Online publication date: 21-Aug-2023
  • (2021)An image database of handwritten Bangla words with automatic benchmarking facilities for character segmentation algorithmsNeural Computing and Applications10.1007/s00521-020-04981-w33:1(449-468)Online publication date: 1-Jan-2021
  • (2021)Offline handwritten Gurumukhi word recognition using eXtreme Gradient Boosting methodologySoft Computing - A Fusion of Foundations, Methodologies and Applications10.1007/s00500-020-05455-w25:6(4451-4464)Online publication date: 1-Mar-2021
  • (2021)A Self-attention Based Model for Offline Handwritten Text RecognitionPattern Recognition10.1007/978-3-031-02444-3_27(356-369)Online publication date: 9-Nov-2021
  • (2020)Handwriting recognition using cohort of LSTM and lexicon verification with extremely large lexiconMultimedia Tools and Applications10.1007/s11042-020-09198-679:45-46(34407-34427)Online publication date: 1-Dec-2020
  • (2016)Bangla Handwritten Character Segmentation Using Structural FeaturesACM Transactions on Asian and Low-Resource Language Information Processing10.1145/289049715:4(1-26)Online publication date: 12-Apr-2016
  • (2016)Lexicon reduction of handwritten Arabic subwords based on the prominent shape regionsInternational Journal on Document Analysis and Recognition10.1007/s10032-016-0262-619:2(139-153)Online publication date: 1-Jun-2016
  • (2015)Label EmbeddingInternational Journal of Computer Vision10.1007/s11263-014-0793-6113:3(193-207)Online publication date: 1-Jul-2015
  • (2014)Neural networks for document image preprocessingArtificial Intelligence Review10.1007/s10462-012-9337-z42:2(253-273)Online publication date: 1-Aug-2014
  • Show More Cited By

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media