Abstract
In this paper, we present a novel segmentation-free Arabic handwriting recognition system based on hidden Markov models (HMMs). Two main contributions are introduced: a new technique for dividing the image into nonuniform horizontal segments for feature extraction, and a new technique for handling the skewing of characters by fusing multiple HMMs. Two further enhancements are introduced: the pre-processing method and feature extraction using concavity space. The proposed system first pre-processes the input image by setting the stroke thickness of the input word to three pixels and fixing the spacing between the different parts of the word. The input image is then divided into a constant number of nonuniform horizontal segments according to the distribution of the foreground pixels, and a set of robust features representing the gradient of the foreground pixels is extracted using sliding windows. The input image is also decomposed into several images representing the vertical, horizontal, left-diagonal and right-diagonal edges, and a set of robust features representing the densities of the foreground pixels in these edge images is extracted using sliding windows. The proposed system builds character HMMs and learns word HMMs using embedded training. In addition to the vertical sliding window, two slanted sliding windows are used to extract the features. Three different HMMs are therefore used: one for the vertical sliding window and two for the slanted windows. A fusion scheme combines the three HMMs. The proposed system is very promising and outperforms all the other Arabic handwriting recognition systems reported in the literature.
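As a rough illustration of the nonuniform horizontal division described above, the following sketch splits a binary word image into a fixed number of horizontal bands and computes a foreground density per band inside a vertical sliding window. The abstract does not state the exact division rule, so the equal-ink rule used here (each band holds roughly the same share of foreground pixels) is an assumption, as are the window width and the function names `nonuniform_band_edges` and `band_densities`, which are hypothetical and not taken from the paper.

```python
from bisect import bisect_left
from itertools import accumulate


def nonuniform_band_edges(img, n_bands):
    """Return n_bands + 1 row indices that split a binary image (list of
    rows of 0/1) into horizontal bands. ASSUMPTION: each band should hold
    roughly the same share of foreground pixels."""
    height = len(img)
    row_counts = [sum(row) for row in img]      # foreground pixels per row
    total = sum(row_counts)
    if total == 0:
        # No foreground at all: fall back to uniform bands.
        return [round(k * height / n_bands) for k in range(n_bands + 1)]
    cum = list(accumulate(row_counts))          # cumulative foreground count
    # A band ends just after the first row whose cumulative count
    # reaches k / n_bands of the total.
    inner = [bisect_left(cum, total * k / n_bands) + 1
             for k in range(1, n_bands)]
    return [0] + inner + [height]


def band_densities(img, edges, win_width=3):
    """Slide a vertical window of win_width columns (an assumed value)
    across the image and return one density per band for each position."""
    width = len(img[0])
    frames = []
    for x in range(width - win_width + 1):
        frame = []
        for top, bottom in zip(edges, edges[1:]):
            ink = sum(sum(img[y][x:x + win_width]) for y in range(top, bottom))
            area = (bottom - top) * win_width
            frame.append(ink / area if area else 0.0)  # empty band -> 0
        frames.append(frame)
    return frames


# Toy 8x6 image: a dense stroke near the top, a short stroke lower down.
img = [[0] * 6 for _ in range(8)]
img[1] = [1, 1, 1, 1, 1, 1]
img[2] = [1, 1, 1, 0, 0, 0]
img[6] = [1, 1, 1, 0, 0, 0]

edges = nonuniform_band_edges(img, 2)
print(edges)                      # [0, 2, 8]
print(band_densities(img, edges))
```

On the toy image the upper band is only two rows tall while the lower band spans six, because the foreground is concentrated near the top. Each row of the returned list is one feature frame that could feed an HMM observation sequence; the gradient and edge-image features mentioned in the abstract would need their own extractors.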
Cite this article
Azeem, S.A., Ahmed, H. Effective technique for the recognition of offline Arabic handwritten words using hidden Markov models. IJDAR 16, 399–412 (2013). https://doi.org/10.1007/s10032-013-0201-8
DOI: https://doi.org/10.1007/s10032-013-0201-8