Abstract
Off-line Arabic segmentation has been a popular field of research. It still remains an open problem for discussion. In fact, the challenging nature of Arabic writing which increases the complexity of recognition and segmentation task has attracted the attention of many researchers. This paper proposes and investigates an enhanced algorithm based on the vertical histogram projection and some rules to segment Arabic words with small size. These rules are based on not only the structural characteristics of Arabic language, but also on the baselines positions and their relation with the characters. Our approach aims at cooperating together the segmentation method based on histogram projection and the contextual topographies of Arabic writing in order to improve the segmentation rate. Thus, we use the vertical histogram to detect the preliminary segmentation points and some other rules to find real segmentation points. The proposed approach has been tested with Arabic Printed Text Image Database (APTI). Actually, promising results have been obtained. Compared with the previously-proposed approach, our algorithm gives better result if applied on smaller size.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Amara, M., Zidi, K.: Feature selection using a neuro-genetic approach for arabic text recognition. In: Meta heuristics and Nature Inspired Computing (2012)
Amara, M., Zidi, K.: Arabic text recognition based on neuro-genetic feature selection approach. In: Advanced Machine Learning Technologies and Applications, pp. 3–10. Springer International Publishing (2014)
Alginahi, Y.M.: A survey on arabic character segmentation. Int. J. Doc. Anal. Recogn. 16(2), 105–126 (2013)
Najoua, B.A., Noureddine, E.: A robust approach for arabic printed character segmentation. Doc. Anal. Recogn. 38(4), 420–433 (1995)
Parhami, B., Taraghi, M.: Automatic recognition of printed Farsi texts. Pattern Recogn. 14(1), 395–403 (1981)
Amin, A., Masini, G.: Machine recognition of multifont printed arabic texts. In: Proceedings of International Conference on Pattern Recognition, pp 392–395 (1986)
Hamami, L., Berkani, D.: Recognition system for printed multi-font and multi-size arabic characters. Arab. J. Sci. Eng. 27(1), 57–72 (2002)
Zheng, L., Hassin, A.H., Tang, X.: A new algorithm for machine printed Arabic character segmentation. Pattern Recogn. Lett. 25(15), 1723–1729 (2004)
Abuhaiba, I.S.: A discrete arabic script for better automatic document understanding. Arab. J. Sci. Eng. 28(1), 77–94 (2003)
Amara, M., Zidi, K., Zidi, S., Ghedira, K.: Arabic character recognition based M-SVM: review. In: Advanced Machine Learning Technologies and Applications, pp 18–25. Springer International Publishing (2014)
Amara, M., Ghedira, K., Zidi, K., Zidi, S.: A Comparative study of multi-class support vector machine methods for Arabic characters recognition. In International Conference on Computer Systems and Applications (2015)
Slimane, F., Ingold, R., Kanoun, S., Alimi, A. M., Hennebert, J.: A new arabic printed text image database and evaluation protocols. In: Document Analysis and Recognition, pp. 946–950 (2009)
Acknowledgment
This research and innovation work is carried out within a MOBIDOC thesis funded by the EU under the PASRI project.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Amara, M., Zidi, K., Ghedira, K., Zidi, S. (2016). New Rules to Enhance the Performances of Histogram Projection for Segmenting Small-Sized Arabic Words. In: Abraham, A., Han, S., Al-Sharhan, S., Liu, H. (eds) Hybrid Intelligent Systems. HIS 2016. Advances in Intelligent Systems and Computing, vol 420. Springer, Cham. https://doi.org/10.1007/978-3-319-27221-4_14
Download citation
DOI: https://doi.org/10.1007/978-3-319-27221-4_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27220-7
Online ISBN: 978-3-319-27221-4
eBook Packages: EngineeringEngineering (R0)