Abstract
Extraction of foreground contents in complex background document images is very difficult as background texture, color and foreground font, size, color, tilt are not known in advance. In this work, we propose a RGB color model for the input of complex color document images. An algorithm to detect the text regions using Gabor filters followed by extraction of text using color feature luminance is developed too. The proposed approach consists of three stages. Based on the Gabor features, the candidate image segments containing text are detected in stage-1. Because of complex background, certain amount of high frequency non-text objects in the background are also detected as text objects in stage-1. In stage-2, certain amount of false text objects is dropped by performing the connected component analysis. In stage-3, the image segments containing textual information, which are obtained from the previous stage are binarized to extract the foreground text. The color feature luminance is extracted from the input color document image. The threshold value is derived automatically using this color feature. The proposed approach handles both printed and handwritten color document images with foreground text in any color, font, size and orientation. For experimental evaluations, we have considered a variety of document images having non-uniform/uniform textured and multicolored background. Performance of segmentation of foreground text is evaluated on a commercially available OCR. Evaluation results show better recognition accuracy of foreground characters in the processed document images against unprocessed document images.
Similar content being viewed by others
References
Bovik A.C., Clark M., Geisler W.S.: Multichannel texture analysis using localized spatial filters. IEEE Trans. Pattern Anal. Mach. Intell. 12(1), 55–73 (1990)
Campbell F.W., Robson J.G.: Application of fourier analysis of the visibility of gratings. J. Physiol. 197(3), 551–566 (1968)
Farhoodi, R., Kasaei, S.: Text segmentation from images with textured and colored background. In: Proceedings of 13th Iranian Conference on Electrical Engineering. Zanjan, Iran, May (2005)
Gabor D.: Theory of communication. J. IEE 93(26), 429–457 (1946)
Gonzalez, R.C., Woods, R.E.: Digital Image Processing, Second Edition, Ch.9. Prentice Hall, pp. 523–528 (2002)
Jain A.K., Bhattacharjee S.K.: Text segmentation using gabor filters for automatic document processing. Mach. Vis. Appl. 5(3), 169–184 (1992a)
Jain A.K., Bhattacharjee S.K.: Address block location on envelopes using gabor filters. Pattern Recognit. 25(12), 1459–1477 (1992b)
Jain, A.K., Bhattacharjee, S.K., Chen, Y.: On texture in document images. In: Proceedings of IEEE Conference on CVPR, Champaign, IL, pp. 677–680, 15–18 June (1992)
Kim K.I., Jung K., Kim H.J.: Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 25(12), 1631–1639 (2003)
Leedham, G., Chen, Y., Takru, K., Tan, J.H.N., Mian, L.: Comparison of some thresholding algorithms for text/background segmentation in difficult document images. In: Proceedings of Seventh International Conf. on Document Analysis and Recognition, Edinburgh, Scotland, pp. 859–864, 3–6 Aug. (2003)
Liu, Y., Goto, S., Ikenaga, T.: A robust algorithm for text detection in color images. In: Proceedings of Eighth International Conference on Document Analysis and Recognition, Seoul, Korea, pp. 399–403, 29 Aug.–1 Sept. (2005)
Movellan, J.R.: Tutorial on Gabor Filters. http://www.mplab.ucsd.edu/wordpress/tutorials/gabor.pdf
Navon, Y.: Layer Based Binarization for Textual Images. ICPR, Tampa, Florida, USA, pp. 1–5, 8–11 Dec. (2008)
Pietikainen, M., Okun, O.: Edge-based method for text detection from complex document images. In: Proceedings of Sixth International Conference on Document analysis and Recognition, Seattle, WA, USA, pp. 286–291, 10–13 Sept. (2001)
Qiao, Y.L., Li, M., Lu, Z.-M., Sun, S.-H.: Gabor filter based text extraction from digital document images. In: Proceedings of the International Conference on Intelligent Information Hiding and Multimedia Signal Processing. Pasadena, CA, USA, pp. 297–300, 18–20 Dec. (2006)
Sezgin M., Sankur B.: Survey over image thresholding techniques and quantitative performance evaluation. J. Electronic imaging 13(1), 146–165 (2004)
Tsai C.M., Lee H.J.: Binarization of color document images via luminance and saturation color features. IEEE Trans. Image process. 11(4), 434–451 (2002)
Wang, Q., Xia, T., Li, L., Tan, C.L. (2003) Document image enhancement using directional wavelet. IEEE International Conference on Computer Vision and Pattern Recognition. Madison, WI, USA, pp. 534–542, 16–22 June (2003)
Zhong Y., Karu K., Jain A.K.: Locating text in complex color images. Pattern Recognit. 28(10), 1523–1536 (1995)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Nirmala, S., Nagabhushan, P. Foreground text segmentation in complex color document images using Gabor filters. SIViP 6, 669–678 (2012). https://doi.org/10.1007/s11760-010-0196-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-010-0196-2