[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ Skip to main content
Log in

Foreground text segmentation in complex color document images using Gabor filters

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

Extraction of foreground contents in complex background document images is very difficult as background texture, color and foreground font, size, color, tilt are not known in advance. In this work, we propose a RGB color model for the input of complex color document images. An algorithm to detect the text regions using Gabor filters followed by extraction of text using color feature luminance is developed too. The proposed approach consists of three stages. Based on the Gabor features, the candidate image segments containing text are detected in stage-1. Because of complex background, certain amount of high frequency non-text objects in the background are also detected as text objects in stage-1. In stage-2, certain amount of false text objects is dropped by performing the connected component analysis. In stage-3, the image segments containing textual information, which are obtained from the previous stage are binarized to extract the foreground text. The color feature luminance is extracted from the input color document image. The threshold value is derived automatically using this color feature. The proposed approach handles both printed and handwritten color document images with foreground text in any color, font, size and orientation. For experimental evaluations, we have considered a variety of document images having non-uniform/uniform textured and multicolored background. Performance of segmentation of foreground text is evaluated on a commercially available OCR. Evaluation results show better recognition accuracy of foreground characters in the processed document images against unprocessed document images.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (United Kingdom)

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Bovik A.C., Clark M., Geisler W.S.: Multichannel texture analysis using localized spatial filters. IEEE Trans. Pattern Anal. Mach. Intell. 12(1), 55–73 (1990)

    Article  Google Scholar 

  2. Campbell F.W., Robson J.G.: Application of fourier analysis of the visibility of gratings. J. Physiol. 197(3), 551–566 (1968)

    Google Scholar 

  3. Farhoodi, R., Kasaei, S.: Text segmentation from images with textured and colored background. In: Proceedings of 13th Iranian Conference on Electrical Engineering. Zanjan, Iran, May (2005)

  4. Gabor D.: Theory of communication. J. IEE 93(26), 429–457 (1946)

    Google Scholar 

  5. Gonzalez, R.C., Woods, R.E.: Digital Image Processing, Second Edition, Ch.9. Prentice Hall, pp. 523–528 (2002)

  6. Jain A.K., Bhattacharjee S.K.: Text segmentation using gabor filters for automatic document processing. Mach. Vis. Appl. 5(3), 169–184 (1992a)

    Article  Google Scholar 

  7. Jain A.K., Bhattacharjee S.K.: Address block location on envelopes using gabor filters. Pattern Recognit. 25(12), 1459–1477 (1992b)

    Article  Google Scholar 

  8. Jain, A.K., Bhattacharjee, S.K., Chen, Y.: On texture in document images. In: Proceedings of IEEE Conference on CVPR, Champaign, IL, pp. 677–680, 15–18 June (1992)

  9. Kim K.I., Jung K., Kim H.J.: Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 25(12), 1631–1639 (2003)

    Article  MathSciNet  Google Scholar 

  10. Leedham, G., Chen, Y., Takru, K., Tan, J.H.N., Mian, L.: Comparison of some thresholding algorithms for text/background segmentation in difficult document images. In: Proceedings of Seventh International Conf. on Document Analysis and Recognition, Edinburgh, Scotland, pp. 859–864, 3–6 Aug. (2003)

  11. Liu, Y., Goto, S., Ikenaga, T.: A robust algorithm for text detection in color images. In: Proceedings of Eighth International Conference on Document Analysis and Recognition, Seoul, Korea, pp. 399–403, 29 Aug.–1 Sept. (2005)

  12. Movellan, J.R.: Tutorial on Gabor Filters. http://www.mplab.ucsd.edu/wordpress/tutorials/gabor.pdf

  13. Navon, Y.: Layer Based Binarization for Textual Images. ICPR, Tampa, Florida, USA, pp. 1–5, 8–11 Dec. (2008)

  14. Pietikainen, M., Okun, O.: Edge-based method for text detection from complex document images. In: Proceedings of Sixth International Conference on Document analysis and Recognition, Seattle, WA, USA, pp. 286–291, 10–13 Sept. (2001)

  15. Qiao, Y.L., Li, M., Lu, Z.-M., Sun, S.-H.: Gabor filter based text extraction from digital document images. In: Proceedings of the International Conference on Intelligent Information Hiding and Multimedia Signal Processing. Pasadena, CA, USA, pp. 297–300, 18–20 Dec. (2006)

  16. Sezgin M., Sankur B.: Survey over image thresholding techniques and quantitative performance evaluation. J. Electronic imaging 13(1), 146–165 (2004)

    Article  Google Scholar 

  17. Tsai C.M., Lee H.J.: Binarization of color document images via luminance and saturation color features. IEEE Trans. Image process. 11(4), 434–451 (2002)

    Article  Google Scholar 

  18. Wang, Q., Xia, T., Li, L., Tan, C.L. (2003) Document image enhancement using directional wavelet. IEEE International Conference on Computer Vision and Pattern Recognition. Madison, WI, USA, pp. 534–542, 16–22 June (2003)

  19. Zhong Y., Karu K., Jain A.K.: Locating text in complex color images. Pattern Recognit. 28(10), 1523–1536 (1995)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to S. Nirmala.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Nirmala, S., Nagabhushan, P. Foreground text segmentation in complex color document images using Gabor filters. SIViP 6, 669–678 (2012). https://doi.org/10.1007/s11760-010-0196-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11760-010-0196-2

Keywords

Navigation