Abstract
Text detection in natural scene image is the extraction of the text regions from a natural scene image. The extraction information can be used in the system of text recognition. The texts in natural scene image contain important information. Text detection is an important prerequisite for many computer vision applications, such as license plate recognitions system, information filtering system, automatic navigation and so on. Text detection as a real-life application has to quickly and successfully process the texts in different fonts and under different environmental conditions. It should also be generalized to process texts in different languages and directions. We categorize different text detection techniques according to the methods used for each stage, and compare them in terms of merits, demerits and performance. Feature forecasts of text detection in natural scene image are given at the end.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Neumann, L., Matas, J.: A method for text localization and recognition in real-world images. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010. LNCS, vol. 6494, pp. 770–783. Springer, Heidelberg (2011). doi:10.1007/978-3-642-19318-7_60
Kim, K.I., Jung, K., Jin, H.K.: Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 25(12), 1631–1639 (2003)
Pan, Y.F., Hou, X., Liu, C.L.: A robust system to detect and localize texts in natural scene images. In: The Eighth IAPR International Workshop on Document Analysis Systems, pp. 35–42. IEEE (2008)
Ye, J., Huang, L.L., Hao, X.: Neural network based text detection in videos using local binary patterns. In: Chinese Conference on Pattern Recognition, CCPR 2009, pp. 1–5 (2009)
Lee, J., Lee, P.H., Lee, S.W., et al.: AdaBoost for text detection in natural scene. In: International Conference on Document Analysis and Recognition, pp. 429–434. IEEE Computer Society (2011)
Song, Y., He, Y., Li, Q., et al.: Reading text in street views using Adaboost: towards a system for searching target places. In: 2009 IEEE Intelligent Vehicles Symposium, pp. 227–232. IEEE (2009)
Gllavata, J., Ewerth, R., Freisleben, B.: Text detection in images based on unsupervised classification of high-frequency wavelet coefficients. Proc. Int. Conf. Pattern Recogn. 1(3), 425–428 (2004)
Shivakumara, P., Phan, T.Q., Tan, C.L.: New fourier-statistical features in RGB space for video text detection. IEEE Trans. Circ. Syst. Video Technol. 20(11), 1520–1532 (2010)
Lyu, M.R., Song, J., Cai, M.: A comprehensive method for multilingual video text detection, localization, and extraction. IEEE Trans. Circ. Syst. Video Technol. 15(2), 243–255 (2005)
Zhao, M., Li, S., Kwok, J.: Text detection in images using sparse representation with discriminative dictionaries. Image Vis. Comput. 28(12), 1590–1599 (2010)
Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp. 2963–2970. IEEE (2010)
Yao, C., Bai, X., Liu, W., et al.: Detecting texts of arbitrary orientations in natural images. In: IEEE Conference on Computer Vision & Pattern Recognition, pp. 1083–1090 (2012)
Matas, J., Chum, O., Urban, M., et al.: Robust wide-baseline stereo from maximally stable extremal regions. Image Vis. Comput. 22(10), 761–767 (2004)
Yin, X.C., Yin, X., Huang, K., et al.: Robust text detection in natural scene images. IEEE Trans. Pattern Anal. Mach. Intell. 36(5), 970–983 (2013)
Karatzas, D., Gomezbigorda, L., Nicolaou, A., et al.: ICDAR 2015 Competition on Robust Reading. International Conference on Document Analysis and Recognition (2015)
Neumann, L., Matas, J.: Real-time lexicon-free scene text localization and recognition. 1 (2015)
Lucas, S.M., Panaretos, A., Sosa, L., et al.: ICDAR 2003 robust reading competitions. In: International Conference on Document Analysis and Recognition, p. 682. IEEE Computer Society (2003)
Wolf, C., Jolion, J.M.: Object count/area graphs for the evaluation of object detection and segmentation algorithms. Doc. Anal. Recogn. 8(4), 280–296 (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Wang, S., Fu, C., Li, Q. (2017). Text Detection in Natural Scene Image: A Survey. In: Xin-lin, H. (eds) Machine Learning and Intelligent Communications. MLICOM 2016. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 183. Springer, Cham. https://doi.org/10.1007/978-3-319-52730-7_26
Download citation
DOI: https://doi.org/10.1007/978-3-319-52730-7_26
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-52729-1
Online ISBN: 978-3-319-52730-7
eBook Packages: Computer ScienceComputer Science (R0)