[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3109761.3109803acmotherconferencesArticle/Chapter ViewAbstractPublication PagesimlConference Proceedingsconference-collections
research-article

Text localization in natural images through effective re-identification of the MSER

Published: 17 October 2017 Publication History

Abstract

Text detection and recognition from images have numerous applications for document analysis and information retrieval tasks. An accurate and robust method for detecting texts in natural scene images is proposed in this paper. Text-region candidates are detected using maximally stable extremal regions (MSER) and a machine learning based method is then applied to refine and validate the initial detection. The effectiveness of features based on aspect ratio, GLSM, LBP, HOG descriptors are investigated. Text-region classifiers of MLP, SVM and RF are trained using selections of these features and their combination. A publicly available multilingual dataset ICDAR 2003,2011 has been used to evaluate the method. The proposed method achieved excellent performance on both databases and the improvements are significant in terms of Precision, Recall, and F-measure. The results show that using a suitable feature combination and selection approach can can significantly increase the accuracy of the algorithms. Keywords---text detection; scene images; ICDAR; feature selection.

References

[1]
Chen, H. et al., 2011. Robust text detection in natural images with edge-enhanced maximally stable extremal regions. Proceedings - International Conference on Image Processing, ICIP, pp.2609--2612.
[2]
Chowdhury, A., Bhattacharya, U. & Parui, S.K., 2012. Scene text detection using sparse stroke information and MLP. 21st International Conference on Pattern Recognition (ICPR 2012), (Icpr), pp.294--297.
[3]
Clausi, D. A., (2002). An analysis of co-occurrence texture statistics as a function of grey level quantization. Canadian Journal of remote sensing, 28(1):,pp45--62.
[4]
Dalal, N., and Triggs, B.,. (2005) Histograms of oriented gradients for human detection. In Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, volume 1, PP: 886--893. IEEE.
[5]
Epshtein, B., Ofek, E. & Wexler, Y., 2010. Detecting Text in Natural Scenes with stroke width transform. IEEE Conf. Comput. Vis. Pattern Recognit, (d), pp.2963--2970
[6]
Gomez, L., Karatzas, D., 2013 "Multi-script text extraction from natural scenes," in ICDAR,
[7]
Gonzalez, Alvaro, et al. (2012) "Text location in complex images." Pattern Recognition (ICPR), 21st International Conference on. IEEE, 2012.
[8]
Hanif S. M., Prevost, L., (2009), Text Detection and Localization in Complex Scene Images using Constrained AdaBoost Algorithm, 10th International Conference on Document Analysis and Recognition.
[9]
Haralick, R. M., Shanmuga, K., & Dinstein, H. (1973). Textural features for image classification. IEEE Transactions on Systems Man and Cybernetics, 3(6),pp:610--621.
[10]
Kim, K.I., Jung, K. & Kim, J.H., 2003. Texture-Based Approach for Text Detection in Images Using Support Vector Machines and Continuously Adaptive Mean Shift Algorithm æ. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 25(12), pp.1631--1639.
[11]
Kwak, J. T., Xu, S., & Wood, B. J. (2015). Efficient Data Mining for Local Binary Pattern in Texture Image Analysis. Expert Systems with Applications, 42(9), 4529--4539.
[12]
Lucas, S. M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R., Ashida, K., Nagai, H., Okamoto, M., Yamamoto, H. Miyao, H., J. Zhu, W. Ou, C. Wolf, Jolion, Todoran, J.-M., L., Worring, M., and Lin. X., (2005), ICDAR 2003 robust reading competitions: entries, results, and future directions. IJDAR, 7(2-3), PP:105--122.
[13]
Lucas, S.M. Panaretos, A. Sosa, L. Tang, A. Wong, S. Young, R., 2003. ICDAR 2003 Robust Reading Competitions. In INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR).
[14]
Matas, J. Chum, M. Urban, and T. Pajdla, 2002. Robust Wide Baseline Stereo from Maximally Stable Extremal Regions. Proc. of British Machine Vision Conference, pp.384--393.
[15]
Minetto, R., Thome, N., Cord, M., Stolfi, J. Precioso F., Guyomard, J., and Leite, N. J., (2011), "Text detection and recognition in urban scenes," in IEEE ICCV Workshops, 2011, pp: 227--234.
[16]
Neumann, L. & Matas, J., 2010. a Method for Text Localization and Recognition in Real - World Images. In 10th Asian conference on computer Vision, ACCV. pp. 770--783.
[17]
Neumann, L. Matas, J. 2013, On combining multiple segmentations in scene text recognition, in: Proceedings of the ICDAR, pp. 523--527.
[18]
Neumann, L. Matas, J., 2012, Real-time scene text localization and recognition, in: Proceedings of the CVPR, pp. 3538--3545.
[19]
Pan, Y., Hou, X. & Liu, C., 2011. A Hybrid Approach to Detect and Localize Texts in. IEEE TRANSACTIONS ON IMAGE PROCESSING, 20(3), pp.800--813.
[20]
Seeri, S. V, Pujari, J.D. & Hiremath, P.S., 2015. Multilingual Text Localization in Natural Scene Images using Wavelet based Edge Features and Fuzzy Classification., 4(1), pp.210--218.
[21]
Serra, J., Simon (Ed.) J.C., (1989), Toggle Mappings: From Pixels to Features, Elsevier, pp: 61--72
[22]
Shi, C. Chunheng, W. Baihua, X. Yang, Z Song, G., 2013. Scene text detection using graph model built upon maximally stable extremal regions. Pattern Recognition Letters, 34(2), pp.107--116.
[23]
Soh L. K. and satsoulis, C. T, (1999), Texture analysis of sar sea ice imagery using gray level co-occurrence matrices. Geoscience and Remote Sensing, IEEE Transactions on, 37(2), PP:780--795.
[24]
Wang, X. Song, Y., Zhang, Y., Xin, J, 2015. Natural scene text detection with multi-layer segmentation and higher order conditional random field based analysis. Pattern Recognition Letters, 60-61, pp.41--47.
[25]
Wang, X. Song, Y., Zhang, Y., Xin, J, 2015. Natural scene text detection with multi-layer segmentation and higher order conditional random field based analysis. Pattern Recognition Letters, 60-61, pp.41--47.
[26]
Yao, C. et al., 2012. Detecting Texts of Arbitrary Orientations in Natural Images. In IEEE Conf. CVPR. pp. 1083--1090.
[27]
Ye, Q. & Doermann, D., 2015. Text Detection and Recognition in Images and Video : a Survey. IEEE transactions on pattern analysis and machine intelligence, 37(7), pp.1480--1500
[28]
Yi, C., Tian, Y., (2012), Localizing text in scene images by boundary clustering, stroke segmentation, and string fragment

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
IML '17: Proceedings of the 1st International Conference on Internet of Things and Machine Learning
October 2017
581 pages
ISBN:9781450352437
DOI:10.1145/3109761
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 October 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. ICDAR
  2. feature selection
  3. scene images
  4. text detection

Qualifiers

  • Research-article

Conference

IML 2017

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 77
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 10 Dec 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media