More Web Proxy on the site http://driver.im/

research-article

Text localization in natural images through effective re-identification of the MSER

Authors:

Hanaa F Mahmood,

Eran EdirisingheAuthors Info & Claims

IML '17: Proceedings of the 1st International Conference on Internet of Things and Machine Learning

Article No.: 42, Pages 1 - 9

https://doi.org/10.1145/3109761.3109803

Published: 17 October 2017 Publication History

Abstract

Text detection and recognition from images have numerous applications for document analysis and information retrieval tasks. An accurate and robust method for detecting texts in natural scene images is proposed in this paper. Text-region candidates are detected using maximally stable extremal regions (MSER) and a machine learning based method is then applied to refine and validate the initial detection. The effectiveness of features based on aspect ratio, GLSM, LBP, HOG descriptors are investigated. Text-region classifiers of MLP, SVM and RF are trained using selections of these features and their combination. A publicly available multilingual dataset ICDAR 2003,2011 has been used to evaluate the method. The proposed method achieved excellent performance on both databases and the improvements are significant in terms of Precision, Recall, and F-measure. The results show that using a suitable feature combination and selection approach can can significantly increase the accuracy of the algorithms. Keywords---text detection; scene images; ICDAR; feature selection.

References

[1]

Chen, H. et al., 2011. Robust text detection in natural images with edge-enhanced maximally stable extremal regions. Proceedings - International Conference on Image Processing, ICIP, pp.2609--2612.

[2]

Chowdhury, A., Bhattacharya, U. & Parui, S.K., 2012. Scene text detection using sparse stroke information and MLP. 21st International Conference on Pattern Recognition (ICPR 2012), (Icpr), pp.294--297.

[3]

Clausi, D. A., (2002). An analysis of co-occurrence texture statistics as a function of grey level quantization. Canadian Journal of remote sensing, 28(1):,pp45--62.

[4]

Dalal, N., and Triggs, B.,. (2005) Histograms of oriented gradients for human detection. In Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, volume 1, PP: 886--893. IEEE.

Digital Library

[5]

Epshtein, B., Ofek, E. & Wexler, Y., 2010. Detecting Text in Natural Scenes with stroke width transform. IEEE Conf. Comput. Vis. Pattern Recognit, (d), pp.2963--2970

[6]

Gomez, L., Karatzas, D., 2013 "Multi-script text extraction from natural scenes," in ICDAR,

Digital Library

[7]

Gonzalez, Alvaro, et al. (2012) "Text location in complex images." Pattern Recognition (ICPR), 21st International Conference on. IEEE, 2012.

[8]

Hanif S. M., Prevost, L., (2009), Text Detection and Localization in Complex Scene Images using Constrained AdaBoost Algorithm, 10th International Conference on Document Analysis and Recognition.

Digital Library

[9]

Haralick, R. M., Shanmuga, K., & Dinstein, H. (1973). Textural features for image classification. IEEE Transactions on Systems Man and Cybernetics, 3(6),pp:610--621.

[10]

Kim, K.I., Jung, K. & Kim, J.H., 2003. Texture-Based Approach for Text Detection in Images Using Support Vector Machines and Continuously Adaptive Mean Shift Algorithm æ. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 25(12), pp.1631--1639.

Digital Library

[11]

Kwak, J. T., Xu, S., & Wood, B. J. (2015). Efficient Data Mining for Local Binary Pattern in Texture Image Analysis. Expert Systems with Applications, 42(9), 4529--4539.

Digital Library

[12]

Lucas, S. M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R., Ashida, K., Nagai, H., Okamoto, M., Yamamoto, H. Miyao, H., J. Zhu, W. Ou, C. Wolf, Jolion, Todoran, J.-M., L., Worring, M., and Lin. X., (2005), ICDAR 2003 robust reading competitions: entries, results, and future directions. IJDAR, 7(2-3), PP:105--122.

Digital Library

[13]

Lucas, S.M. Panaretos, A. Sosa, L. Tang, A. Wong, S. Young, R., 2003. ICDAR 2003 Robust Reading Competitions. In INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR).

Digital Library

[14]

Matas, J. Chum, M. Urban, and T. Pajdla, 2002. Robust Wide Baseline Stereo from Maximally Stable Extremal Regions. Proc. of British Machine Vision Conference, pp.384--393.

[15]

Minetto, R., Thome, N., Cord, M., Stolfi, J. Precioso F., Guyomard, J., and Leite, N. J., (2011), "Text detection and recognition in urban scenes," in IEEE ICCV Workshops, 2011, pp: 227--234.

[16]

Neumann, L. & Matas, J., 2010. a Method for Text Localization and Recognition in Real - World Images. In 10th Asian conference on computer Vision, ACCV. pp. 770--783.

Digital Library

[17]

Neumann, L. Matas, J. 2013, On combining multiple segmentations in scene text recognition, in: Proceedings of the ICDAR, pp. 523--527.

Digital Library

[18]

Neumann, L. Matas, J., 2012, Real-time scene text localization and recognition, in: Proceedings of the CVPR, pp. 3538--3545.

Digital Library

[19]

Pan, Y., Hou, X. & Liu, C., 2011. A Hybrid Approach to Detect and Localize Texts in. IEEE TRANSACTIONS ON IMAGE PROCESSING, 20(3), pp.800--813.

Digital Library

[20]

Seeri, S. V, Pujari, J.D. & Hiremath, P.S., 2015. Multilingual Text Localization in Natural Scene Images using Wavelet based Edge Features and Fuzzy Classification., 4(1), pp.210--218.

[21]

Serra, J., Simon (Ed.) J.C., (1989), Toggle Mappings: From Pixels to Features, Elsevier, pp: 61--72

[22]

Shi, C. Chunheng, W. Baihua, X. Yang, Z Song, G., 2013. Scene text detection using graph model built upon maximally stable extremal regions. Pattern Recognition Letters, 34(2), pp.107--116.

Digital Library

[23]

Soh L. K. and satsoulis, C. T, (1999), Texture analysis of sar sea ice imagery using gray level co-occurrence matrices. Geoscience and Remote Sensing, IEEE Transactions on, 37(2), PP:780--795.

[24]

Wang, X. Song, Y., Zhang, Y., Xin, J, 2015. Natural scene text detection with multi-layer segmentation and higher order conditional random field based analysis. Pattern Recognition Letters, 60-61, pp.41--47.

Digital Library

[25]

Wang, X. Song, Y., Zhang, Y., Xin, J, 2015. Natural scene text detection with multi-layer segmentation and higher order conditional random field based analysis. Pattern Recognition Letters, 60-61, pp.41--47.

Digital Library

[26]

Yao, C. et al., 2012. Detecting Texts of Arbitrary Orientations in Natural Images. In IEEE Conf. CVPR. pp. 1083--1090.

Digital Library

[27]

Ye, Q. & Doermann, D., 2015. Text Detection and Recognition in Images and Video : a Survey. IEEE transactions on pattern analysis and machine intelligence, 37(7), pp.1480--1500

[28]

Yi, C., Tian, Y., (2012), Localizing text in scene images by boundary clustering, stroke segmentation, and string fragment

Recommendations

An enhanced text detection technique for the visually impaired to read text

An enhanced text detection technique (ETDT) is proposed, which is expected to aid the visually impaired to overcome their reading challenges. This work enhances the edge-preserving maximally stable extremal regions (eMSER) algorithm using the pyramid ...
Text detection in natural scene images based on color prior guided MSER
Abstract
In this paper, we focus on text detection in natural scene images which is conducive to content-based wild image analysis and understanding. This task is still an open problem and usually includes two key issues: text candidate ...
A Robust Ensemble of ResNets for Character Level End-to-end Text Detection in Natural Scene Images
CBMI '17: Proceedings of the 15th International Workshop on Content-Based Multimedia Indexing

Detecting text in natural scene images is a challenging task. In this paper, we propose a character-level end-to-end text detection algorithm in natural scene images. In general, text detection tasks are categorized into three parts: text localization, ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

IML '17: Proceedings of the 1st International Conference on Internet of Things and Machine Learning

October 2017

581 pages

ISBN:9781450352437

DOI:10.1145/3109761

General Chairs:
Hani Hamdan
University of Paris-Saclay, Paris, France
,
Djallel Eddine Boubiche
University of Batna, Algeria
,
Program Chair:
Fanny Klett
German Workforce ADL Partnership Laboratory, Germany

Copyright © 2017 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 October 2017

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

IML 2017

IML 2017: International Conference on Internet of Things and Machine Learning

October 17 - 18, 2017

Liverpool, United Kingdom

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
77
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 10 Dec 2024

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents