Guided Text Spotting for Assistive Blind Navigation in Unfamiliar Indoor Environments

Xuejian Rong²⁵,
Bing Li²⁵,
J. Pablo Muñoz²⁶,
Jizhong Xiao^25,26,
Aries Arditi²⁷ &
…
Yingli Tian^25,26

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 10073))

Included in the following conference series:

International Symposium on Visual Computing

1970 Accesses
6 Citations

Abstract

Scene text in indoor environments usually preserves and communicates important contextual information which can significantly enhance the independent travel of blind and visually impaired people. In this paper, we present an assistive text spotting navigation system based on an RGB-D mobile device for blind or severely visually impaired people. Specifically, a novel spatial-temporal text localization algorithm is proposed to localize and prune text regions, by integrating stroke-specific features with a subsequent text tracking process. The density of extracted text-specific feature points serves as an efficient text indicator to guide the user closer to text-likely regions for better recognition performance. Next, detected text regions are binarized and recognized by off-the-shelf optical character recognition methods. Significant non-text indicator signage can also be matched to provide additional environment information. Both recognized results are then transferred to speech feedback for user interaction. Our proposed video text localization approach is quantitatively evaluated on the ICDAR 2013 dataset, and the experimental results demonstrate the effectiveness of our proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 35.99; Price includes VAT (United Kingdom)

Softcover Book: GBP 44.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Scene Text Detection and Tracking for a Camera-Equipped Wearable Reading Assistant for the Blind

Assistive Text Reading from Natural Scene for Blind Persons

Real-Time Text Tracking for Text-to-Speech Translation Camera for the Blind

Notes

References

Xiong, B., Grauman, K.: Text detection in stores using a repetition prior. In: WACV (2016)
Google Scholar
Qin, S., Manduchi, R.: A fast and robust text spotter. In: WACV (2016)
Google Scholar
Yin, X., Zuo, Z., Tian, S., Liu, C.: Text detection, tracking and recognition in video: a comprehensive survey. IEEE Trans. Image Process. (2016)
Google Scholar
Busta, M., Neumann, L., Matas, J.: FASText: efficient unconstrained scene text detector. In: ICCV (2015)
Google Scholar
Jaderberg, M., Vedaldi, A., Zisserman, A.: Deep features for text spotting. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8692, pp. 512–528. Springer, Heidelberg (2014). doi:10.1007/978-3-319-10593-2_34
Google Scholar
Yin, X., Yin, X., Huang, K., Hao, H.: Robust text detection in natural scene images. IEEE Trans. Pattern Anal. Mach. Intell. (2014)
Google Scholar
Rakshit, S., Basu, S.: Recognition of handwritten roman script using tesseract open source ocr engine. arXiv.org (2010)
Munõz, J.P., Li, B., Rong, X., Xiao, J., Tian, Y., Arditi, A.: Demo: assisting visually impaired people navigate indoors. In: International Joint Conference on Artificial Intelligence (IJCAI), pp. 4260–4261 (2016)
Google Scholar
Lees, Y., Medioni, G.: RGB-D camera based wearable navigation system for the visually impaired. Comput. Vis. Image Underst. 149, 3–20 (2016)
Article Google Scholar
Li, B., Muñoz, J.P., Rong, X., Xiao, J., Tian, Y., Arditi, A.: ISANA: wearable context-aware indoor assistive navigation with obstacle avoidance for the blind. In: Hua, G., Jégou, H. (eds.) ECCV 2016 Workshop. LNCS, vol. 9914, pp. 448–462. Springer, Heidelberg (2016)
Chapter Google Scholar
Li, B., Zhang, X., Muñoz, J.P., Xiao, J., Rong, X., Tian, Y.: Assisting blind people to avoid obstacles: an wearable obstacle stereo feedback system based on 3D detection. In: IEEE International Conference on Robotics and Biomimetics (ROBIO) (2015)
Google Scholar
Rong, X., Yi, C., Yang, X., Tian, Y.: Scene text recognition in multiple frames based on text tracking. In: IEEE International Conference on Multimedia and Expo (2014)
Google Scholar
Rong, X., Yi, C., Tian, Y.: Recognizing text-based traffic guide panels with cascaded localization network. In: Hua, G., Jégou, H. (eds.) ECCV 2016. LNCS, vol. 9913, pp. 109–121. Springer, Heidelberg (2016). doi:10.1007/978-3-319-46604-0_8
Chapter Google Scholar
Yi, C., Tian, Y.: Text string detection from natural scenes by structure-based partition and grouping. IEEE Trans. Image Process. 20, 2594–2605 (2011)
Article MathSciNet Google Scholar
Yi, C., Tian, Y., Arditi, A.: Portable camera-based assistive text and product label reading from hand-held objects for blind persons. IEEE Trans. Mechatron. 19, 808–817 (2014)
Article Google Scholar
Balntas, V., Tang, L., Mikolajczyk, K.: Bold - binary online learned descriptor for efficient image matching. In: CVPR (2015)
Google Scholar
Ozuysal, M., Calonder, M., Lepetit, V., Fua, P.: Fast keypoint recognition using random ferns. IEEE Trans. Pattern Anal. Mach. Intell. 32, 448–461 (2010)
Article Google Scholar
Rosten, E., Drummond, T.: Fusing points and lines for high performance tracking. In: ICCV (2005)
Google Scholar
Friedman, J., Hastie, T., Tibshirani, R.: Additive logistic regression a statistical view of boosting. Ann. Stat. 28, 337–407 (2000)
Article MathSciNet MATH Google Scholar
Karatzas, D.: ICDAR 2013 robust reading competition. In: ICDAR (2013)
Google Scholar
Goto, H., Tanaka, M.: Text-tracking wearable camera system for the blind. In: ICDAR (2009)
Google Scholar
Wu, L., Shivakumara, P., Lu, T.: A new technique for multi-oriented scene text line detection and tracking in video. IEEE Trans. Multimed. 17, 1137–1152 (2015)
Article Google Scholar
Cambra, A., Murillo, A.: Towards robust and efficient text sign reading from a mobile phone (2011)
Google Scholar
Li, H., Doermann, D., Kia, O.: Automatic text detection and tracking in digital video. IEEE Trans. Image Process. 9, 147–156 (2000)
Article Google Scholar
Mosleh, A., Bouguila, N., Hamza, A.: Automatic inpainting scheme for video text detection and removal. IEEE Trans. Image Process. 22, 4460–4472 (2013)
Article MathSciNet Google Scholar
Zhao, X., Lin, K., Fu, Y., Hu, Y., Liu, Y.: Text from corners: a novel approach to detect text and caption in videos. IEEE Trans. Image Process. 20, 790–799 (2011)
Article MathSciNet Google Scholar

Download references

Acknowledgements

This work was supported in part by U.S. Federal Highway Administration (FHWA) grant DTFH 61-12-H-00002, National Science Foundation (NSF) grants CBET-1160046, EFRI-1137172 and IIP-1343402, National Institutes of Health (NIH) grant EY023483.

Author information

Authors and Affiliations

The City College, City University of New York, New York, New York, USA
Xuejian Rong, Bing Li, Jizhong Xiao & Yingli Tian
The Graduate Center, City University of New York, New York, New York, USA
J. Pablo Muñoz, Jizhong Xiao & Yingli Tian
Visibility Metrics LLC, Chappaqua, New York, USA
Aries Arditi

Authors

Xuejian Rong
View author publications
You can also search for this author in PubMed Google Scholar
Bing Li
View author publications
You can also search for this author in PubMed Google Scholar
J. Pablo Muñoz
View author publications
You can also search for this author in PubMed Google Scholar
Jizhong Xiao
View author publications
You can also search for this author in PubMed Google Scholar
Aries Arditi
View author publications
You can also search for this author in PubMed Google Scholar
Yingli Tian
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xuejian Rong .

Editor information

Editors and Affiliations

University of Nevada, Reno, Nevada, USA
George Bebis
NASA Ames Research Center, Moffett Field, California, USA
Richard Boyle
Lawrence Berkeley National Laboratory, Berkeley, California, USA
Bahram Parvin
Desert Research Institute, Reno, Nevada, USA
Darko Koracin
The Australian National University, O’Malley, Aust Capital Terr, Australia
Fatih Porikli
Pilot AI Labs, Redwood City, California, USA
Sandra Skaff
University of Florida, Gainesville, Florida, USA
Alireza Entezari
Google Inc., Mountain View, California, USA
Jianyuan Min
Osaka University, Osaka, Japan
Daisuke Iwai
The MOVES Institute, Monterey, California, USA
Amela Sadagic
University of Arizona, Tucson, Arizona, USA
Carlos Scheidegger
Université Paris-Sud, Orsay, France
Tobias Isenberg

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rong, X., Li, B., Muñoz, J.P., Xiao, J., Arditi, A., Tian, Y. (2016). Guided Text Spotting for Assistive Blind Navigation in Unfamiliar Indoor Environments. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2016. Lecture Notes in Computer Science(), vol 10073. Springer, Cham. https://doi.org/10.1007/978-3-319-50832-0_2

Download citation

DOI: https://doi.org/10.1007/978-3-319-50832-0_2
Published: 10 December 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50831-3
Online ISBN: 978-3-319-50832-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics