More Web Proxy on the site http://driver.im/

research-article

Man made structure detection and verification of object recognition in images for the visually impaired

Authors:

Volker BlanzAuthors Info & Claims

MIRAGE '13: Proceedings of the 6th International Conference on Computer Vision / Computer Graphics Collaboration Techniques and Applications

Article No.: 18, Pages 1 - 8

https://doi.org/10.1145/2466715.2466732

Published: 06 June 2013 Publication History

Abstract

This paper presents two learning based algorithms that are designed for the purpose of extracting and processing suitable information in images for the visually impaired. Both algorithms are developed to be used within a specific modular sonification system. This system is designed to allow visually impaired people to explore images, actively on a touch screen, and to receive an auditory response about the image content at any current finger position. The first algorithm presented in this paper therefore addresses the problem of labeling regions within images, incorporating spatial dependencies. The second algorithm strives to alleviate the rejection of false object detections before sonification. This is crucial to avoid confusion on the side of the blind user, who can not check for a correct image labeling or object detection visually. Due to the modular design principle of the modular sonification system, both algorithms can be incorporated easily and efficiently.

References

[1]

M. Banf and V. Blanz. A modular computer vision sonification model for the visually impaired. In 18th Int. Con. on Auditory Display, 2012.

[2]

M. Banf and V. Blanz. Sonification of images for the visually impaired using a multi-level approach. Augm. Human Int. Conf. in coop. with ACM SIGCHI, 2013.

Digital Library

[3]

G. Bologna et al. Toward local and global perception modules for vision substitution. Neurocomput., 74(8):1182--1190, Mar. 2011.

Digital Library

[4]

C.-C. Chang and C.-J. Lin. Libsvm: A library for support vector machines. ACM Trans. Intell. Syst. Technol., 2(3):1--27, 2011.

Digital Library

[5]

B. Chapman et al. Using OpenMP: Portable Shared Memory Parallel Programming. MIT Press, 2007.

Digital Library

[6]

T. H. Cormen et al. Introduction to Algorithms. MIT Press, 2009. 3rd Edition.

Digital Library

[7]

G. Csurka et al. Visual categorization with bags of keypoints. In Work. on SLCV, ECCV, 2004.

[8]

R. Duda, P. Hart, and D. Stork. Pattern Classification. John Wiley and Sons, 2001.

Digital Library

[9]

M. Everingham et al. The PASCAL Visual Object Classes Challenge 2010 (VOC2010). http://www.pascal-network.org/.

[10]

P. F. Felzenszwalb et al. Object detection with discriminatively trained part based models. Trans. on Patt. Anal. and Mach. Intell., 32(9), 2010.

Digital Library

[11]

F. Glover and M. Laguna. Tabu Search. Kluwer Academic Publishers, 1997.

Digital Library

[12]

S. Gould et al. Decomposing a scene into geometric and semantically consistent regions. In Proc. of ICCV, pages 1--8, 2009.

[13]

R. Grompone et al. Lsd: A fast line segment detector. Trans. on PAMI, 32:722--732, 2010.

Digital Library

[14]

V. Kolmogorov and R. Zabih. What energy functions can be minimized via graph cuts? In Proc. of ECCV, pages 65--81, 2002.

Digital Library

[15]

F. Korc and W. Foerstner. Approximate parameter learning in conditional random fields: An empirical investigation. In Proceedings of DAGM symp.on PR, pages 11--20, 2008.

Digital Library

[16]

G. Kramer et al. Sonification report. Technical report, Int. Comm. for Auditory Display, 1999.

[17]

S. Kumar. Discriminative graphical models for context-based classification. volume 285 of Studies in Comp. Intell., pages 109--134. Springer, 2010.

[18]

S. Kumar and M. Hebert. Discriminative random fields: A discriminative framework for contextual interaction in classification. In Int. Conf. on Comp. Vision, pages 1150--1157, 2003.

Digital Library

[19]

S. Kumar and M. Hebert. Man-made structure detection in natural images using a causal multiscale random field. In Int. Conf. on CVPR, 2003.

Digital Library

[20]

J. D. Lafferty et al. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Int. Conf. on Mach. Learn., 2001.

Digital Library

[21]

R. Laganière. OpenCV 2 Computer Vision Application Programming Cookbook. Packt Publishing, 2011.

[22]

C. Lee et al. Segmenting brain tumor with conditional random fields and support vector machines. In Work. on Comp. Vision for Biomed. I. Appl. at ICCV, 2005.

Digital Library

[23]

C. Lee, R. Greiner, and M. Schmidt. Support vector random fields for spatial classification. In Proc. of PDMKD, pages 121--132, 2005.

Digital Library

[24]

P. McCullagh and J. A. Nelder. Generalized linear models. Chapman and Hall, 1983.

[25]

P. B. Meijer. An experimental system for auditory image representations. Trans. on Bio-Medical Engineering, 39(2):112--121, 1992.

[26]

M. Nixon and A. Aguado. Feature Extraction and Image Processing. Academic Press, 2007.

Digital Library

[27]

W. Press et al. Numerical Recipes in C. Cambridge University Press, 1992.

Digital Library

[28]

B. Schoelkopf and A. Smola. Learning with Kernels. MIT Press, 2002.

[29]

J. Shawe-Taylor and N. Cristianini. Kernel Methods for Pattern Analysis. Cambridge Univ. Press, 2004.

Digital Library

[30]

J. Shotton et al. Textonboost for image understanding. Int. Journal on Comp. Vision, 81(1):2--23, 2009.

Digital Library

[31]

S. Shoval et al. Auditory guidance with the navbelt. In Transactions on Systems, Man, and Cybernetics, 1998.

[32]

C. Tomasi and R. Manduchi. Bilateral filtering for gray and color images. In Proc. of ICCV, 1998.

Digital Library

[33]

K. van den Doel. Soundview: Sensing color images by kinesthetic audio. pages 303--306, 2003.

[34]

T.-F. Wu et al. Probability estimates for multi-class classification by pairwise coupling. Journal of Mach. Learn. Research, 5:975--1005, 2004.

Digital Library

[35]

J. Xu et al. An outdoor navigation aid system for the visually impaired. In Int. Conf. on IEEM, 2010.

[36]

T. Yoshida et al. Edgesonic: Image feature sonification for the visually impaired. In Augmented Human, 2011.

Digital Library

[37]

H. Zhou and D. Suter. Fast sparse gaussian processes learning for man-made structure classification. In Online Learning for Classification Workshop, 2007.

[38]

M. Zhou and H. Wei. Face verification using gaborwavelets and adaboost. In Proc. of ICPR, 2006.

Digital Library

Cited By

Bine LCosta YAylon L(2018)Automata Classification with Convolutional Neural Networks for Use in Assistive Technologies for the Visually ImpairedProceedings of the 11th PErvasive Technologies Related to Assistive Environments Conference10.1145/3197768.3201529(157-164)Online publication date: 26-Jun-2018
https://dl.acm.org/doi/10.1145/3197768.3201529
Banf MRhee S(2017)Enhancing gene regulatory network inference through data integration with markov random fieldsScientific Reports10.1038/srep411747:1Online publication date: 1-Feb-2017
https://doi.org/10.1038/srep41174
XU GWANG ZCHENG YTIAN YZHANG C(2017)A man-made object detection algorithm based on contour complexity evaluationChinese Journal of Aeronautics10.1016/j.cja.2017.09.00130:6(1931-1940)Online publication date: Dec-2017
https://doi.org/10.1016/j.cja.2017.09.001
Show More Cited By

Index Terms

Man made structure detection and verification of object recognition in images for the visually impaired
1. Computing methodologies
  1. Artificial intelligence
    1. Philosophical/theoretical foundations of artificial intelligence
      1. Cognitive science
2. Human-centered computing
  1. Human computer interaction (HCI)
    1. Empirical studies in HCI

Recommendations

Computer vision-based object recognition for the visually impaired in an indoors environment: a survey

Though several electronic assistive devices have been developed for the visually impaired in the past few decades, however, relatively few solutions have been devised to aid them in recognizing generic objects in their environment, particularly indoors. ...
The evaluation of visually impaired people's ability of defining the object location on touch-screen
ASSETS '10: Proceedings of the 12th international ACM SIGACCESS conference on Computers and accessibility

Touch-screen has been used not only on home appliances but also on so many kinds of machines in public facilities at the present time. However the fact is that most of visually impaired people have a problem with the difficult usability of a touch-...
Transfer Learning based Computer Vision Technology for Assisting Visually Impaired for detection of Common Places
APPIS 2020: Proceedings of the 3rd International Conference on Applications of Intelligent Systems

Visually impaired people face several problems in their daily life. One of the biggest problems is to visiting unfamiliar places and identifying public places like pharmacy store, restrooms, pedestrian signs on roads, etc. Although there are some ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

MIRAGE '13: Proceedings of the 6th International Conference on Computer Vision / Computer Graphics Collaboration Techniques and Applications

June 2013

137 pages

ISBN:9781450320238

DOI:10.1145/2466715

Conference Chairs:
Peter Eisert
Fraunhofer HHI / Humboldt-Universitát zu Berlin
,
André Gagalowicz
Inria Paris - Rocquencourt, France

Copyright © 2013 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Humboldt Univ.: Humboldt-Universität zu Berlin
FHHI: Fraunhofer Heinrich Hertz Institute

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 June 2013

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

MIRAGE '13

Sponsor:

Humboldt Univ.
FHHI

MIRAGE '13: 6th International Conference on Computer Vision / Computer Graphics Collaboration Techniques and Applications

June 6 - 7, 2013

Berlin, Germany

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

4
Total Citations
View Citations
157
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 05 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Bine LCosta YAylon L(2018)Automata Classification with Convolutional Neural Networks for Use in Assistive Technologies for the Visually ImpairedProceedings of the 11th PErvasive Technologies Related to Assistive Environments Conference10.1145/3197768.3201529(157-164)Online publication date: 26-Jun-2018
https://dl.acm.org/doi/10.1145/3197768.3201529
Banf MRhee S(2017)Enhancing gene regulatory network inference through data integration with markov random fieldsScientific Reports10.1038/srep411747:1Online publication date: 1-Feb-2017
https://doi.org/10.1038/srep41174
XU GWANG ZCHENG YTIAN YZHANG C(2017)A man-made object detection algorithm based on contour complexity evaluationChinese Journal of Aeronautics10.1016/j.cja.2017.09.00130:6(1931-1940)Online publication date: Dec-2017
https://doi.org/10.1016/j.cja.2017.09.001
Banf MMikalay RWatzke BBlanz V(2016)PictureSensation – a mobile application to help the blind explore the visual world through touch and soundJournal of Rehabilitation and Assistive Technologies Engineering10.1177/20556683166745823Online publication date: 27-Oct-2016
https://doi.org/10.1177/2055668316674582
Salamon NJacques JMusse S(2014)Seeing the Movement through SoundProceedings of the 2014 Brazilian Symposium on Computer Games and Digital Entertainment10.1109/SBGAMES.2014.12(165-172)Online publication date: 12-Nov-2014
https://dl.acm.org/doi/10.1109/SBGAMES.2014.12

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten