[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2466715.2466732acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmirageConference Proceedingsconference-collections
research-article

Man made structure detection and verification of object recognition in images for the visually impaired

Published: 06 June 2013 Publication History

Abstract

This paper presents two learning based algorithms that are designed for the purpose of extracting and processing suitable information in images for the visually impaired. Both algorithms are developed to be used within a specific modular sonification system. This system is designed to allow visually impaired people to explore images, actively on a touch screen, and to receive an auditory response about the image content at any current finger position. The first algorithm presented in this paper therefore addresses the problem of labeling regions within images, incorporating spatial dependencies. The second algorithm strives to alleviate the rejection of false object detections before sonification. This is crucial to avoid confusion on the side of the blind user, who can not check for a correct image labeling or object detection visually. Due to the modular design principle of the modular sonification system, both algorithms can be incorporated easily and efficiently.

References

[1]
M. Banf and V. Blanz. A modular computer vision sonification model for the visually impaired. In 18th Int. Con. on Auditory Display, 2012.
[2]
M. Banf and V. Blanz. Sonification of images for the visually impaired using a multi-level approach. Augm. Human Int. Conf. in coop. with ACM SIGCHI, 2013.
[3]
G. Bologna et al. Toward local and global perception modules for vision substitution. Neurocomput., 74(8):1182--1190, Mar. 2011.
[4]
C.-C. Chang and C.-J. Lin. Libsvm: A library for support vector machines. ACM Trans. Intell. Syst. Technol., 2(3):1--27, 2011.
[5]
B. Chapman et al. Using OpenMP: Portable Shared Memory Parallel Programming. MIT Press, 2007.
[6]
T. H. Cormen et al. Introduction to Algorithms. MIT Press, 2009. 3rd Edition.
[7]
G. Csurka et al. Visual categorization with bags of keypoints. In Work. on SLCV, ECCV, 2004.
[8]
R. Duda, P. Hart, and D. Stork. Pattern Classification. John Wiley and Sons, 2001.
[9]
M. Everingham et al. The PASCAL Visual Object Classes Challenge 2010 (VOC2010). http://www.pascal-network.org/.
[10]
P. F. Felzenszwalb et al. Object detection with discriminatively trained part based models. Trans. on Patt. Anal. and Mach. Intell., 32(9), 2010.
[11]
F. Glover and M. Laguna. Tabu Search. Kluwer Academic Publishers, 1997.
[12]
S. Gould et al. Decomposing a scene into geometric and semantically consistent regions. In Proc. of ICCV, pages 1--8, 2009.
[13]
R. Grompone et al. Lsd: A fast line segment detector. Trans. on PAMI, 32:722--732, 2010.
[14]
V. Kolmogorov and R. Zabih. What energy functions can be minimized via graph cuts? In Proc. of ECCV, pages 65--81, 2002.
[15]
F. Korc and W. Foerstner. Approximate parameter learning in conditional random fields: An empirical investigation. In Proceedings of DAGM symp.on PR, pages 11--20, 2008.
[16]
G. Kramer et al. Sonification report. Technical report, Int. Comm. for Auditory Display, 1999.
[17]
S. Kumar. Discriminative graphical models for context-based classification. volume 285 of Studies in Comp. Intell., pages 109--134. Springer, 2010.
[18]
S. Kumar and M. Hebert. Discriminative random fields: A discriminative framework for contextual interaction in classification. In Int. Conf. on Comp. Vision, pages 1150--1157, 2003.
[19]
S. Kumar and M. Hebert. Man-made structure detection in natural images using a causal multiscale random field. In Int. Conf. on CVPR, 2003.
[20]
J. D. Lafferty et al. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Int. Conf. on Mach. Learn., 2001.
[21]
R. Laganière. OpenCV 2 Computer Vision Application Programming Cookbook. Packt Publishing, 2011.
[22]
C. Lee et al. Segmenting brain tumor with conditional random fields and support vector machines. In Work. on Comp. Vision for Biomed. I. Appl. at ICCV, 2005.
[23]
C. Lee, R. Greiner, and M. Schmidt. Support vector random fields for spatial classification. In Proc. of PDMKD, pages 121--132, 2005.
[24]
P. McCullagh and J. A. Nelder. Generalized linear models. Chapman and Hall, 1983.
[25]
P. B. Meijer. An experimental system for auditory image representations. Trans. on Bio-Medical Engineering, 39(2):112--121, 1992.
[26]
M. Nixon and A. Aguado. Feature Extraction and Image Processing. Academic Press, 2007.
[27]
W. Press et al. Numerical Recipes in C. Cambridge University Press, 1992.
[28]
B. Schoelkopf and A. Smola. Learning with Kernels. MIT Press, 2002.
[29]
J. Shawe-Taylor and N. Cristianini. Kernel Methods for Pattern Analysis. Cambridge Univ. Press, 2004.
[30]
J. Shotton et al. Textonboost for image understanding. Int. Journal on Comp. Vision, 81(1):2--23, 2009.
[31]
S. Shoval et al. Auditory guidance with the navbelt. In Transactions on Systems, Man, and Cybernetics, 1998.
[32]
C. Tomasi and R. Manduchi. Bilateral filtering for gray and color images. In Proc. of ICCV, 1998.
[33]
K. van den Doel. Soundview: Sensing color images by kinesthetic audio. pages 303--306, 2003.
[34]
T.-F. Wu et al. Probability estimates for multi-class classification by pairwise coupling. Journal of Mach. Learn. Research, 5:975--1005, 2004.
[35]
J. Xu et al. An outdoor navigation aid system for the visually impaired. In Int. Conf. on IEEM, 2010.
[36]
T. Yoshida et al. Edgesonic: Image feature sonification for the visually impaired. In Augmented Human, 2011.
[37]
H. Zhou and D. Suter. Fast sparse gaussian processes learning for man-made structure classification. In Online Learning for Classification Workshop, 2007.
[38]
M. Zhou and H. Wei. Face verification using gaborwavelets and adaboost. In Proc. of ICPR, 2006.

Cited By

View all
  • (2018)Automata Classification with Convolutional Neural Networks for Use in Assistive Technologies for the Visually ImpairedProceedings of the 11th PErvasive Technologies Related to Assistive Environments Conference10.1145/3197768.3201529(157-164)Online publication date: 26-Jun-2018
  • (2017)Enhancing gene regulatory network inference through data integration with markov random fieldsScientific Reports10.1038/srep411747:1Online publication date: 1-Feb-2017
  • (2017)A man-made object detection algorithm based on contour complexity evaluationChinese Journal of Aeronautics10.1016/j.cja.2017.09.00130:6(1931-1940)Online publication date: Dec-2017
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
MIRAGE '13: Proceedings of the 6th International Conference on Computer Vision / Computer Graphics Collaboration Techniques and Applications
June 2013
137 pages
ISBN:9781450320238
DOI:10.1145/2466715
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • Humboldt Univ.: Humboldt-Universität zu Berlin
  • FHHI: Fraunhofer Heinrich Hertz Institute

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 June 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. computer vision
  2. dual support vector fields
  3. graphical models
  4. labeling
  5. object recognition
  6. visually impaired

Qualifiers

  • Research-article

Conference

MIRAGE '13
Sponsor:
  • Humboldt Univ.
  • FHHI

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 14 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Automata Classification with Convolutional Neural Networks for Use in Assistive Technologies for the Visually ImpairedProceedings of the 11th PErvasive Technologies Related to Assistive Environments Conference10.1145/3197768.3201529(157-164)Online publication date: 26-Jun-2018
  • (2017)Enhancing gene regulatory network inference through data integration with markov random fieldsScientific Reports10.1038/srep411747:1Online publication date: 1-Feb-2017
  • (2017)A man-made object detection algorithm based on contour complexity evaluationChinese Journal of Aeronautics10.1016/j.cja.2017.09.00130:6(1931-1940)Online publication date: Dec-2017
  • (2016)PictureSensation – a mobile application to help the blind explore the visual world through touch and soundJournal of Rehabilitation and Assistive Technologies Engineering10.1177/20556683166745823Online publication date: 27-Oct-2016
  • (2014)Seeing the Movement through SoundProceedings of the 2014 Brazilian Symposium on Computer Games and Digital Entertainment10.1109/SBGAMES.2014.12(165-172)Online publication date: 12-Nov-2014

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media