[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2674396.2674432acmotherconferencesArticle/Chapter ViewAbstractPublication PagespetraConference Proceedingsconference-collections
research-article

A supervised learning approach for fast object recognition from RGB-D data

Published: 27 May 2014 Publication History

Abstract

Object recognition serves obvious purposes in assisted living environments, where robotic devices can be used as companions to assist humans in need. The recent introduction of vision based sensors, which are able to extract depth sensing information about the environment, in addition to the traditional RGB video, presents new opportunities and challenges for more accurate object recognition.
The current work, presents an object recognition approach that uses RGB-D point cloud data and a novel feature extraction methodology, in combination with well-known supervised learning algorithms, to achieve accurate, real-time recognition of a large number of objects. In our experiments, we use a dataset of household objects organized into 51 categories, and evaluate the recognition accuracy and time efficiency of a set of different supervised learning methods.

References

[1]
Belongie, S., Malik, J., and Puzicha, J. Shape matching and object recognition using shape contexts. Pattern Analysis and Machine Intelligence, IEEE Transactions on 24, 4 (2002), 509--522.
[2]
Bo, L., Ren, X., and Fox, D. Unsupervised feature learning for rgb-d based object recognition. In 13th International Symposium on Experimental Robotics (ISER) (2012).
[3]
Bo, L., Ren, X., and Fox, D. Unsupervised feature learning for rgb-d based object recognition. In Experimental Robotics (2013), Springer, pp. 387--402.
[4]
Chang, C.-C., and Lin, C.-J. Libsvm: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology (TIST) 2, 3 (2011), 27.
[5]
Cignoni, P., Corsini, M., and Ranzuglia, G. Meshlab: an open-source 3d mesh processing system. Ercim news 73 (2008), 45--46.
[6]
Fischler, M. A., and Bolles, R. C. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM 24, 6 (1981), 381--395.
[7]
Khoshelham, K., and Elberink, S. O. Accuracy and resolution of kinect depth data for indoor mapping applications. Sensors 12, 2 (2012), 1437--1454.
[8]
Lai, K., Bo, L., Ren, X., and Fox, D. A large-scale hierarchical multi-view rgb-d object dataset. In IEEE International Conference on Robotics and Automation (ICRA) (2011).
[9]
Lai, K., Bo, L., Ren, X., and Fox, D. Detection-based object labeling in 3d scenes. In IEEE International Conference on Robotics and Automation (ICRA) (2012).
[10]
Lowe, D. G. Object recognition from local scale-invariant features. In Computer vision, 1999. The proceedings of the seventh IEEE international conference on (1999), vol. 2, Ieee, pp. 1150--1157.
[11]
McMurrough, C., Rich, J., Conly, C., Athitsos, V., and Makedon, F. Multi-modal object of interest detection using eye gaze and rgb-d cameras. In Proceedings of the 4th Workshop on Eye Gaze in Intelligent Human Machine Interaction (2012), ACM, p. 2.
[12]
McMurrough, C., Rich, J., Metsis, V., Nguyen, A., and Makedon, F. Low-cost head position tracking for gaze point estimation. In Proceedings of the 5th International Conference on PErvasive Technologies Related to Assistive Environments (PETRA) (2012).
[13]
McMurrough, C. D., Metsis, V., Rich, J., and Makedon, F. An eye tracking dataset for point of gaze detection. In Proceedings of the Symposium on Eye Tracking Research and Applications (ETRA) (2012).
[14]
Russell, S., and Norvig, P. Artificial Intelligence: A Modern Approach. Pearson Education, Inc., 2010.
[15]
Rusu, R. B., Marton, Z. C., Blodow, N., Dolha, M., and Beetz, M. Towards 3d point cloud based object maps for household environments. Robotics and Autonomous Systems 56 (2008).
[16]
Shapire, R. E., and Freund, Y. Boosting: Foundations and Algorithms. Massachusetts Institute of Technology, 2012.
[17]
Shi, L., Kodagoda, S., and Ranasinghe, R. Fast indoor classification using 3d point clouds. In Proceedings of the Australasian Conference on Robotics and Automation (ACRA) (2011).
[18]
Sural, S., Qian, G., and Pramanik, S. Segmentation and histogram generation using the hsv color space for image retrieval. In Image Processing. 2002. Proceedings. 2002 International Conference on (2002), vol. 2, IEEE, pp. II--589.
[19]
Van De Weijer, J., and Schmid, C. Coloring local feature extraction. In Computer Vision--ECCV 2006. Springer, 2006, pp. 334--348.

Cited By

View all
  • (2024)A Review of RGB-D Image Classification MethodsAdvances in Data-Driven Computing and Intelligent Systems10.1007/978-981-99-9531-8_2(9-22)Online publication date: 11-Apr-2024
  • (2019)RGB-D-Based Object Recognition Using Multimodal Convolutional Neural Networks: A SurveyIEEE Access10.1109/ACCESS.2019.29070717(43110-43136)Online publication date: 2019

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
PETRA '14: Proceedings of the 7th International Conference on PErvasive Technologies Related to Assistive Environments
May 2014
408 pages
ISBN:9781450327466
DOI:10.1145/2674396
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • iPerform Center: iPerform Center for Assistive Technologies to Enhance Human Performance
  • CSE@UTA: Department of Computer Science and Engineering, The University of Texas at Arlington
  • HERACLEIA: HERACLEIA Human-Centered Computing Laboratory at UTA
  • U of Tex at Arlington: U of Tex at Arlington
  • NCRS: Demokritos National Center for Scientific Research
  • Fulbrigh, Greece: Fulbright Foundation, Greece

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 May 2014

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. RGB-D
  2. adaboost
  3. artificial neural network
  4. classification
  5. object recognition
  6. point cloud
  7. supervised learning
  8. support vector machine

Qualifiers

  • Research-article

Funding Sources

Conference

PETRA '14
Sponsor:
  • iPerform Center
  • CSE@UTA
  • HERACLEIA
  • U of Tex at Arlington
  • NCRS
  • Fulbrigh, Greece

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 11 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)A Review of RGB-D Image Classification MethodsAdvances in Data-Driven Computing and Intelligent Systems10.1007/978-981-99-9531-8_2(9-22)Online publication date: 11-Apr-2024
  • (2019)RGB-D-Based Object Recognition Using Multimodal Convolutional Neural Networks: A SurveyIEEE Access10.1109/ACCESS.2019.29070717(43110-43136)Online publication date: 2019

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media