[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

RGB-D action recognition using linear coding

Published: 03 February 2015 Publication History

Abstract

In this paper, we investigate action recognition using an inexpensive RGB-D sensor (Microsoft Kinect). First, a depth spatial-temporal descriptor is developed to extract the interested local regions in depth image. Such descriptors are very robust to the illumination and background clutter. Then the intensity spatial-temporal descriptor and the depth spatial-temporal descriptor are combined and feeded into a linear coding framework to get an effective feature vector, which can be used for action classification. Finally, extensive experiments are conducted on a publicly available RGB-D action recognition dataset and the proposed method shows promising results.

References

[1]
B. Ni, G. Wang, M. Pierre, RGBD-HuDaAct: a color-depth video database for human daily activity recognition, in: IEEE International Conference on Computer Vision Workshop, IEEE, Barcelona, Spain 2011, pp. 1147-1153.
[2]
J. Sung, C. Ponce, B. Selman, A. Saxena, Unstructured human activity detection from RGBD images, in: IEEE International Conference on Robotics and Automation, IEEE, St. Paul, MN, USA 2012, pp. 842-849.
[3]
W. Li, Z. Zhang, Z. Liu, Action recognition based on a bag of 3d points, in: IEEE Conference on Computer Vision and Pattern Recognition Workshop, 2010.
[4]
L. Morency, A. Quattoni, T. Darrell, Latent-dynamic discriminative models for continuous gesture recognition, in: IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Minneapolis, USA 2007, pp. 1-8.
[5]
I. Laptev, On space-time interest points, Int. J. Comput. Vis., 64 (2005) 107-123.
[6]
N. Dalal, B. Triggs, Histograms of oriented gradients for human detection, in: IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, IEEE, San Diego, USA 2005, pp. 886-893.
[7]
N. Dalal, B. Triggs, C. Schmid, Human detection using oriented histograms of flow and appearance, in: European Conference on Computer Vision, Springer, Graz, Austria 2006, pp. 428-441.
[8]
A. Jain, M. Murty, P. Flynn, Data clustering, ACM Comput. Surv., 31 (1999) 264-323.
[9]
J. Wang, J. Yang, K. Yu, F. Lv, T. Huang, Y. Gong, Locality-constrained linear coding for image classification, in: IEEE Conference on Computer Vision and Pattern Recognition, IEEE, San Francisco, CA, USA 2010, pp. 3360-3367.
[10]
L. Fei-Fei, P. Perona, A Bayesian hierarchical model for learning natural scene categories, in: IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, IEEE, San Diego, USA 2005, pp. 524-531.
[11]
H. Wang, M. Ullah, A. Klaser, I. Laptev, C. Schmid, Evaluation of local spatio-temporal features for action recognition, in: British Machine Vision Conference, 2009.
[12]
S. Lazebnik, C. Schmid, J. Ponce, Beyond bags of features: spatial pyramid matching for recognizing natural scene categories, in: IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, IEEE, New York, NY, USA 2006, pp. 2169-2178.
[13]
Public STIPs Binaries, {http://www.di.ens.fr/~laptev/download.html}, 2005.
[14]
C. Chang, C. Lin, Libsvm, ACM Trans. Intell. Syst. Technol., 2 (2011) 27.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Neurocomputing
Neurocomputing  Volume 149, Issue PA
February 2015
472 pages

Publisher

Elsevier Science Publishers B. V.

Netherlands

Publication History

Published: 03 February 2015

Author Tags

  1. Action recognition
  2. Linear coding
  3. RGB-D

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 11 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2022)FT-HID: a large-scale RGB-D dataset for first- and third-person human interaction analysisNeural Computing and Applications10.1007/s00521-022-07826-w35:2(2007-2024)Online publication date: 7-Oct-2022
  • (2017)Combining depth-skeleton feature with sparse coding for action recognitionNeurocomputing10.1016/j.neucom.2016.12.041230:C(417-426)Online publication date: 22-Mar-2017
  • (2017)Semi-supervised convex nonnegative matrix factorizations with graph regularized for image representationNeurocomputing10.1016/j.neucom.2016.04.028237:C(1-11)Online publication date: 10-May-2017
  • (2017)The spatial Laplacian and temporal energy pyramid representation for human action recognition using depth sequencesKnowledge-Based Systems10.1016/j.knosys.2017.01.035122:C(64-74)Online publication date: 15-Apr-2017
  • (2016)A novel hierarchical Bag-of-Words model for compact action representationNeurocomputing10.1016/j.neucom.2015.09.074174:PB(722-732)Online publication date: 22-Jan-2016
  • (2016)Recent trends in gesture recognitionImage and Vision Computing10.1016/j.imavis.2016.05.00752:C(56-72)Online publication date: 1-Aug-2016

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media