[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1109/CVPR.2011.5995470guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Joint segmentation and classification of human actions in video

Published: 20 June 2011 Publication History

Abstract

Automatic video segmentation and action recognition has been a long-standing problem in computer vision. Much work in the literature treats video segmentation and action recognition as two independent problems; while segmentation is often done without a temporal model of the activity, action recognition is usually performed on pre-segmented clips. In this paper we propose a novel method that avoids the limitations of the above approaches by jointly performing video segmentation and action recognition. Unlike standard approaches based on extensions of dynamic Bayesian networks, our method is based on a discriminative temporal extension of the spatial bag-of-words model that has been very popular in object recognition. The classification is performed robustly within a multi-class SVM framework whereas the inference over the segments is done efficiently with dynamic programming. Experimental results on honeybee, Weizmann, and Hollywood datasets illustrate the benefits of our approach compared to state-of-the-art methods.

Cited By

View all
  • (2019)HILCACM Transactions on Interactive Intelligent Systems10.1145/32345089:2-3(1-27)Online publication date: 18-Mar-2019
  • (2019)Soft video parsing by label distribution learningFrontiers of Computer Science: Selected Publications from Chinese Universities10.1007/s11704-018-8015-y13:2(302-317)Online publication date: 1-Apr-2019
  • (2018)Sequence-to-segments networks for segment detectionProceedings of the 32nd International Conference on Neural Information Processing Systems10.5555/3327144.3327269(3511-3520)Online publication date: 3-Dec-2018
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
CVPR '11: Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
June 2011
3558 pages
ISBN:9781457703942

Publisher

IEEE Computer Society

United States

Publication History

Published: 20 June 2011

Author Tags

  1. Hollywood dataset
  2. Weizmann dataset
  3. action recognition
  4. automatic video segmentation
  5. computer vision
  6. discriminative temporal extension
  7. dynamic Bayesian networks
  8. dynamic programming
  9. honeybee dataset
  10. human action classification
  11. human action segmentation
  12. inference
  13. multiclass SVM framework
  14. object recognition
  15. spatial bag-of-words model

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 21 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2019)HILCACM Transactions on Interactive Intelligent Systems10.1145/32345089:2-3(1-27)Online publication date: 18-Mar-2019
  • (2019)Soft video parsing by label distribution learningFrontiers of Computer Science: Selected Publications from Chinese Universities10.1007/s11704-018-8015-y13:2(302-317)Online publication date: 1-Apr-2019
  • (2018)Sequence-to-segments networks for segment detectionProceedings of the 32nd International Conference on Neural Information Processing Systems10.5555/3327144.3327269(3511-3520)Online publication date: 3-Dec-2018
  • (2018)Leveraging information from imperfect examplesProceedings of the 11th Indian Conference on Computer Vision, Graphics and Image Processing10.1145/3293353.3293416(1-8)Online publication date: 18-Dec-2018
  • (2018)A discriminative structural model for joint segmentation and recognition of human actionsMultimedia Tools and Applications10.1007/s11042-018-6189-977:24(31627-31645)Online publication date: 1-Dec-2018
  • (2018)Action recognition based on hierarchical dynamic Bayesian networkMultimedia Tools and Applications10.1007/s11042-017-4614-077:6(6955-6968)Online publication date: 1-Mar-2018
  • (2017)Soft video parsing by label distribution learningProceedings of the Thirty-First AAAI Conference on Artificial Intelligence10.5555/3298239.3298434(1331-1337)Online publication date: 4-Feb-2017
  • (2017)Real-time Action Recognition Based on Key Frame DetectionProceedings of the 9th International Conference on Machine Learning and Computing10.1145/3055635.3056569(272-277)Online publication date: 24-Feb-2017
  • (2017)Help, It Looks ConfusingProceedings of the 22nd International Conference on Intelligent User Interfaces10.1145/3025171.3025176(233-243)Online publication date: 7-Mar-2017
  • (2017)Efficient Unsupervised Temporal Segmentation of Motion DataIEEE Transactions on Multimedia10.1109/TMM.2016.263503019:4(797-812)Online publication date: 1-Apr-2017
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media