research-article

3D Convolutional Neural Networks for Human Action Recognition

Authors:

Shuiwang Ji,

Wei Xu,

Ming Yang,

Kai Yu

IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume 35, Issue 1

Pages 221 - 231

https://doi.org/10.1109/TPAMI.2012.59

Published: 01 January 2013

Abstract

We consider the automated recognition of human actions in surveillance videos. Most current methods build classifiers based on complex handcrafted features computed from the raw inputs. Convolutional neural networks (CNNs) are a type of deep model that can act directly on the raw inputs. However, such models are currently limited to handling 2D inputs. In this paper, we develop a novel 3D CNN model for action recognition. This model extracts features from both the spatial and the temporal dimensions by performing 3D convolutions, thereby capturing the motion information encoded in multiple adjacent frames. The developed model generates multiple channels of information from the input frames, and the final feature representation combines information from all channels. To further boost the performance, we propose regularizing the outputs with high-level features and combining the predictions of a variety of different models. We apply the developed models to recognize human actions in the real-world environment of airport surveillance videos, and they achieve superior performance in comparison to baseline methods.
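As a rough illustration of the 3D convolution idea described in the abstract, the following is a minimal NumPy sketch, not the authors' implementation. The function name `conv3d_valid`, the toy clip, and the two-frame temporal-difference kernel are all hypothetical choices made for this example; the abstract does not specify kernel sizes or input dimensions. The sketch only shows how a kernel that spans adjacent frames can respond to motion that a per-frame 2D kernel would not see.

```python
import numpy as np


def conv3d_valid(clip, kernel):
    """Naive 'valid' 3D cross-correlation.

    clip:   array of shape (T, H, W), a stack of grayscale frames
    kernel: array of shape (kt, kh, kw), spanning kt adjacent frames
    """
    T, H, W = clip.shape
    kt, kh, kw = kernel.shape
    out = np.zeros((T - kt + 1, H - kh + 1, W - kw + 1))
    for t in range(out.shape[0]):
        for y in range(out.shape[1]):
            for x in range(out.shape[2]):
                # Each output value mixes pixels from kt consecutive frames.
                patch = clip[t:t + kt, y:y + kh, x:x + kw]
                out[t, y, x] = np.sum(patch * kernel)
    return out


# Hypothetical toy clip: 7 frames of 8x8 pixels, one bright pixel moving right.
clip = np.zeros((7, 8, 8))
for t in range(7):
    clip[t, 4, t] = 1.0

# Assumed kernel for illustration: difference between two adjacent frames.
kernel = np.zeros((2, 1, 1))
kernel[0, 0, 0] = -1.0
kernel[1, 0, 0] = 1.0

features = conv3d_valid(clip, kernel)

# A static clip (the same frame repeated) produces no response at all.
static_clip = np.repeat(clip[:1], 7, axis=0)
static_features = conv3d_valid(static_clip, kernel)

print(features.shape)
print(np.abs(static_features).max())
```

With this kernel, the output is nonzero only where pixel intensity changes between consecutive frames, so the moving pixel yields a response while the static clip does not. A learned 3D CNN would use many such kernels with trained weights rather than a fixed difference filter.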




Information

Published In: IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume 35, Issue 1, January 2013, 256 pages

ISSN: 0162-8828

Publisher: IEEE Computer Society, United States

Qualifiers: Research-article

Bibliometrics

Total Citations: 1,140

Total Downloads: 0 (last 12 months: 0; last 6 weeks: 0), reflecting downloads up to 12 Dec 2024

