[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

3D Convolutional Neural Networks for Human Action Recognition

Published: 01 January 2013 Publication History

Abstract

We consider the automated recognition of human actions in surveillance videos. Most current methods build classifiers based on complex handcrafted features computed from the raw inputs. Convolutional neural networks (CNNs) are a type of deep model that can act directly on the raw inputs. However, such models are currently limited to handling 2D inputs. In this paper, we develop a novel 3D CNN model for action recognition. This model extracts features from both the spatial and the temporal dimensions by performing 3D convolutions, thereby capturing the motion information encoded in multiple adjacent frames. The developed model generates multiple channels of information from the input frames, and the final feature representation combines information from all channels. To further boost the performance, we propose regularizing the outputs with high-level features and combining the predictions of a variety of different models. We apply the developed models to recognize human actions in the real-world environment of airport surveillance videos, and they achieve superior performance in comparison to baseline methods.

Cited By

View all
  • (2024)Human action recognition with transformer based on convolutional featuresIntelligent Decision Technologies10.3233/IDT-24015918:2(881-896)Online publication date: 1-Jan-2024
  • (2024)How to Improve Video Analytics with Action Recognition: A SurveyACM Computing Surveys10.1145/367901157:1(1-36)Online publication date: 7-Oct-2024
  • (2024)Video Unsupervised Domain Adaptation with Deep Learning: A Comprehensive SurveyACM Computing Surveys10.1145/367901056:12(1-36)Online publication date: 22-Jul-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image IEEE Transactions on Pattern Analysis and Machine Intelligence
IEEE Transactions on Pattern Analysis and Machine Intelligence  Volume 35, Issue 1
January 2013
256 pages

Publisher

IEEE Computer Society

United States

Publication History

Published: 01 January 2013

Author Tags

  1. 3D convolution
  2. Deep learning
  3. action recognition
  4. convolutional neural networks
  5. model combination

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Human action recognition with transformer based on convolutional featuresIntelligent Decision Technologies10.3233/IDT-24015918:2(881-896)Online publication date: 1-Jan-2024
  • (2024)How to Improve Video Analytics with Action Recognition: A SurveyACM Computing Surveys10.1145/367901157:1(1-36)Online publication date: 7-Oct-2024
  • (2024)Video Unsupervised Domain Adaptation with Deep Learning: A Comprehensive SurveyACM Computing Surveys10.1145/367901056:12(1-36)Online publication date: 22-Jul-2024
  • (2024)High-Performance 3D convolution on the Latest Generation Sunway ProcessorProceedings of the 53rd International Conference on Parallel Processing10.1145/3673038.3673093(241-251)Online publication date: 12-Aug-2024
  • (2024)Reduce Detection Latency of YOLOv5 to Prevent Real-Time Tracking Failures for Lightweight RobotsProceedings of the 15th Asia-Pacific Symposium on Internetware10.1145/3671016.3671392(437-446)Online publication date: 24-Jul-2024
  • (2024)Advancing Micro-Action Recognition with Multi-Auxiliary Heads and Hybrid Loss OptimizationProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3688975(11313-11319)Online publication date: 28-Oct-2024
  • (2024)Motion-aware Latent Diffusion Models for Video Frame InterpolationProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3680846(1043-1052)Online publication date: 28-Oct-2024
  • (2024)Continuous Authentication Leveraging Matrix ProfileProceedings of the 19th International Conference on Availability, Reliability and Security10.1145/3664476.3664481(1-13)Online publication date: 30-Jul-2024
  • (2024)Psychology-Guided Environment Aware Network for Discovering Social Interaction Groups from VideosACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365729520:8(1-23)Online publication date: 13-Jun-2024
  • (2024)Enhanced Video Super-Resolution Network towards Compressed DataACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365130920:7(1-21)Online publication date: 6-Mar-2024
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media