Wu et al., 2024 - Google Patents
Spatial–temporal hypergraph based on dual-stage attention network for multi-view data lightweight action recognitionWu et al., 2024
View PDF- Document ID
- 3352275844590982213
- Author
- Wu Z
- Ma N
- Wang C
- Xu C
- Xu G
- Li M
- Publication year
- Publication venue
- Pattern Recognition
External Links
Snippet
For the problems of irrelevant frames and high model complexity in action recognition, we propose a Spatial–Temporal Hypergraph based on Dual-Stage Attention Network (STHG- DAN) for multi-view data lightweight action recognition. It includes two stages: Temporal …
- 230000009471 action 0 title abstract description 125
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G06K9/4604—Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes, intersections
- G06K9/4609—Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes, intersections by matching or filtering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Si et al. | Skeleton-based action recognition with hierarchical spatial reasoning and temporal stack learning network | |
Liang et al. | Multi-modal interactive attention and dual progressive decoding network for RGB-D/T salient object detection | |
Qi et al. | Image-based action recognition using hint-enhanced deep neural networks | |
Tang et al. | DFFNet: An IoT-perceptive dual feature fusion network for general real-time semantic segmentation | |
Liu et al. | Symmetry-Driven hyper feature GCN for skeleton-based gait recognition | |
Chen et al. | Video saliency prediction using enhanced spatiotemporal alignment network | |
Wu et al. | Spatial–temporal hypergraph based on dual-stage attention network for multi-view data lightweight action recognition | |
Guan et al. | AFE-CNN: 3D skeleton-based action recognition with action feature enhancement | |
Li et al. | Multi-scale residual network model combined with Global Average Pooling for action recognition | |
Zhai et al. | FPANet: feature pyramid attention network for crowd counting | |
Jiang et al. | Contour-aware network for semantic segmentation via adaptive depth | |
Fang et al. | M2RNet: Multi-modal and multi-scale refined network for RGB-D salient object detection | |
Xu et al. | CCFNet: Cross-complementary fusion network for RGB-D scene parsing of clothing images | |
Hu et al. | Forward-reverse adaptive graph convolutional networks for skeleton-based action recognition | |
Xu et al. | Dual pyramid network for salient object detection | |
Xu et al. | SA-DPNet: Structure-aware dual pyramid network for salient object detection | |
Li et al. | Learning residual refinement network with semantic context representation for real-time saliency object detection | |
Wang et al. | Spatiotemporal module for video saliency prediction based on self-attention | |
Zhu et al. | RGB-D salient object detection via cross-modal joint feature extraction and low-bound fusion loss | |
Wang et al. | Tmf: Temporal motion and fusion for action recognition | |
Li et al. | Multi-granularity Cross Transformer Network for person re-identification | |
Wang et al. | GaitParsing: Human semantic parsing for gait recognition | |
Wu et al. | Joint Semantic Segmentation using representations of LiDAR point clouds and camera images | |
Deng et al. | Ternary symmetric fusion network for camouflaged object detection | |
Ma et al. | Multi-View Time-Series Hypergraph Neural Network for Action Recognition |