[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

Wu et al., 2024 - Google Patents

Spatial–temporal hypergraph based on dual-stage attention network for multi-view data lightweight action recognition

Wu et al., 2024

View PDF
Document ID
3352275844590982213
Author
Wu Z
Ma N
Wang C
Xu C
Xu G
Li M
Publication year
Publication venue
Pattern Recognition

External Links

Snippet

For the problems of irrelevant frames and high model complexity in action recognition, we propose a Spatial–Temporal Hypergraph based on Dual-Stage Attention Network (STHG- DAN) for multi-view data lightweight action recognition. It includes two stages: Temporal …
Continue reading at papers.ssrn.com (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30781Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F17/30784Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
    • G06F17/30799Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46Extraction of features or characteristics of the image
    • G06K9/4604Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes, intersections
    • G06K9/4609Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes, intersections by matching or filtering
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30861Retrieval from the Internet, e.g. browsers
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6217Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10024Color image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00221Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F19/00Digital computing or data processing equipment or methods, specially adapted for specific applications
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models

Similar Documents

Publication Publication Date Title
Si et al. Skeleton-based action recognition with hierarchical spatial reasoning and temporal stack learning network
Liang et al. Multi-modal interactive attention and dual progressive decoding network for RGB-D/T salient object detection
Qi et al. Image-based action recognition using hint-enhanced deep neural networks
Tang et al. DFFNet: An IoT-perceptive dual feature fusion network for general real-time semantic segmentation
Liu et al. Symmetry-Driven hyper feature GCN for skeleton-based gait recognition
Chen et al. Video saliency prediction using enhanced spatiotemporal alignment network
Wu et al. Spatial–temporal hypergraph based on dual-stage attention network for multi-view data lightweight action recognition
Guan et al. AFE-CNN: 3D skeleton-based action recognition with action feature enhancement
Li et al. Multi-scale residual network model combined with Global Average Pooling for action recognition
Zhai et al. FPANet: feature pyramid attention network for crowd counting
Jiang et al. Contour-aware network for semantic segmentation via adaptive depth
Fang et al. M2RNet: Multi-modal and multi-scale refined network for RGB-D salient object detection
Xu et al. CCFNet: Cross-complementary fusion network for RGB-D scene parsing of clothing images
Hu et al. Forward-reverse adaptive graph convolutional networks for skeleton-based action recognition
Xu et al. Dual pyramid network for salient object detection
Xu et al. SA-DPNet: Structure-aware dual pyramid network for salient object detection
Li et al. Learning residual refinement network with semantic context representation for real-time saliency object detection
Wang et al. Spatiotemporal module for video saliency prediction based on self-attention
Zhu et al. RGB-D salient object detection via cross-modal joint feature extraction and low-bound fusion loss
Wang et al. Tmf: Temporal motion and fusion for action recognition
Li et al. Multi-granularity Cross Transformer Network for person re-identification
Wang et al. GaitParsing: Human semantic parsing for gait recognition
Wu et al. Joint Semantic Segmentation using representations of LiDAR point clouds and camera images
Deng et al. Ternary symmetric fusion network for camouflaged object detection
Ma et al. Multi-View Time-Series Hypergraph Neural Network for Action Recognition