Yang et al., 2023 - Google Patents
Transformer-based deep learning model and video dataset for unsafe action identification in construction projectsYang et al., 2023
View PDF- Document ID
- 18063609762129618786
- Author
- Yang M
- Wu C
- Guo Y
- Jiang R
- Zhou F
- Zhang J
- Yang Z
- Publication year
- Publication venue
- Automation in Construction
External Links
Snippet
A large proportion of construction accidents are caused by unintentional and unsafe actions and behaviors. It is of significant difficulties and ineffectiveness to monitor unsafe behaviors using conventional manual supervision due to the complex and dynamic working conditions …
- 238000010276 construction 0 title abstract description 101
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G06K9/00771—Recognising scenes under surveillance, e.g. with Markovian modelling of scene activity
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6232—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods
- G06K9/6247—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods based on an approximation criterion, e.g. principal component analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/011—Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00335—Recognising movements or behaviour, e.g. recognition of gestures, dynamic facial expressions; Lip-reading
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00362—Recognising human body or animal bodies, e.g. vehicle occupant, pedestrian; Recognising body parts, e.g. hand
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Yang et al. | Transformer-based deep learning model and video dataset for unsafe action identification in construction projects | |
Wei et al. | Recognizing people’s identity in construction sites with computer vision: A spatial and temporal attention pooling network | |
Irfanullah et al. | Real time violence detection in surveillance videos using Convolutional Neural Networks | |
Manaf et al. | Computer vision-based survey on human activity recognition system, challenges and applications | |
Chang et al. | A pose estimation-based fall detection methodology using artificial intelligence edge computing | |
CN114241379B (en) | Passenger abnormal behavior identification method, device, equipment and passenger monitoring system | |
Wang et al. | Vision-based hand signal recognition in construction: A feasibility study | |
Chen et al. | Vision-based skeleton motion phase to evaluate working behavior: case study of ladder climbing safety | |
Lima et al. | Human action recognition with 3D convolutional neural network | |
Chen et al. | Multi-modality gesture detection and recognition with un-supervision, randomization and discrimination | |
Wang et al. | Multi-sensor fusion based industrial action recognition method under the environment of intelligent manufacturing | |
Liu et al. | Human activity recognition through deep learning: Leveraging unique and common feature fusion in wearable multi-sensor systems | |
Aftab et al. | A boosting framework for human posture recognition using spatio-temporal features along with radon transform | |
Wang et al. | Intelligent design and optimization of exercise equipment based on fusion algorithm of yolov5-resnet 50 | |
Peng et al. | Emerging techniques in vision-based human posture detection: Machine learning methods and applications | |
Huang et al. | Spatial relationship-aware rapid entire body fuzzy assessment method for prevention of work-related musculoskeletal disorders | |
Itano et al. | Human actions recognition in video scenes from multiple camera viewpoints | |
Sha et al. | An improved two-stream CNN method for abnormal behavior detection | |
Chen et al. | Long term hand tracking with proposal selection | |
Zhu et al. | Human risky behaviour recognition during ladder climbing based on multi-modal feature fusion and adaptive graph convolutional network | |
Ramanathan et al. | Combining pose-invariant kinematic features and object context features for rgb-d action recognition | |
Liu et al. | Development of a fatigue detection and early warning system for crane operators: A preliminary study | |
Wang et al. | Context‐aware hand gesture interaction for human–robot collaboration in construction | |
Duth et al. | Human Activity Detection Using Pose Net | |
Yang et al. | Video quality evaluation toward complicated sport activities for clustering analysis |