One-shot imitation drone filming of human motion videos
Huang et al., 2021
- Document ID
- 5055989491863828098
- Author
- Huang C
- Dang Y
- Chen P
- Yang X
- Cheng K
- Publication year
- 2021
- Publication venue
- IEEE Transactions on Pattern Analysis and Machine Intelligence
Snippet
Imitation learning has recently been applied to mimic the operation of a cameraman in existing autonomous camera systems. To imitate a certain demonstration video, existing methods require users to collect a significant number of training videos with a similar filming …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/017—Gesture based interaction, e.g. based on a set of recognized hand gestures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment; Cameras comprising an electronic image sensor, e.g. digital cameras, video cameras, TV cameras, camcorders, webcams, camera modules for embedding in other devices, e.g. mobile phones, computers or vehicles
- H04N5/225—Television cameras; Cameras comprising an electronic image sensor, e.g. digital cameras, video cameras, camcorders, webcams, camera modules for embedding in other devices, e.g. mobile phones, computers or vehicles
- H04N5/232—Devices for controlling television cameras, e.g. remote control; Control of cameras comprising an electronic image sensor, e.g. digital cameras, video cameras, TV cameras, camcorders, webcams, camera modules for embedding in other devices, e.g. mobile phones, computers or vehicles
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Luo et al. | End-to-end active object tracking and its real-world deployment via reinforcement learning | |
Hu et al. | Deep 360 pilot: Learning a deep agent for piloting through 360 sports videos | |
Huang et al. | Learning to film from professional human motion videos | |
Huang et al. | One-shot imitation drone filming of human motion videos | |
Yu et al. | A deep ranking model for spatio-temporal highlight detection from a 360° video | |
Huang et al. | Learning to capture a film-look video with a camera drone | |
Zhu et al. | SUES-200: A multi-height multi-scene cross-view image benchmark across drone and satellite | |
Gärtner et al. | Deep reinforcement learning for active human pose estimation | |
CN112132866A (en) | Target object tracking method, device and equipment and computer readable storage medium | |
Elhayek et al. | Fully automatic multi-person human motion capture for vr applications | |
Chen et al. | Where should cameras look at soccer games: Improving smoothness using the overlapped hidden Markov model | |
Hu et al. | Deep 360 pilot: Learning a deep agent for piloting through 360° sports videos | |
Yang et al. | A framework for knowing who is doing what in aerial surveillance videos | |
Khattar et al. | Visual localization and servoing for drone use in indoor remote laboratory environment | |
Huang et al. | One-shot imitation filming of human motion videos | |
Dang et al. | Path-analysis-based reinforcement learning algorithm for imitation filming | |
Wang et al. | Attention-Based Deep Reinforcement Learning for Virtual Cinematography of 360° Videos | |
Su et al. | Social behavior prediction from first person videos | |
US10728427B2 (en) | Apparatus, systems and methods for nonlinear synchronization of action videos | |
Wang et al. | 3D object detection algorithm for panoramic images with multi-scale convolutional neural network | |
Riaz et al. | Synthetic Data Generation Framework, Dataset, and Efficient Deep Model for Pedestrian Intention Prediction | |
Yang et al. | Design and implementation of intelligent analysis technology in sports video target and trajectory tracking algorithm | |
Dang et al. | Imitation learning-based algorithm for drone cinematography system | |
CN116954365A (en) | Virtual human limb movement interaction system based on monocular video drive | |
Fawzi et al. | Quadcopter control using onboard monocular camera for enriching remote laboratory facilities |