Huang et al., 2021 - Google Patents

One-shot imitation drone filming of human motion videos

Huang et al., 2021

Document ID: 5055989491863828098
Author: Huang C; Dang Y; Chen P; Yang X; Cheng K
Publication year: 2021
Publication venue: IEEE Transactions on Pattern Analysis and Machine Intelligence

External Links

Cited by

Snippet

Imitation learning has recently been applied to mimic the operation of a cameraman in existing autonomous camera systems. To imitate a certain demonstration video, existing methods require users to collect a significant number of training videos with a similar filming …

Continue reading at ieeexplore.ieee.org (other versions)

238000000034 method 0 abstract description 11

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/017—Gesture based interaction, e.g. based on a set of recognized hand gestures
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment; Cameras comprising an electronic image sensor, e.g. digital cameras, video cameras, TV cameras, video cameras, camcorders, webcams, camera modules for embedding in other devices, e.g. mobile phones, computers or vehicles
- H04N5/225—Television cameras; Cameras comprising an electronic image sensor, e.g. digital cameras, video cameras, video cameras, camcorders, webcams, camera modules for embedding in other devices, e.g. mobile phones, computers or vehicles
- H04N5/232—Devices for controlling television cameras, e.g. remote control; Control of cameras comprising an electronic image sensor, e.g. digital cameras, video cameras, TV cameras, video cameras, camcorders, webcams, camera modules for embedding in, e.g. mobile phones, computers or vehicles

Similar Documents

Publication	Publication Date	Title
Luo et al.	2019	End-to-end active object tracking and its real-world deployment via reinforcement learning
Hu et al.	2017	Deep 360 pilot: Learning a deep agent for piloting through 360 sports videos
Huang et al.	2019	Learning to film from professional human motion videos
Huang et al.	2021	One-shot imitation drone filming of human motion videos
Yu et al.	2018	A deep ranking model for spatio-temporal highlight detection from a 360◦ video
Huang et al.	2019	Learning to capture a film-look video with a camera drone
Zhu et al.	2023	SUES-200: A multi-height multi-scene cross-view image benchmark across drone and satellite
Gärtner et al.	2020	Deep reinforcement learning for active human pose estimation
CN112132866A (en)	2020-12-25	Target object tracking method, device and equipment and computer readable storage medium
Elhayek et al.	2018	Fully automatic multi-person human motion capture for vr applications
Chen et al.	2017	Where should cameras look at soccer games: Improving smoothness using the overlapped hidden Markov model
Hu et al.	2017	Deep 360 pilot: Learning a deep agent for piloting through 360deg sports videos
Yang et al.	2019	A framework for knowing who is doing what in aerial surveillance videos
Khattar et al.	2021	Visual localization and servoing for drone use in indoor remote laboratory environment
Huang et al.	2019	One-shot imitation filming of human motion videos
Dang et al.	2022	Path-analysis-based reinforcement learning algorithm for imitation filming
Wang et al.	2020	Attention-Based Deep Reinforcement Learning for Virtual Cinematography of 360$^{\circ} $ Videos
Su et al.	2016	Social behavior prediction from first person videos
US10728427B2 (en)	2020-07-28	Apparatus, systems and methods for nonlinear synchronization of action videos
Wang et al.	2019	3D object detection algorithm for panoramic images with multi-scale convolutional neural network
Riaz et al.	2023	Synthetic Data Generation Framework, Dataset, and Efficient Deep Model for Pedestrian Intention Prediction
Yang et al.	2022	Design and implementation of intelligent analysis technology in sports video target and trajectory tracking algorithm
Dang et al.	2020	Imitation learning-based algorithm for drone cinematography system
CN116954365A (en)	2023-10-27	Virtual human limb movement interaction system based on monocular video drive
Fawzi et al.	2018	Quadcopter control using onboard monocular camera for enriching remote laboratory facilities