Gu et al., 2020 - Google Patents
OnionNet: Single-view depth prediction and camera pose estimation for unlabeled videoGu et al., 2020
View PDF- Document ID
- 16025546698509098081
- Author
- Gu T
- Wang Z
- Li D
- Yang H
- Du W
- Zhou Y
- Publication year
- Publication venue
- IEEE Transactions on Cognitive and Developmental Systems
External Links
Snippet
In real scenes, humans can easily infer their positions and distances from other objects with their own eyes. To make the robots have the same visual ability, this article presents an unsupervised OnionNet framework, including LeafNet and ParachuteNet, for single-view …
- 230000000007 visual effect 0 abstract description 22
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Wang et al. | Mvster: Epipolar transformer for efficient multi-view stereo | |
Wei et al. | Deepsfm: Structure from motion via deep bundle adjustment | |
Shu et al. | Feature-metric loss for self-supervised learning of depth and egomotion | |
Xu et al. | Aanet: Adaptive aggregation network for efficient stereo matching | |
Liu et al. | Local similarity pattern and cost self-reassembling for deep stereo matching networks | |
CN111311666B (en) | Monocular vision odometer method integrating edge features and deep learning | |
Yin et al. | Scale recovery for monocular visual odometry using depth estimated with deep convolutional neural fields | |
Ye et al. | Fusionad: Multi-modality fusion for prediction and planning tasks of autonomous driving | |
Tong et al. | Adaptive cost volume representation for unsupervised high-resolution stereo matching | |
Joung et al. | Unsupervised stereo matching using confidential correspondence consistency | |
He et al. | Learning scene dynamics from point cloud sequences | |
Hwang et al. | Lidar depth completion using color-embedded information via knowledge distillation | |
Lin et al. | Unsupervised monocular visual odometry with decoupled camera pose estimation | |
Lin et al. | Efficient and high-quality monocular depth estimation via gated multi-scale network | |
Gu et al. | OnionNet: Single-view depth prediction and camera pose estimation for unlabeled video | |
Wang et al. | NVDS $^{\mathbf {+}} $: Towards Efficient and Versatile Neural Stabilizer for Video Depth Estimation | |
Yusiong et al. | AsiANet: Autoencoders in autoencoder for unsupervised monocular depth estimation | |
Gu et al. | Unsupervised cycle optimization learning for single-view depth and camera pose with Kalman filter | |
Ou et al. | A scene segmentation algorithm combining the body and the edge of the object | |
Liu et al. | Robust visual odometry using sparse optical flow network | |
Cao et al. | IBCO-Net: Integrity-Boundary-Corner Optimization in a General Multi-Stage Network for Building Fine Segmentation from Remote Sensing Images | |
Zhou et al. | DecoupledPoseNet: Cascade decoupled pose learning for unsupervised camera ego-motion estimation | |
Liu et al. | Mono-ViFI: A Unified Learning Framework for Self-supervised Single and Multi-frame Monocular Depth Estimation | |
Habekost et al. | Learning 3D Global Human Motion Estimation from Unpaired, Disjoint Datasets. | |
Wei et al. | LAM-Depth: Laplace-Attention Module-Based Self-Supervised Monocular Depth Estimation |