Gu et al., 2020 - Google Patents

OnionNet: Single-view depth prediction and camera pose estimation for unlabeled video

Gu et al., 2020

Document ID: 16025546698509098081
Author: Gu T; Wang Z; Li D; Yang H; Du W; Zhou Y
Publication year: 2020
Publication venue: IEEE Transactions on Cognitive and Developmental Systems

External Links

Cited by

Snippet

In real scenes, humans can easily infer their positions and distances from other objects with their own eyes. To make the robots have the same visual ability, this article presents an unsupervised OnionNet framework, including LeafNet and ParachuteNet, for single-view …

Continue reading at www.researchgate.net (PDF) (other versions)

230000000007 visual effect 0 abstract description 22

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation

Similar Documents

Publication	Publication Date	Title
Wang et al.	2022	Mvster: Epipolar transformer for efficient multi-view stereo
Wei et al.	2020	Deepsfm: Structure from motion via deep bundle adjustment
Shu et al.	2020	Feature-metric loss for self-supervised learning of depth and egomotion
Xu et al.	2020	Aanet: Adaptive aggregation network for efficient stereo matching
Liu et al.	2022	Local similarity pattern and cost self-reassembling for deep stereo matching networks
Yin et al.	2017	Scale recovery for monocular visual odometry using depth estimated with deep convolutional neural fields
Tong et al.	2022	Adaptive cost volume representation for unsupervised high-resolution stereo matching
Joung et al.	2019	Unsupervised stereo matching using confidential correspondence consistency
He et al.	2022	Learning scene dynamics from point cloud sequences
Duan et al.	2021	RGB-Fusion: Monocular 3D reconstruction with learned depth prediction
Lin et al.	2021	Unsupervised monocular visual odometry with decoupled camera pose estimation
Lin et al.	2020	Efficient and high-quality monocular depth estimation via gated multi-scale network
Ren et al.	2023	DeepSFM: robust deep iterative refinement for structure from motion
Gu et al.	2020	OnionNet: Single-view depth prediction and camera pose estimation for unlabeled video
Liu et al.	2024	Mono-ViFI: A Unified Learning Framework for Self-supervised Single and Multi-frame Monocular Depth Estimation
Ou et al.	2022	A scene segmentation algorithm combining the body and the edge of the object
Yusiong et al.	2019	AsiANet: Autoencoders in autoencoder for unsupervised monocular depth estimation
Liu et al.	2022	Robust visual odometry using sparse optical flow network
Dai et al.	2019	Unsupervised learning of depth estimation based on attention model and global pose optimization
Cao et al.	2023	IBCO-Net: Integrity-boundary-corner optimization in a general multistage network for building fine segmentation from remote sensing images
Zhou et al.	2022	DecoupledPoseNet: Cascade decoupled pose learning for unsupervised camera ego-motion estimation
Xia et al.	2024	PCDR-DFF: Multi-modal 3D object detection based on point cloud diversity representation and dual feature fusion
Habekost et al.	2020	Learning 3D Global Human Motion Estimation from Unpaired, Disjoint Datasets.
Wei et al.	2024	LAM-depth: Laplace-attention module-based self-supervised monocular depth estimation
Xu et al.	2024	4d contrastive superflows are dense 3d representation learners