Piccinelli et al., 2024 - Google Patents
UniDepth: Universal Monocular Metric Depth EstimationPiccinelli et al., 2024
View PDF- Document ID
- 14879970238027697730
- Author
- Piccinelli L
- Yang Y
- Sakaridis C
- Segu M
- Li S
- Van Gool L
- Yu F
- Publication year
- Publication venue
- Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition
External Links
Snippet
Accurate monocular metric depth estimation (MMDE) is crucial to solving downstream tasks in 3D perception and modeling. However the remarkable accuracy of recent MMDE methods is confined to their training domains. These methods fail to generalize to unseen domains …
- 238000000034 method 0 abstract description 68
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/04—Texture mapping
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/10—Geometric effects
- G06T15/20—Perspective computation
- G06T15/205—Image-based rendering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Piccinelli et al. | UniDepth: Universal Monocular Metric Depth Estimation | |
JP7485482B2 (en) | Computer Vision System | |
Vespa et al. | Efficient octree-based volumetric SLAM supporting signed-distance and occupancy mapping | |
Hernandez-Juarez et al. | Embedded real-time stereo estimation via semi-global matching on the GPU | |
Lei et al. | Shape from polarization for complex scenes in the wild | |
Banz et al. | Real-time semi-global matching disparity estimation on the GPU | |
Nguyen et al. | Three-dimensional shape reconstruction from single-shot speckle image using deep convolutional neural networks | |
Hu et al. | Perspective flow aggregation for data-limited 6d object pose estimation | |
Alismail et al. | Robust tracking in low light and sudden illumination changes | |
Xu et al. | Mv-jar: Masked voxel jigsaw and reconstruction for lidar-based self-supervised pre-training | |
CN108122280A (en) | The method for reconstructing and device of a kind of three-dimensional point cloud | |
Li et al. | An efficient photogrammetric stereo matching method for high-resolution images | |
Michailidis et al. | High performance stereo system for dense 3-D reconstruction | |
Sautier et al. | Bevcontrast: Self-supervision in bev space for automotive lidar point clouds | |
Xue et al. | Imp: Iterative matching and pose estimation with adaptive pooling | |
Khaleghi et al. | An improved real-time miniaturized embedded stereo vision system (MESVS-II) | |
Tong et al. | 3D-CDRNet: Retrieval-based dense point cloud reconstruction from a single image under complex background | |
Li et al. | Sparse-to-local-dense matching for geometry-guided correspondence estimation | |
Bhayani et al. | Partially calibrated semi-generalized pose from hybrid point correspondences | |
Miao et al. | Pseudo-lidar for visual odometry | |
Clifford et al. | Particle matching and triangulation using light-field ray bundling | |
Wientapper et al. | A universal, closed-form approach for absolute pose problems | |
Zhao et al. | Structure-aware cross-modal transformer for depth completion | |
Liu et al. | Camera Auto-calibration from the Steiner Conic of the Fundamental Matrix | |
Tu et al. | PanoPose: Self-supervised Relative Pose Estimation for Panoramic Images |