Zhao et al., 2022 - Google Patents
Jperceiver: Joint perception network for depth, pose and layout estimation in driving scenesZhao et al., 2022
View PDF- Document ID
- 13530700697256163950
- Author
- Zhao H
- Zhang J
- Zhang S
- Tao D
- Publication year
- Publication venue
- European Conference on Computer Vision
External Links
Snippet
Depth estimation, visual odometry (VO), and bird's-eye-view (BEV) scene layout estimation present three critical tasks for driving scene perception, which is fundamental for motion planning and navigation in autonomous driving. Though they are complementary to each …
- 230000000007 visual effect 0 abstract description 20
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G06K9/00791—Recognising scenes perceived from the perspective of a land vehicle, e.g. recognising lanes, obstacles or traffic signs on road scenes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Sakaridis et al. | Map-guided curriculum domain adaptation and uncertainty-aware evaluation for semantic nighttime image segmentation | |
Dong et al. | Towards real-time monocular depth estimation for robotics: A survey | |
Hu et al. | Deep depth completion from extremely sparse data: A survey | |
CN110651310B (en) | Deep learning method for estimating object density and/or flow, and related method and software | |
Ranft et al. | The role of machine vision for intelligent vehicles | |
Lopez-Rodriguez et al. | Desc: Domain adaptation for depth estimation via semantic consistency | |
Bešić et al. | Dynamic object removal and spatio-temporal RGB-D inpainting via geometry-aware adversarial learning | |
Zhao et al. | Jperceiver: Joint perception network for depth, pose and layout estimation in driving scenes | |
Yang et al. | A fusion network for road detection via spatial propagation and spatial transformation | |
Huang et al. | Neural correspondence field for object pose estimation | |
Tian et al. | Adaptive and azimuth-aware fusion network of multimodal local features for 3D object detection | |
Charco et al. | Camera pose estimation in multi-view environments: From virtual scenarios to the real world | |
Yan et al. | Forging vision foundation models for autonomous driving: Challenges, methodologies, and opportunities | |
Zuo et al. | LGADet: Light-weight anchor-free multispectral pedestrian detection with mixed local and global attention | |
Yen et al. | 3d-pl: Domain adaptive depth estimation with 3d-aware pseudo-labeling | |
Han et al. | Self-supervised monocular Depth estimation with multi-scale structure similarity loss | |
CN112800822A (en) | 3D automatic tagging with structural and physical constraints | |
Vinoth et al. | Lightweight object detection in low light: Pixel-wise depth refinement and TensorRT optimization | |
Lin et al. | Mlf-det: Multi-level fusion for cross-modal 3d object detection | |
Wang et al. | Cbwloss: constrained bidirectional weighted loss for self-supervised learning of depth and pose | |
Zhang et al. | Hvdistill: Transferring knowledge from images to point clouds via unsupervised hybrid-view distillation | |
Dai et al. | Unsupervised learning of depth estimation based on attention model and global pose optimization | |
Zhai et al. | Geometry understanding from autonomous driving scenarios based on feature refinement | |
Yue et al. | Vehicle motion segmentation via combining neural networks and geometric methods | |
Bui et al. | GAC3D: improving monocular 3D object detection with ground-guide model and adaptive convolution |