Cao et al., 2024 - Google Patents

VSL-Net: Voxel structure learning for 3D object detection

Cao et al., 2024

Document ID: 5833317292797535817
Author: Cao F; Zhou F; Tao C; Xue J; Gao Z; Zhang Z; Zhu Y
Publication year: 2024
Publication venue: Advanced Engineering Informatics

External Links

Cited by

Snippet

Current detection methods with single stage generally lack contextual structure information, the classification and location confidence are inconsistent, which are not able to achieve accurate dynamic multi-object detection. Therefore, a VSL-Net method is proposed based …

Continue reading at www.sciencedirect.com (other versions)

238000001514 detection method 0 title abstract description 176

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G06K9/4604—Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes, intersections
- G06K9/4609—Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes, intersections by matching or filtering
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G06K9/00791—Recognising scenes perceived from the perspective of a land vehicle, e.g. recognising lanes, obstacles or traffic signs on road scenes
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/20—Image acquisition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
- G06T17/05—Geographic models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K2209/00—Indexing scheme relating to methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions

Similar Documents

Publication	Publication Date	Title
Zhou et al.	2020	Joint 3d instance segmentation and object detection for autonomous driving
Zhu et al.	2022	Vpfnet: Improving 3d object detection with virtual point based lidar and stereo data fusion
Wang et al.	2023	Multi-modal 3d object detection in autonomous driving: A survey and taxonomy
Chen et al.	2021	RoIFusion: 3D object detection from LiDAR and vision
Miao et al.	2021	Pvgnet: A bottom-up one-stage 3d object detector with integrated multi-level features
Lu et al.	2023	A cnn-transformer hybrid model based on cswin transformer for uav image object detection
CN112347987A (en)	2021-02-09	Multimode data fusion three-dimensional target detection method
Liu et al.	2024	Segment any point cloud sequences by distilling vision foundation models
Song et al.	2024	Robustness-aware 3d object detection in autonomous driving: A review and outlook
Wu et al.	2021	Multi-modal 3D object detection by 2D-guided precision anchor proposal and multi-layer fusion
Bu et al.	2019	Pedestrian planar LiDAR pose (PPLP) network for oriented pedestrian detection based on planar LiDAR and monocular images
Hoang et al.	2023	3onet: 3d detector for occluded object under obstructed conditions
Shi et al.	2022	An improved lightweight deep neural network with knowledge distillation for local feature extraction and visual localization using images and LiDAR point clouds
Xie et al.	2022	Recent advances in conventional and deep learning-based depth completion: A survey
Luo et al.	2022	Dynamic multitarget detection algorithm of voxel point cloud fusion based on pointrcnn
Tao et al.	2023	An efficient 3D object detection method based on fast guided anchor stereo RCNN
Huang et al.	2021	SSA3D: Semantic segmentation assisted one-stage three-dimensional vehicle object detection
Song et al.	2024	Voxelnextfusion: A simple, unified and effective voxel fusion framework for multi-modal 3d object detection
Hoang et al.	2024	TSSTDet: Transformation-based 3-D Object Detection via a Spatial Shape Transformer
Contreras et al.	2024	A survey on 3D object detection in real time for autonomous driving
Wang et al.	2023	LiDAR-only 3D object detection based on spatial context
Cao et al.	2024	VSL-Net: Voxel structure learning for 3D object detection
Zhang et al.	2024	MMAF-Net: Multi-view multi-stage adaptive fusion for multi-sensor 3D object detection
Jhong et al.	2023	Density-Aware and Semantic-Guided Fusion for 3D Object Detection using LiDAR-Camera Sensors
Guo et al.	2024	Multi-Layer Fusion 3D Object Detection via Lidar Point Cloud and Camera Image