Yu et al., 2023 - Google Patents

Surrounding-aware representation prediction in Birds-Eye-View using transformers

Document ID: 5338157868439054089
Authors: Yu J, Zheng W, Chen Y, Zhang Y, Huang R
Publication year: 2023
Publication venue: Frontiers in Neuroscience

Snippet

Birds-Eye-View (BEV) maps provide an accurate representation of sensory cues present in the surroundings, including dynamic and static elements. Generating a semantic representation of BEV maps can be a challenging task since it relies on object detection and …
Continue reading at www.frontiersin.org

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36 Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46 Extraction of features or characteristics of the image
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00624 Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
    • G06K9/00791 Recognising scenes perceived from the perspective of a land vehicle, e.g. recognising lanes, obstacles or traffic signs on road scenes
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62 Methods or arrangements for recognition using electronic means
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/20 Image acquisition
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/20 Analysis of motion
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/10 Image acquisition modality
    • G06T2207/10024 Color image
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30781 Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F17/30784 Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
    • G06F17/30799 Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00 Subject matter not provided for in other groups of this subclass

Similar Documents

Publication, Title
Mahaur et al. Small-object detection based on YOLOv5 in autonomous driving systems
Ni et al. A survey on theories and applications for self-driving cars based on deep learning methods
Huang et al. Autonomous driving with deep learning: A survey of state-of-art technologies
Zhao et al. Improved vision-based vehicle detection and classification by optimized YOLOv4
Yu et al. Surrounding-aware representation prediction in Birds-Eye-View using transformers
Sellat et al. Intelligent Semantic Segmentation for Self‐Driving Vehicles Using Deep Learning
Yang et al. A fusion network for road detection via spatial propagation and spatial transformation
Qian et al. Gated-residual block for semantic segmentation using RGB-D data
Yang et al. Multi-granularity scenarios understanding network for trajectory prediction
Abdeljaber et al. Extraction of vehicle turning trajectories at signalized intersections using convolutional neural networks
Li et al. Multi-modal neural feature fusion for automatic driving through perception-aware path planning
Xu et al. Two-stage 3D object detection guided by position encoding
Jia et al. Real-time traffic sign detection based on weighted attention and model refinement
Yuan et al. Multi-level object detection by multi-sensor perception of traffic scenes
Srihari et al. Partially supervised image captioning model for urban road views
Wang et al. A multi-modal spatial–temporal model for accurate motion forecasting with visual fusion
Ouyang et al. Multiview cnn model for sensor fusion based vehicle detection
Yu et al. YOLO-MPAM: Efficient real-time neural networks based on multi-channel feature fusion
Yuan et al. DDCAttNet: road segmentation network for remote sensing images
Zhang et al. DNet-CNet: A novel cascaded deep network for real-time lane detection and classification
Wu et al. APPFNet: Adaptive point-pixel fusion network for 3D semantic segmentation with neighbor feature aggregation
Azam et al. Exploring Contextual Representation and Multi-modality for End-to-end Autonomous Driving
Xia et al. Enhancing 3D object detection through multi-modal fusion for cooperative perception
Iancu et al. An improved vehicle trajectory prediction model based on video generation
Chen et al. Road Marking Defect Detection Based on CFG_SI_YOLO Network