Starred repositories
OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)
An open-source codebase for exploring autonomous driving pre-training
[NeurIPS 2024] Official PyTorch implementation code for realizing the technical part of Mamba-based traversal of rationale (Meteor) to improve performance of numerous vision language performances f…
VMamba: Visual State Space Models,code is based on mamba
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
A curated collection of papers, tutorials, videos, and other valuable resources related to Mamba.
This is the official repository for Talk2LiDAR project.
[CVPR'23] Video Probabilistic Diffusion Models in Projected Latent Space
Predictive Coding Network (PreCNet) for Next Frame Video Prediction.
[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving
Bridging Large Vision-Language Models and End-to-End Autonomous Driving
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
A series of large language models trained from scratch by developers @01-ai
[ICRA 2025] OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity
[ECCV 2024] The official code for "Dolphins: Multimodal Language Model for Driving“
A curated list of awesome knowledge-driven autonomous driving (continually updated)
[AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)
[CVPR 2023] ReasonNet: End-to-End Driving with Temporal and Global Reasoning
Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
[ECCV 2024] Embodied Understanding of Driving Scenarios