Stars
LAVIS - A One-stop Library for Language-Vision Intelligence
Code and demo for the CVPR 2024 highlight paper "PIGEON: Predicting Image Geolocations".
High-Resolution Image Synthesis with Latent Diffusion Models
3D Gaussian Rendering PlayGround: an open-source closed-loop autonomous driving simulator demo built on 3D Gaussian Splatting
Back to the Feature: Learning Robust Camera Localization from Pixels to Pose (CVPR 2021)
Official code for CVPR 2022 (Oral) paper "Deep Visual Geo-localization Benchmark"
Code release for Revisit Anything: Visual Place Recognition via Image Segment Retrieval (ECCV 2024)
AnyLoc: Universal Visual Place Recognition (RA-L 2023)
Datasets for long-term visual localization with sequential images in large-scale spaces
Segment-Anything + 3D. Let's lift anything to 3D.
Source code for “Autonomous Driving on Curvy Roads without Reliance on Frenet Frame: A Cartesian-based Trajectory Planning Method”, published in IEEE Transactions on Intelligent Transportation Systems
[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving
[ICLR 2024] Map Learning with Lane Segment for Autonomous Driving
📑 Simple and lightweight Hierarchical/Finite-State Machine (H-FSM) class (C++11)
ArduPlane, ArduCopter, ArduRover, ArduSub source
[CVPR'24, Demo Track Honourable Mention] SuperPrimitive: Scene Reconstruction at a Primitive Level
Official implementation of the paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
[3DV'25] 3D Reconstruction with Spatial Memory
Official PyTorch implementation of "UrbanIR: Large-Scale Urban Scene Inverse Rendering from a Single Video"
Minimize Energy in Images.