Lists (13)
Sort Name ascending (A-Z)
Stars
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields
A Unified Framework for Surface Reconstruction
A collaboration friendly studio for NeRFs
GNSS/INS/Camera Integrated Navigation Library
3D Gaussian Rendering PlayGround: an open-source autonomous driving closed-loop simulator demo using 3D Gaussian Splatting tech
Open-source and strong foundation image recognition models.
PLUTO: Push the Limit of Imitation Learning-based Planning for Autonomous Driving
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
[CVPR2024] SchurVINS: Schur Complement-Based Lightweight Visual Inertial Navigation System
Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
Paper reading notes on Deep Learning and Machine Learning
Xtreme1 is an all-in-one data labeling and annotation platform for multimodal data training and supports 3D LiDAR point cloud, image, and LLM.
3D Point Cloud Annotation Platform for Autonomous Driving
Web labeling tool for bitmap images and point clouds
我的导航算法学习笔记,内容涵盖导航定位开源程序的源码解读、开源项目梳理、书籍讲义、博客翻译、教程讲座推荐;所有内容都可以随意转载,原始文件都放在这里了,大家可以在我的基础上整理出自己的一些文档。(Tips:①主要是写给初学者,已经有基础的同学应该多看论文和代码,看我的笔记学不到啥;②仓库持续更新中,不建议 fork)
[RAL 2023] A globally consistent LiDAR map optimization module
基于OpenVINO,本地部署大模型智能体Agent,控制TonyPi人形机器人
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
A Modular Framework for 3D Gaussian Splatting and Beyond
ECCV'2022 PanoFormer: Panorama Transformer for Indoor 360° Depth Estimation
[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving