Starred repositories
Matplotlib styles for scientific plotting
COLMAP - Structure-from-Motion and Multi-View Stereo
"Probabilistic Machine Learning" - a book series by Kevin Murphy
SpatialLM: Training Large Language Models for Structured Indoor Modeling
Hydra is a framework for elegantly configuring complex applications
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Python dictionaries with advanced dot notation access
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
Annotation UI for human speech and animal vocalizations featuring support for customizable spectrograms, multiple audio tracks, audio playback, CSV import/export functionality, and integration with…
This is the PyTorch implementation of our paper "RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation Model"
Official repo of VLABench, a large-scale benchmark designed for fairly evaluating VLAs, embodied agents, and VLMs.
[Lumina Embodied AI Community] Embodied AI Technical Guide (Embodied-AI-Guide)
Minimal reproduction of DeepSeek R1-Zero
Hiera: A fast, powerful, and simple hierarchical vision transformer.
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Official code and models for the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"
Official repo for "Images that Sound": spectrograms generated by diffusion models that can be viewed as images and played as sound
Rich is a Python library for rich text and beautiful formatting in the terminal.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
An extremely fast Python linter and code formatter, written in Rust.
This repository contains utility scripts for the KITTI-360 dataset.
A collection of resources and papers on Diffusion Models