Stars
Control Google Slides presentations with hand gestures
Implementation of CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation.
Web3 healthcare app that puts patients in control of their medical data with secure blockchain-based sharing and data management
Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and codes for the sole cost of electricity. 🔔 Official updates only via Twitter @Martin9…
[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
[CVPR 2025] We present SleeperMark, a novel framework designed to embed resilient watermarks into T2I diffusion models
Official implementation of LangCoop: Collaborative Driving with Natural Language
Discriminative Constrained Optimization for Reinforcing Large Reasoning Models
A comprehensive and critical synthesis of the emerging role of GenAI across the full autonomous driving stack
OpenMMLab Foundational Library for Training Deep Learning Models
Neighborhood Attention Extension. Bringing attention to a neighborhood near you!
Examples for Recommenders - easy to train and deploy on accelerated infrastructure.
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
A high-throughput and memory-efficient inference and serving engine for LLMs
Fully open reproduction of DeepSeek-R1
mm-grounding-dino-for-training
Mcity2.0 demo
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
Sys2Bench is a benchmarking suite designed to evaluate reasoning and planning capabilities of large language models across algorithmic, logical, arithmetic, and common-sense reasoning tasks.
PyTorch code for CVPR 2018 paper: Learning to Compare: Relation Network for Few-Shot Learning (Few-Shot Learning part)
PyTorch version of "End-to-end Recovery of Human Shape and Pose"
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
This is an official implementation for "AutoFocusFormer: Image Segmentation off the Grid".
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…