Stars
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
Understand Human Behavior to Align True Needs
[SIGGRAPH Asia 2024] Painting process generation using diffusion models
This project features an open-source small bipedal robot designed for research, education, and hobbyist experimentation.
A Python library to facilitate interaction with Onshape's REST API
Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
corl-team / CORL
Forked from tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023
A good enough project template for writing reproducible code
A large-scale benchmark and learning environment.
ARKit teleoperation for MuJoCo and the real world
Code for the paper "A Comparison of Imitation Learning Algorithms for Bimanual Manipulation" (Drolet et al., 2024)
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
Robust recipes to align language models with human and AI preferences
[RSS 2024] Consistency Policy: Accelerated Visuomotor Policies via Consistency Distillation
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning