Stars
Easily fine-tune, evaluate and deploy Qwen3, DeepSeek-R1, Llama 4 or any open source LLM / VLM!
Qwen2.5-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.
STUMPY is a powerful and scalable Python library for modern time series analysis
Chat with your documents on your local device using GPT models. No data leaves your device, so it stays 100% private.
Data-driven APIs for common optimization tasks
Infinite Photorealistic Worlds using Procedural Generation
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…
🔥 Highlighting the top ML papers every week.
Get a ChatGPT plugin up and running in under 5 minutes!
Recurrent and multi-process PyTorch implementation of the deep reinforcement learning actor-critic algorithms A2C and PPO
Computationally friendly hyper-parameter search with DP-SGD
Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".
High throughput synchronous and asynchronous reinforcement learning
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
A simple OpenAI Gym environment for single and multi-agent reinforcement learning
A Python interface for reinforcement learning environments
A suite of test scenarios for multi-agent reinforcement learning.
Contains implementations of the DoubIL and ResiduIL algorithms from the ICML '22 paper "Causal Imitation Learning under Temporally Correlated Noise".