Stars
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥
StanfordVL / sail-blog
Forked from sylhare/Type-on-Strap
The SAIL blog
Agent driven automation starting with the web. Try it: https://www.emergence.ai/web-automation-api
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Bayesian Neural Network Surrogates for Bayesian Optimization
Implementations of Gridworld, Modelwin, and Modelfail to experiment with offline RL
A highly efficient implementation of Gaussian Processes in PyTorch