ruhanaazam

ruhanaazam

7 followers · 5 following

Stars

unslothai / unsloth

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥

Python 39,316 3,090 Updated May 25, 2025

StanfordVL / sail-blog

Forked from sylhare/Type-on-Strap

The SAIL blog

HTML 11 47 Updated Apr 30, 2025

EmergenceAI / Agent-E

Agent driven automation starting with the web. Try it: https://www.emergence.ai/web-automation-api

Python 1,112 163 Updated May 12, 2025

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 7,106 757 Updated Apr 8, 2025

yucenli / bnn-bo

Bayesian Neural Network Surrogates for Bayesian Optimization

Jupyter Notebook 51 13 Updated May 9, 2024

ShawnBLYU / offline_rl_envs

Implementations of Gridworld, Modelwin, and Modelfail to experiment with offline RL

Python 1 1 Updated May 14, 2021

cornellius-gp / gpytorch

A highly efficient implementation of Gaussian Processes in PyTorch

Python 3,707 564 Updated Mar 11, 2025

pytorch / botorch

Bayesian optimization in PyTorch

Jupyter Notebook 3,268 423 Updated May 22, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly