Stars
SYSTEM PROMPT TRANSPARENCY FOR ALL - CHATGPT, GEMINI, GROK, CLAUDE, PERPLEXITY, CURSOR, WINDSURF, DEVIN, REPLIT, AND MORE!
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini
Self-contained, minimalistic implementation of diffusion models with Pytorch.
Lightweight coding agent that runs in your terminal
Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
reasoning model trained using GRPO towards rosetta REF2015 for protein stability
MoBA: Mixture of Block Attention for Long-Context LLMs
A very simple GRPO implement for reproducing r1-like LLM thinking.
Machine Learning Journal for Intermediate to Advanced Topics.
This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov
Democratizing Reinforcement Learning for LLMs
Replace 'hub' with 'ingest' in any github url to get a prompt-friendly extract of a codebase
Fully open reproduction of DeepSeek-R1
LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.
⚡ TabPFN: Foundation Model for Tabular Data ⚡
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Testing paligemma2 finetuning on reasoning dataset
Stanford-ILIAD / openvla-mini
Forked from openvla/openvlaOpenVLA: An open-source vision-language-action model for robotic manipulation.
Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
Muon: An optimizer for hidden layers in neural networks