Stars
KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems
A Tree Search Library with Flexible API for LLM Inference-Time Scaling
Development repository for the Triton language and compiler
An official Qdrant Model Context Protocol (MCP) server implementation
Automated Capability Discovery via Foundation Model Self-Exploration
Train transformer language models with reinforcement learning.
A curated list of awesome resources, tools, workflows, and guides for Google's Gemini CLI
An open-source AI agent that brings the power of Gemini directly into your terminal.
A configuration framework that enhances Claude Code with specialized commands, cognitive personas, and development methodologies.
Using reinforcement learning to train teacher models that teach LLMs how to reason, for test-time scaling.
Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"
Training Large Language Models to Reason in a Continuous Latent Space
A generative world for general-purpose robotics & embodied AI learning.
A simple script to see how my ideas evolve over time
Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
A supervisor agent for open agent platform
An open-source, no-code agent building platform.
Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.
A batteries-included toolkit for the GPU-accelerated OpenMM molecular simulation engine.
Compatibility tool for Steam Play based on Wine and additional components
FlashMLA: Efficient MLA decoding kernels