Stars
👻 Ghostty is a fast, feature-rich, and cross-platform terminal emulator that uses platform-native UI and GPU acceleration.
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepEP: an efficient expert-parallel communication library
A lightweight data processing framework built on DuckDB and 3FS.
A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
FlashMLA: Efficient MLA decoding kernels
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
AI demo for playing ARPG/Soul-like game with RL frame
APALACHE: symbolic model checker for TLA+ and Quint
An executable specification language with delightful tooling based on the temporal logic of actions (TLA)
TLA+ specifications related to Viewstamped Replication
A modular graph-based Retrieval-Augmented Generation (RAG) system
llama3 implementation one matrix multiplication at a time
A local chatbot fine-tuned by bilibili user comments.
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Cargo subcommand to easily use LLVM source-based code coverage (-C instrument-coverage).
Xray, Penetrates Everything. Also the best v2ray-core. Where the magic happens. An open platform for various uses.