Stars
Supercharge Your LLM with the Fastest KV Cache Layer
Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)
Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters, extended Kalman filters, unscented Kalman filters, particle filte…
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Self-Hosted Platform for Secure Execution of Untrusted User/AI Code
Latest Advances on System-2 Reasoning
CodeRAG-Bench: Can Retrieval Augment Code Generation?
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
An extremely fast Python type checker and language server, written in Rust.
Build Real-Time Knowledge Graphs for AI Agents
A Python module for creating Excel XLSX files.
Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"
The official Python SDK for Model Context Protocol servers and clients
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
Official PyTorch implementation for "Large Language Diffusion Models"
Empower the Web community and invite more to build across platforms.
Marrying Rust and CMake - Easy Rust and C/C++ Integration!
[NeurIPS 2024] Simple and Effective Masked Diffusion Language Model
On-device AI across mobile, embedded and edge for PyTorch
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Cost-efficient and pluggable Infrastructure components for GenAI inference
👨‍💻 Python cleanup script for macOS
A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.