Stars
How to reduce complexity and move faster? Just Postgres for everything.
Get your documents ready for gen AI
Things you can do with the token embeddings of an LLM
Cross-Platform Keystroke Launcher
minimal pytorch implementation of bm25 (with sparse tensors)
A python module to repair invalid JSON from LLMs
Write scalable load tests in plain Python 🚗💨
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
Open source platform for the machine learning lifecycle
Stable Diffusion web UI
🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
Official inference library for Mistral models
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend for Generative Agents/Apps.
WireGuard VPN installer for Linux servers
Command line driven CI frontend and development task automation tool.
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
The simplest, fastest repository for training/finetuning medium-sized GPTs.
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training