Stars
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
A modular graph-based Retrieval-Augmented Generation (RAG) system
SecretFlow-Serving is a serving system for privacy-preserving machine learning models.
Efficient Private Set Intersection base on VOLE
The reference implementation of SimplePIR and DoublePIR, accompanying the paper "One Server for the Price of Two: Simple and Fast Single-Server Private Information Retrieval" (USENIX Security 2023).
Building a quick conversation-based search demo with Lepton AI.
LLM API 管理 & 分发系统,支持 OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型,统一 API 适配,可用于 key 管理与二次分发。单可执行文件,提供 Docker 镜像,一键部署,开箱即用。LLM API management & k…
Write scalable load tests in plain Python 🚗💨
OneDiff: An out-of-the-box acceleration library for diffusion models.
Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Tools for merging pretrained large language models.
Codebase for Merging Language Models (ICML 2024)
The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"
An amazing UI for OpenAI's ChatGPT (Website + Windows + MacOS + Linux)
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
Modin: Scale your Pandas workflows by changing a single line of code