Stars
Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud.
Infinity is a high-throughput, low-latency serving engine for text embeddings, reranking models, CLIP, CLAP, and ColPali.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Shadcn-ui based tree view, with multi-selection, drag, and more!
🏗️ Fine-tune, build, and deploy open-source LLMs easily!
Efficient few-shot learning with Sentence Transformers
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks, for both inference and training.
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
SGLang is a fast serving framework for large language models and vision language models.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Making large AI models cheaper, faster and more accessible
Port of OpenAI's Whisper model in C/C++
Example of how to handle background processes with FastAPI, Celery, and Docker
Faster Whisper transcription with CTranslate2