Lists (1)
Sort Name ascending (A-Z)
Stars
verl: Volcano Engine Reinforcement Learning for LLMs
🦜🔗 Build context-aware reasoning applications
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Community maintained hardware plugin for vLLM on Ascend
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Distributed reliable key-value store for the most critical data of a distributed system
The official Python library for the OpenAI API
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
A Datacenter Scale Distributed Inference Serving Framework
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Integrate cutting-edge LLM technology quickly and easily into your apps
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
LLM API 管理 & 分发系统,支持 OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型,统一 API 适配,可用于 key 管理与二次分发。单可执行文件,提供 Docker 镜像,一键部署,开箱即用。LLM API management & k…
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Several simple examples for popular neural network toolkits calling custom CUDA operators.
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Train transformer language models with reinforcement learning.
Align Anything: Training All-modality Model with Feedback
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Fast and memory-efficient exact attention
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)