8000 LiangJiabaoY (Jiabao Liang) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View LiangJiabaoY's full-sized avatar

Block or report LiangJiabaoY

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,337 1,020 Updated May 22, 2025

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 107,958 17,575 Updated May 22, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,505 1,421 Updated May 22, 2025

Community maintained hardware plugin for vLLM on Ascend

Python 666 157 Updated May 22, 2025

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 9,236 1,583 Updated May 22, 2025

Distributed reliable key-value store for the most critical data of a distributed system

Go 49,428 10,074 Updated May 22, 2025

The official Python library for the OpenAI API

Python 26,764 3,911 Updated May 22, 2025

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,528 175 Updated Jun 25, 2024

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

Python 7,726 840 Updated May 22, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 3,311 259 Updated May 22, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 4,079 374 Updated May 22, 2025

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,916 443 Updated Aug 7, 2024

Integrate cutting-edge LLM technology quickly and easily into your apps

C# 24,719 3,860 Updated May 22, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,568 758 Updated May 15, 2025

LLM API 管理 & 分发系统,支持 OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型,统一 API 适配,可用于 key 管理与二次分发。单可执行文件,提供 Docker 镜像,一键部署,开箱即用。LLM API management & k…

JavaScript 25,348 5,182 Updated Feb 21, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 9,847 853 Updated May 14, 2025

Several simple examples for popular neural network toolkits calling custom CUDA operators.

Python 1,475 197 Updated Apr 29, 2021

BSP kernel source

C 1,055 1,174 Updated Mar 24, 2025

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 7,478 2,031 Updated May 22, 2025
C++ 796 92 Updated May 19, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 1,377 133 Updated May 22, 2025

Train transformer language models with reinforcement learning.

Python 13,858 1,900 Updated May 22, 2025

Align Anything: Training All-modality Model with Feedback

Jupyter Notebook 3,746 445 Updated May 1, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 38,506 4,383 Updated May 22, 2025

Fast and memory-efficient exact attention

Python 17,466 1,692 Updated May 22, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 49,421 6,020 Updated May 21, 2025
Jupyter Notebook 13 Updated Nov 4, 2024
Next
0