8000 wangaoone (Ao) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View wangaoone's full-sized avatar

Organizations

@ds2-lab

Block or report wangaoone

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 3,470 290 Updated Jun 27, 2025

KV cache store for distributed LLM inference

C++ 276 28 Updated Jun 6, 2025

Run LLMs with MLX

Python 1,157 145 Updated Jun 26, 2025

Redis for LLMs

Python 1,741 253 Updated Jun 26, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,050 901 Updated Jun 17, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,840 279 Updated May 15, 2025

High-performance Python librarys for connecting AI/ML frameworks with OSS storage.

Python 23 4 Updated Jun 27, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 8,264 899 Updated Apr 30, 2025

A self-learning tutorail for CUDA High Performance Programing.

JavaScript 647 68 Updated Apr 12, 2025

cricket is a virtualization solution for GPUs

C 203 45 Updated Jun 3, 2025

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook 629 66 Updated Apr 6, 2025
Python 32 5 Updated Jun 7, 2024

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 14,461 1,030 Updated Jun 26, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 15,506 2,200 Updated Jun 27, 2025
Python 611 56 Updated Jul 31, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 10,867 1,528 Updated Jun 27, 2025

CUDA checkpoint and restore utility

C 344 19 Updated Jan 27, 2025

Awesome LLM compression research papers and tools.

1,576 99 Updated Jun 14, 2025

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 102,621 10,125 Updated Jun 26, 2025

A self-developed version of the user-mode CUDA emulator project and a learning repository for Rust

Rust 5 2 Updated Sep 22, 2023

Examples and guides for using the OpenAI API

MDX 64,933 10,740 Updated Jun 27, 2025

Containers for machine learning

Go 8,679 617 Updated Jun 26, 2025

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Python 15,721 1,835 Updated Jun 27, 2024

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,917 3,441 Updated May 18, 2024

λFS: an elastic, high-performance, serverless-function-based metadata service for large-scale distributed file systems (ACM ASPLOS'23)

Java 11 2 Updated Apr 2, 2025
Rust 15 2 Updated Jul 2, 2023

Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型

Python 4,153 424 Updated Aug 23, 2024

🎉 Repo for LaWGPT, Chinese-Llama tuned with Chinese Legal knowledge. 基于中文法律知识的大语言模型

Python 5,990 553 Updated Jun 11, 2024

An HTTP serving framework by Banana

Python 99 9 Updated Dec 12, 2023

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 41,067 5,213 Updated Jun 27, 2024
Next
0