- Fairfax, VA
- wangaoone.github.io
More
Stars
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
High-performance Python librarys for connecting AI/ML frameworks with OSS storage.
A self-learning tutorail for CUDA High Performance Programing.
Disaggregated serving system for Large Language Models (LLMs).
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
SGLang is a fast serving framework for large language models and vision language models.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
Awesome LLM compression research papers and tools.
Virtual whiteboard for sketching hand-drawn like diagrams
A self-developed version of the user-mode CUDA emulator project and a learning repository for Rust
Examples and guides for using the OpenAI API
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
Official Code for DragGAN (SIGGRAPH 2023)
λFS: an elastic, high-performance, serverless-function-based metadata service for large-scale distributed file systems (ACM ASPLOS'23)
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
🎉 Repo for LaWGPT, Chinese-Llama tuned with Chinese Legal knowledge. 基于中文法律知识的大语言模型
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型