-
The Hong Kong University of Science and Technology
- Hong Kong
- blossomin.github.io
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
A KV storage engine based on LSM Tree, supporting Redis RESP
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
GPU programming related news and material links
A Datacenter Scale Distributed Inference Serving Framework
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
Efficient Mixture of Experts for LLM Paper List
📰 Must-read papers and blogs on Speculative Decoding ⚡️
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
Evaluation code for confidential virtual machines (AMD SEV-SNP / Intel TDX)
Advanced Privacy-Preserving Federated Learning framework
Curated collection of papers in MoE model inference
A high-performance inference system for large language models, designed for production environments.
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
Awesome LLMs on Device: A Comprehensive Survey
A curated list for Efficient Large Language Models
verl: Volcano Engine Reinforcement Learning for LLMs
MNPWAD: Multi-Normal Prototypes Learning for Weakly Supervised Anomaly Detection