blossomin

CHEN Dong blossomin

PhD studtent @ HKUST CSE

16 followers · 21 following

The Hong Kong University of Science and Technology
Hong Kong
blossomin.github.io

Highlights

Lists (1)

Sort

🚀 My stack

Stars

llm-d / llm-d-kv-cache-manager

Distributed KV cache coordinator

Go 39 12 Updated Jun 26, 2025

Vanilla-Beauty / tiny-lsm

A KV storage engine based on LSM Tree, supporting Redis RESP

C++ 183 25 Updated Jun 24, 2025

alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 1,291 71 Updated Jun 30, 2025

gpu-mode / triton-index

Cataloging released Triton kernels.

240 12 Updated Jan 10, 2025

deepseek-ai / DualPipe

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,817 298 Updated Mar 10, 2025

shenh10 / DeepSeek_Simulator

Python 74 8 Updated Apr 2, 2025

gpu-mode / resource-stream

GPU programming related news and material links

1,602 89 Updated Jan 6, 2025

ppl-ai / pplx-kernels

Perplexity GPU Kernels

C++ 380 46 Updated Jun 10, 2025

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 4,378 455 Updated Jun 30, 2025

gpu-mode / lectures

Material for gpu-mode lectures

Jupyter Notebook 4,655 468 Updated Jun 18, 2025

bytedance / flux

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 992 67 Updated May 28, 2025

bytedance / InfiniStore

KV cache store for distributed LLM inference

C++ 278 28 Updated Jun 6, 2025

pprp / Awesome-Efficient-MoE

Efficient Mixture of Experts for LLM Paper List

Python 79 3 Updated Dec 15, 2024

hemingkx / SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

814 45 Updated Jun 22, 2025

vllm-project / production-stack

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 1,417 217 Updated Jun 30, 2025

TUM-DSE / CVM_eval

Evaluation code for confidential virtual machines (AMD SEV-SNP / Intel TDX)

Python 10 3 Updated Apr 23, 2025

APPFL / APPFL

Advanced Privacy-Preserving Federated Learning framework

Python 140 25 Updated Jun 27, 2025

deepseek-ai / DeepSeek-V3

Python 97,916 15,938 Updated Jun 27, 2025

deepseek-ai / DeepSeek-R1

90,326 11,656 Updated Jun 27, 2025

MoE-Inf / awesome-moe-inference

Curated collection of papers in MoE model inference

203 8 Updated Feb 19, 2025

Azure / az-cgpu-onboarding

Python 25 11 Updated May 19, 2025

vectorch-ai / ScaleLLM

A high-performance inference system for large language models, designed for production environments.

C++ 449 37 Updated Jun 25, 2025

abcdabcd987 / libfabric-efa-demo

C++ 41 4 Updated Jan 5, 2025

apple / security-pcc

Private Cloud Compute (PCC)

Swift 829 79 Updated Apr 11, 2025

jxzhangjhu / Awesome-LLM-RAG

Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

1,234 73 Updated Feb 24, 2025

NexaAI / Awesome-LLMs-on-device

Awesome LLMs on Device: A Comprehensive Survey

1,137 104 Updated Jan 12, 2025

horseee / Awesome-Efficient-LLM

A curated list for Efficient Large Language Models

Python 1,756 140 Updated Jun 17, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,161 1,680 Updated Jun 30, 2025

Dongzhijin / MNPWAD

MNPWAD: Multi-Normal Prototypes Learning for Weakly Supervised Anomaly Detection

Python 3 Updated Jun 19, 2025

NVIDIA / kvpress

LLM KV cache compression made easy

Python 523 42 Updated Jun 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly