Stars
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels.
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation.
Tutel MoE: an optimized Mixture-of-Experts library, supporting DeepSeek FP8/FP4.
MSCCL++: a GPU-driven communication stack for scalable AI applications.
A throughput-oriented, high-performance serving framework for LLMs.
Synchronization and asynchronous computation package for Go.
[ICLR2025 Spotlight🔥] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
Borgo is a statically typed language that compiles to Go.
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
GLake: optimizing GPU memory management and IO transmission.
A fast inference library for running LLMs locally on modern consumer-class GPUs
Ring attention implementation built on flash attention.
Implementation of the MagViT2 tokenizer in PyTorch.
Chat凉宫春日 (Chat Haruhi Suzumiya): an open-sourced role-playing chatbot, by Cheng Li, Ziang Leng, and others.
XVERSE-13B: a multilingual large language model developed by XVERSE Technology Inc.
Development repository for the Triton language and compiler.
Hackable and optimized Transformers building blocks, supporting a composable construction.
Implementation of a Transformer, written entirely in Triton.
Unsupervised text tokenizer focused on computational efficiency.
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
Efficient cache for gigabytes of data written in Go.