Stars
Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture. Train an MDM using GPT with this repo!
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
Open-source Multi-agent Poster Generation from Papers
Reverse Engineering Gemma 3n: Google's New Edge-Optimized Language Model
Scaling Computer-Use Grounding via UI Decomposition and Synthesis
Painless Evaluation of Flash Linear Attention models on Synthetic Tasks
The official implementation for Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
GitHub repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)
SkyRL: A Modular Full-stack RL Library for LLMs
📖 A repository organizing papers, code, and other resources related to unified multimodal models.
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
FlashInfer: Kernel Library for LLM Serving
Computer Agent Arena: Test and compare AI agents in real desktop apps and web environments. Code and data coming soon!
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
Distributed Compiler based on Triton for Parallel Systems
[ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"
Stick-breaking attention
The Open All-in-One Multimodal AI Agent Stack connecting Cutting-edge AI Models and Agent Infra.
[ACL 2025] An inference-time decoding strategy with adaptive foresight sampling
Pretraining infrastructure for multi-hybrid AI model architectures