8000 kugwzk (Zekun Wang) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View kugwzk's full-sized avatar
💬
I may be slow to respond.
💬
I may be slow to respond.

Block or report kugwzk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture. Training an MDM using GPT with this repo!

Python 14 Updated Jun 23, 2025

Nano vLLM

Python 4,910 574 Updated Jun 27, 2025

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.

Python 2,558 198 Updated Jul 6, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 1,364 78 Updated Jul 2, 2025
Python 17 Updated Jun 3, 2025

Open-source Multi-agent Poster Generation from Papers

Python 2,267 132 Updated Jun 17, 2025

Reverse Engineering Gemma 3n: Google's New Edge-Optimized Language Model

Python 204 12 Updated May 27, 2025

Open-source unified multimodal model

Python 4,467 375 Updated Jul 2, 2025

Scaling Computer-Use Grounding via UI Decomposition and Synthesis

TypeScript 85 2 Updated Jun 18, 2025
Python 169 10 Updated Jun 2, 2025

Painless Evaluation of Flash Linear Attention models on Synthetic Tasks

5 Updated May 13, 2025

The official implementation for Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Jupyter Notebook 44 1 Updated May 13, 2025

Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)

Python 63 2 Updated Jun 4, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 549 51 Updated Jul 6, 2025

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

605 32 Updated Jun 27, 2025

Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research

Python 209 32 Updated Jul 6, 2025
Python 21 Updated May 5, 2025

Humanity's Last Exam

Python 840 44 Updated Jun 6, 2025

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 404 22 Updated Jul 5, 2025
19 Updated May 3, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 3,307 364 Updated Jul 6, 2025

Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon!

45 2 Updated Apr 7, 2025

Scaling Deep Research via Reinforcement Learning in Real-world Environments.

Python 492 38 Updated Apr 13, 2025

Distributed Compiler based on Triton for Parallel Systems

Python 869 68 Updated Jul 4, 2025
Python 43 1 Updated Apr 2, 2025

[ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"

Python 112 7 Updated Jul 5, 2025

Stick-breaking attention

Python 1 Updated Jan 12, 2025

The Open All-in-One Multimodal AI Agent Stack connecting Cutting-edge AI Models and Agent Infra.

TypeScript 14,993 1,329 Updated Jul 6, 2025

[ACL 2025] An inference-time decoding strategy with adaptive foresight sampling

Python 96 7 Updated May 18, 2025

Pretraining infrastructure for multi-hybrid AI model architectures

Python 170 16 Updated May 7, 2025
Next
0