8000 Purshow (Purshow) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Purshow's full-sized avatar

Highlights

  • Pro

Organizations

@PKU-YuanGroup

Block or report Purshow

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 7,474 731 Updated Jul 24, 2025

Implementation for "The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer"

Python 54 3 Updated Jul 24, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 29,913 6,135 Updated Jul 24, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 11,388 1,894 Updated Jul 24, 2025

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.

Python 7,657 534 Updated Jul 24, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 15,609 1,946 Updated Jul 24, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 53,068 8,895 Updated Jul 24, 2025
Python 16 5 Updated Jul 24, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 147,386 29,759 Updated Jul 24, 2025

🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.

TypeScript 30,530 2,673 Updated Jul 24, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 2,783 343 Updated Jul 24, 2025

重庆大学资源共享计划

Python 208 12 Updated Jul 24, 2025

Distributed RL System for LLM Reasoning

Python 2,067 125 Updated Jul 24, 2025

LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [Actively Maintained🔥]

Python 95 2 Updated Jul 24, 2025

Long-RL: Scaling RL to Long Sequences

Python 504 14 Updated Jul 24, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…

Python 8,864 766 Updated Jul 24, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 6,755 580 Updated Jul 24, 2025

[ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/TokenBridge

Python 129 4 Updated Jul 24, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 63,360 5,958 Updated Jul 24, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 147,392 12,491 Updated Jul 24, 2025

Train transformer language models with reinforcement learning.

Python 14,717 2,057 Updated Jul 24, 2025

A framework for few-shot evaluation of language models.

Python 9,636 2,569 Updated Jul 24, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,795 380 Updated Jul 24, 2025

Efficient Triton Kernels for LLM Training

Python 5,409 372 Updated Jul 24, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 10,104 978 Updated Jul 24, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 39,487 4,484 Updated Jul 23, 2025

Ongoing research training transformer models at scale

Python 12,976 2,949 Updated Jul 23, 2025

LLM UI with advanced features, easy setup, and multiple backend support.

Python 44,431 5,720 Updated Jul 23, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 54,838 6,740 Updated Jul 23, 2025

Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025)

Python 108 8 Updated Jul 23, 2025
Next
0