8000 rich-junwang (richard wang) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View rich-junwang's full-sized avatar
  • 07:03 (UTC -07:00)

Block or report rich-junwang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
JavaScript 2,480 455 Updated Jul 5, 2025

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

Python 230 8 Updated Jun 26, 2025

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 12,008 1,860 Updated Jul 3, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,252 322 Updated Jun 26, 2025

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python 219 11 Updated Jul 3, 2025
Python 193 16 Updated Jul 4, 2025

AI SYSTEMS TRANSPARENCY FOR ALL! - LEAKED SYSTEM PROMPTS FOR CHATGPT, GEMINI, GROK, CLAUDE, PERPLEXITY, CURSOR, WINDSURF, DEVIN, REPLIT, AND MORE!

7,197 1,576 Updated Jul 3, 2025

slime is a LLM post-training framework aiming at scaling RL.

Python 538 32 Updated Jul 5, 2025

Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.

Python 284 41 Updated Jun 23, 2025

Develop LLM Chat Applications with Electron.

Vue 2 Updated Jun 24, 2025

Awesome papers & datasets specifically focused on long-term videos.

282 12 Updated Nov 15, 2024

Distillation pipeline from pretrained Transformers to customized FLA models

Python 10 Updated Jul 3, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,905 220 Updated Jul 4, 2025

Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA

C++ 1,517 91 Updated Jul 2, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师多模态相关知识

HTML 207 7 Updated May 12, 2024

Code for the paper "VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use"

Python 93 1 Updated Jun 24, 2025

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Python 47,177 4,537 Updated Jul 4, 2025

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 523 55 Updated May 27, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 41,556 3,310 Updated Jul 5, 2025

open-source coding LLM for software engineering tasks

Python 671 83 Updated Jun 27, 2025

Nano vLLM

Python 4,881 564 Updated Jun 27, 2025

Ultra and Unified CCL

C++ 353 20 Updated Jul 5, 2025

ByteCheckpoint: An Unified Checkpointing Library for LFMs

Python 224 10 Updated Apr 2, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,289 50 Updated Jun 14, 2025

Get start with RL, today

Python 13 2 Updated Jul 3, 2025
Python 1,334 91 Updated May 8, 2025

Train your Agent model via our easy and efficient framework

Python 1,240 110 Updated Jul 1, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 1,361 76 Updated Jul 2, 2025
Python 510 50 Updated Jul 11, 2024
Next
0