8000 rich-junwang (richard wang) / Starred · GitHub

More Web Proxy on the site http://driver.im/

rich-junwang

Follow

richard wang rich-junwang

Follow

NLP

52 followers · 598 following

07:03 (UTC -07:00)

Lists (1)

Sort

🔮 Future ideas

Stars

pickle-com / glass

JavaScript 2,480 455 Updated Jul 5, 2025

PRIME-RL / Entropy-Mechanism-of-RL

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

Python 230 8 Updated Jun 26, 2025

TauricResearch / TradingAgents

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 12,008 1,860 Updated Jul 3, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 5,252 322 Updated Jun 26, 2025

EvolvingLMMs-Lab / multimodal-search-r1

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python 219 11 Updated Jul 3, 2025

ChenmienTan / RL2

Python 193 16 Updated Jul 4, 2025

elder-plinius / CL4R1T4S

AI SYSTEMS TRANSPARENCY FOR ALL! - LEAKED SYSTEM PROMPTS FOR CHATGPT, GEMINI, GROK, CLAUDE, PERPLEXITY, CURSOR, WINDSURF, DEVIN, REPLIT, AND MORE!

7,197 1,576 Updated Jul 3, 2025

THUDM / slime

slime is a LLM post-training framework aiming at scaling RL.

Python 538 32 Updated Jul 5, 2025

SakanaAI / RLT

Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.

Python 284 41 Updated Jun 23, 2025

wdndev / llmchat

Forked from project-templates-repo/electron-vue3-template

Develop LLM Chat Applications with Electron.

Vue 2 Updated Jun 24, 2025

ttengwang / Awesome_Long_Form_Video_Understanding

Awesome papers & datasets specifically focused on long-term videos.

282 12 Updated Nov 15, 2024

fla-org / distillation-fla

Forked from OpenSparseLLMs/Linearization

Distillation pipeline from pretrained Transformers to customized FLA models

Python 10 Updated Jul 3, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,905 220 Updated Jul 4, 2025

mirage-project / mirage

Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA

C++ 1,517 91 Updated Jul 2, 2025

wdndev / mllm_interview_note

主要记录大语言大模型（LLMs）算法（应用）工程师多模态相关知识

HTML 207 7 Updated May 12, 2024

VTool-R1 / VTool-R1

Code for the paper "VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use"

Python 93 1 Updated Jun 24, 2025

unclecode / crawl4ai

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Python 47,177 4,537 Updated Jul 4, 2025

feifeibear / long-context-attention

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 523 55 Updated May 27, 2025

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 41,556 3,310 Updated Jul 5, 2025

MoonshotAI / Kimi-Dev

open-source coding LLM for software engineering tasks

Python 671 83 Updated Jun 27, 2025

GeeeekExplorer / nano-vllm

Nano vLLM

Python 4,881 564 Updated Jun 27, 2025

uccl-project / uccl

Ultra and Unified CCL

C++ 353 20 Updated Jul 5, 2025

ByteDance-Seed / ByteCheckpoint

ByteCheckpoint: An Unified Checkpointing Library for LFMs

Python 224 10 Updated Apr 2, 2025

ByteDance-Seed / Seed1.5-VL

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,289 50 Updated Jun 14, 2025

fw-ai-external / reward-kit

Get start with RL, today

Python 13 2 Updated Jul 3, 2025

google-deepmind / rlax

Python 1,334 91 Updated May 8, 2025

Simple-Efficient / RL-Factory

Train your Agent model via our easy and efficient framework

Python 1,240 110 Updated Jul 1, 2025

CharlesQ9 / Alita

589 34 Updated Jun 6, 2025

alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 1,361 76 Updated Jul 2, 2025

rwitten / HighPerfLLMs2024

Python 510 50 Updated Jul 11, 2024

0