8000 KenyonY (K.Y) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View KenyonY's full-sized avatar
🐾
On vacation
🐾
On vacation
  • Baidu Inc.
  • Shanghai
  • 11:58 (UTC +08:00)

Organizations

@ml-natural-language-processing

Block or report KenyonY

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 2,801 218 Updated May 22, 2025

Modeling, training, eval, and inference code for OLMo

Python 5,621 611 Updated May 22, 2025

Build your personal knowledge base with TriliumNext Notes

TypeScript 2,543 145 Updated May 22, 2025

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

Python 2,424 77 Updated May 22, 2025

📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

Python 575 82 Updated May 19, 2025

Concurrent Python made simple

Python 1,402 28 Updated Feb 4, 2025

跨平台桌宠 BongoCat,为桌面增添乐趣!

TypeScript 5,081 245 Updated May 22, 2025

FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser & Trae AI (And other Open Sourced) System Prompts, Tools & AI Models.

50,915 15,644 Updated May 21, 2025

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,359 52 Updated Apr 18, 2025

An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.

Python 9,083 986 Updated May 23, 2025

APOLLO: SGD-like Memory, AdamW-level Performance

Python 228 9 Updated Apr 25, 2025

Visualize streams of multimodal data. Free, fast, easy to use, and simple to integrate. Built in Rust.

Rust 8,449 453 Updated May 22, 2025

🤱🏻 Turn any webpage into a desktop app with Rust. 🤱🏻 利用 Rust 轻松构建轻量级多端桌面应用

Rust 38,582 7,100 Updated Mar 25, 2025

Hot Reloading and Profiling for Python

Python 2,964 62 Updated May 24, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)

Python 6,787 660 Updated May 22, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 25,083 2,237 Updated May 22, 2025

Python implementation of MPPI (Model Predictive Path-Integral) controller to understand the basic idea. Mandatory dependencies are numpy and matplotlib only.

Jupyter Notebook 433 42 Updated Feb 6, 2025

Python client for Baidu Yun (Personal Cloud Storage) 百度云/百度网盘Python客户端

Python 8,227 1,416 Updated Apr 2, 2025

The official Python SDK for Model Context Protocol servers and clients

Python 12,839 1,496 Updated May 22, 2025

Deep research agent to help you find the best GitHub repositories 🕵️!

Python 716 74 Updated May 5, 2025

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 61,139 6,793 Updated May 23, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 13,981 1,450 Updated May 22, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’

Jupyter Notebook 1,788 79 Updated May 21, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,348 1,024 Updated May 23, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,977 309 Updated May 11, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,412 172 Updated May 21, 2025

🪄 Create rich visualizations with AI

TypeScript 11,555 889 Updated May 16, 2025

RLHF experiments on a single A100 40G GPU. Support PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, DeepSeek R1-Zero reproducing.

Python 61 11 Updated Feb 19, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,792 1,487 Updated Apr 24, 2025
Next
0