8000 wyxscir / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View wyxscir's full-sized avatar
🍒
🍒
  • beijing

Block or report wyxscir

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 2 Updated May 18, 2025

SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward

Python 48 1 Updated Jun 3, 2025

A Business-Driven Real-World Financial Benchmark for Evaluating LLMs

Python 188 5 Updated Jun 5, 2025

Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents

Python 1,239 254 Updated Jun 12, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 834 46 Updated Jun 12, 2025

Open-source Multi-agent Poster Generation from Papers

Python 1,981 100 Updated Jun 4, 2025

Code for the paper: "Learning to Reason without External Rewards"

Python 273 23 Updated Jun 12, 2025

The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Python 119 5 Updated Jun 3, 2025
Python 9 Updated Jun 3, 2025

Code for paper "SSR-Zero: Simple Self-Rewarding Reinforcement Learning for Machine Translation"

5 Updated May 28, 2025

A benchmark for LLMs on complicated tasks in the terminal

Shell 161 38 Updated Jun 12, 2025

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,049 47 Updated Jun 4, 2025

[ACL 2025 Findings] Self-Critique Guided Iterative Reasoning for Multi-hop Question Answering

Python 2 Updated May 21, 2025

Obsidian Weread Plugin is a plugin to sync Weread(微信读书) hightlights and annotations into your Obsidian Vault.

TypeScript 1,453 86 Updated May 6, 2025
Python 202 9 Updated May 14, 2025

Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling

Python 381 15 Updated May 17, 2025
Python 1 Updated May 19, 2025

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python 1,443 62 Updated Jun 5, 2025

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 226 23 Updated Jun 3, 2025

Official Repository of Absolute Zero Reasoner

Python 1,508 254 Updated Jun 2, 2025

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Python 1,002 91 Updated Jun 12, 2025

✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Python 143 7 Updated May 9, 2025
Python 13 Updated May 7, 2025

Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.

496 34 Updated Jun 6, 2025

official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example”

Python 282 25 Updated Jun 9, 2025

YiZhao: A 2TB Open Financial Corpus. Data and tools for generating and inspecting YiZhao, a safe, high-quality, open-source bilingual financial corpus (Chinese and English).

Python 26 3 Updated Dec 12, 2024

GraphGen: A Scalable Approach to Domain-agnostic Labeled Graph Generation

C++ 59 16 Updated Jul 6, 2023

My learning notes/codes for ML SYS.

Python 2,442 155 Updated Jun 12, 2025
Next
0