8000 yyDing1 (Yuyang Ding) / Starred · GitHub

More Web Proxy on the site http://driver.im/

yyDing1

Follow

Yuyang Ding yyDing1

Follow

PhD student at Soochow University.

28 followers · 12 following

Soochow University
https://yyding1.github.io

Achievements

Achievements

Stars

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,732 205 Updated Jun 19, 2025

yyDing1 / DS-NER

[TKDE] Code implementation of Paper "Towards DS-NER: Unveiling and Addressing Latent Noise in Distant Annotations"

Python 2 Updated Oct 21, 2023

RyanLiu112 / Awesome-Process-Reward-Models

A comprehensive collection of process reward models.

92 1 Updated Jun 9, 2025

QwenLM / Qwen2.5-Math

A series of math-specific large language models of our Qwen2 series.

Python 952 134 Updated Jan 11, 2025

pengsida / learning_research

本人的科研经验

6,955 408 Updated Jun 4, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,717 1,594 Updated Jun 20, 2025

deepseek-ai / awesome-deepseek-integration

Integrate the DeepSeek API into popular softwares

32,916 3,635 Updated May 13, 2025

deepseek-ai / DeepSeek-R1

90,181 11,650 Updated Apr 9, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 8,141 813 Updated Jun 20, 2025

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 40,788 3,250 Updated Jun 19, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 12,614 2,857 Updated Jun 19, 2025

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 145,867 29,412 Updated Jun 20, 2025

QwenLM / ProcessBench

Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"

Python 158 12 Updated May 20, 2025

PeterlitsZo / jisp

Rust 1 Updated Mar 23, 2025

Zhenwen-NLP / MathChat

Official code and data repository of MathChat: MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions

Python 18 2 Updated Jun 3, 2024

CRUXEVAL-X / cruxeval-x

Python 9 2 Updated Dec 7, 2024

zhuzilin / vllm-group

Python 12 1 Updated Nov 5, 2024

yyDing1 / ScaleQuest

[ACL-25] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.

Python 63 7 Updated Oct 27, 2024

eliahuhorwitz / Academic-project-page-template

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 607 Updated Jan 24, 2025

hkust-nlp / dart-math

[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*

Jupyter Notebook 108 5 Updated Dec 10, 2024

magpie-align / magpie

[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!

Python 716 62 Updated Mar 17, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 14,265 1,979 Updated Jun 20, 2025

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 5,231 448 Updated Apr 30, 2025

deepseek-ai / DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,908 522 Updated Sep 25, 2024

LiveCodeBench / LiveCodeBench

Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"

Python 554 95 Updated Jun 18, 2025

yyDing1 / GNER

[ACL-24 Findings] Code implementation of Paper "Rethinking Negative Instances for Generative Named Entity Recognition"

Python 52 2 Updated Mar 20, 2024

Codium-ai / AlphaCodium

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""

Python 3,847 288 Updated Nov 25, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 28,784 3,400 Updated Jan 26, 2025

xai-org / grok-1

Grok open release

Python 50,287 8,354 Updated Aug 30, 2024

quqxui / Awesome-LLM4IE-Papers

Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)

973 54 Updated Nov 18, 2024

0