8000 yyDing1 (Yuyang Ding) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View yyDing1's full-sized avatar

Block or report yyDing1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,732 205 Updated Jun 19, 2025

[TKDE] Code implementation of Paper "Towards DS-NER: Unveiling and Addressing Latent Noise in Distant Annotations"

Python 2 Updated Oct 21, 2023

A comprehensive collection of process reward models.

92 1 Updated Jun 9, 2025

A series of math-specific large language models of our Qwen2 series.

Python 952 134 Updated Jan 11, 2025

本人的科研经验

6,955 408 Updated Jun 4, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,717 1,594 Updated Jun 20, 2025

Integrate the DeepSeek API into popular softwares

32,916 3,635 Updated May 13, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,141 813 Updated Jun 20, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 40,788 3,250 Updated Jun 19, 2025

Ongoing research training transformer models at scale

Python 12,614 2,857 Updated Jun 19, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 145,867 29,412 Updated Jun 20, 2025

Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"

Python 158 12 Updated May 20, 2025
Rust 1 Updated Mar 23, 2025

Official code and data repository of MathChat: MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions

Python 18 2 Updated Jun 3, 2024
Python 9 2 Updated Dec 7, 2024
Python 12 1 Updated Nov 5, 2024

[ACL-25] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.

Python 63 7 Updated Oct 27, 2024

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 3,271 607 Updated Jan 24, 2025

[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*

Jupyter Notebook 108 5 Updated Dec 10, 2024

[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!

Python 716 62 Updated Mar 17, 2025

Train transformer language models with reinforcement learning.

Python 14,265 1,979 Updated Jun 20, 2025

Robust recipes to align language models with human and AI preferences

Python 5,231 448 Updated Apr 30, 2025

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,908 522 Updated Sep 25, 2024

Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"

Python 554 95 Updated Jun 18, 2025

[ACL-24 Findings] Code implementation of Paper "Rethinking Negative Instances for Generative Named Entity Recognition"

Python 52 2 Updated Mar 20, 2024

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""

Python 3,847 288 Updated Nov 25, 2024

The official Meta Llama 3 GitHub site

Python 28,784 3,400 Updated Jan 26, 2025

Grok open release

Python 50,287 8,354 Updated Aug 30, 2024

Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)

973 54 Updated Nov 18, 2024
Next
0