Han Yang csyanghan

🎯

Focusing

39 followers · 92 following

SJTU
Shanghai, China
https://csyanghan.github.io/

Starred repositories

QwenLM / PolyMath

Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"

Python 17 Updated May 15, 2025

yzhao062 / cs-paper-checklist

A final sanity checklist to help your CS paper get accepted, not desk rejected.

937 105 Updated May 7, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,337 1,406 Updated May 16, 2025

DaoCloud / public-image-mirror

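Many container images are hosted overseas (gcr, for example), so downloads within China are slow and need acceleration. This project aims to provide a stable, reliable, and secure container image service that connects the whole world.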

Shell 10,050 1,190 Updated May 12, 2025

HassounLab / MADGEN

Mass-Spec Attends to De Novo Molecular Generation

Python 4 Updated Mar 16, 2025

policy-gradient / GRPO-Zero

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,344 52 Updated Apr 18, 2025

jingyaogong / minimind

🚀🚀 [LLM] Train a small 26M-parameter GPT entirely from scratch in just 2 hours! 🌏

Python 20,997 2,458 Updated Apr 30, 2025

MLNLP-World / Overleaf-Bib-Helper

Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar.

JavaScript 82 5 Updated Apr 14, 2025

zwhe99 / DeepMath

A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Python 189 9 Updated May 9, 2025

SkyworkAI / Skywork-OR1

Unleashing the Power of Reinforcement Learning for Math and Code Reasoners

Python 569 35 Updated May 14, 2025

ByungKwanLee / DeepSick-R1

Reproduction of DeepSeek-R1

Python 228 23 Updated Apr 14, 2025

ByteDance-Seed / Seed-Thinking-v1.5

758 11 Updated Apr 20, 2025

Sweewangyu / s-mllm

A small on-device multimodal model

Python 73 Updated Dec 30, 2024

yaotingwangofficial / Awesome-MCoT

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

575 15 Updated May 9, 2025

jackfsuia / nanoRLHF

RLHF experiments on a single A100 40G GPU. Supports PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, and reproduction of DeepSeek R1-Zero.

Python 61 11 Updated Feb 19, 2025

huggingface / smol-course

A course on aligning smol models.

Jupyter Notebook 5,839 2,058 Updated Jan 24, 2025

QwenLM / Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 2,967 226 Updated May 19, 2025

abinthomasonline / repo2txt

Web-based tool that converts GitHub repository contents into a single formatted text file

JavaScript 1,314 151 Updated Dec 6, 2024

knoveleng / open-rs

Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"

Python 227 17 Updated May 12, 2025

Eclipsess / Awesome-Efficient-Reasoning-LLMs

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

388 11 Updated May 13, 2025

lsdefine / simple_GRPO

A very simple GRPO implementation for reproducing R1-like LLM thinking.

Python 1,049 86 Updated Apr 3, 2025

cdk / cdk

The Chemistry Development Kit

Java 527 168 Updated May 6, 2025

sirius-ms / sirius

SIRIUS is software for de novo identification of metabolites using tandem mass spectrometry. This repository contains the code of the SIRIUS software (GUI and CLI).

Java 105 29 Updated Jan 22, 2025

subframe7536 / maple-font

Maple Mono: an open-source monospace font with rounded corners, ligatures, and Nerd Font icons for IDEs and terminals, with fine-grained customization options. Chinese and English character widths are an exact 2:1.

Python 16,169 478 Updated May 19, 2025