csyanghan

🎯

Focusing

Han Yang csyanghan

🎯

Focusing

39 followers · 93 following

SJTU
ShangHai , China
https://csyanghan.github.io/

Achievements

Highlights

Lists (6)

Sort

Starred repositories

648 results for source starred repositories

Clear filter

eric-ai-lab / Soft-Thinking

Official implementation of the paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"

Python 67 5 Updated May 28, 2025

bytedance / deer-flow

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

TypeScript 12,072 1,242 Updated May 28, 2025

MINE-Lab-ND / SpectrumML_Survey_Papers

17 5 Updated Feb 18, 2025

QwenLM / PolyMath

Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"

Python 20 Updated May 22, 2025

yzhao062 / cs-paper-checklist

A final sanity checklist to help your CS paper get accepted, not desk rejected.

1,079 116 Updated May 7, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,721 1,441 Updated May 22, 2025

DaoCloud / public-image-mirror

很多镜像都在国外。比如 gcr 。国内下载很慢，需要加速。致力于提供连接全世界的稳定可靠安全的容器镜像服务。

Shell 10,177 1,201 Updated May 28, 2025

HassounLab / MADGEN

MASS-SPEC ATTENDS TO DE NOVO MOLECULAR GENERATION

Python 4 Updated Mar 16, 2025

policy-gradient / GRPO-Zero

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,377 54 Updated Apr 18, 2025

jingyaogong / minimind

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT！🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 21,362 2,517 Updated Apr 30, 2025

MLNLP-World / Overleaf-Bib-Helper

Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar | 通过允许从 DBLP 和 Google Scholar 进行文章搜索和获取 BibTeX 来增强 Overleaf。

JavaScript 85 5 Updated Apr 14, 2025

zwhe99 / DeepMath

A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Python 192 9 Updated May 23, 2025

SkyworkAI / Skywork-OR1

Unleashing the Power of Reinforcement Learning for Math and Code Reasoners

Python 579 38 Updated May 14, 2025

ByungKwanLee / DeepSick-R1

Reproduction of DeepSeek-R1

Python 231 23 Updated Apr 14, 2025

ByteDance-Seed / Seed-Thinking-v1.5

772 13 Updated Apr 20, 2025

Sweewangyu / s-mllm

端侧多模态小模型

Python 73 Updated Dec 30, 2024

yaotingwangofficial / Awesome-MCoT

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

604 16 Updated May 20, 2025

jackfsuia / nanoRLHF

RLHF experiments on a single A100 40G GPU. Support PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, DeepSeek R1-Zero reproducing.

Python 62 11 Updated Feb 19, 2025

huggingface / smol-course

A course on aligning smol models.

Jupyter Notebook 5,863 2,079 Updated Jan 24, 2025

QwenLM / Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,032 234 Updated May 28, 2025

abinthomasonline / repo2txt

Web-based tool converts GitHub repository contents into a single formatted text file

JavaScript 1,330 153 Updated Dec 6, 2024

knoveleng / open-rs

Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"

Python 229 18 Updated May 12, 2025

Eclipsess / Awesome-Efficient-Reasoning-LLMs

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

408 11 Updated May 22, 2025

lsdefine / simple_GRPO

A very simple GRPO implement for reproducing r1-like LLM thinking.

Python 1,069 86 Updated Apr 3, 2025

cdk / cdk

The Chemistry Development Kit

Java 528 169 Updated May 6, 2025

sirius-ms / sirius

SIRIUS is a software for discovering a landscape of de-novo identification of metabolites using tandem mass spectrometry. This repository contains the code of the SIRIUS Software (GUI and CLI)

Java 105 29 Updated Jan 22, 2025

subframe7536 / maple-font

Maple Mono: Open source monospace font with round corner, ligatures and Nerd-Font for IDE and terminal, fine-grained customization options. 带连字和控制台图标的圆角等宽字体，中英文宽度完美2:1，细粒度的自定义选项

Python 16,462 504 Updated May 28, 2025

divelab / AIRS

Artificial Intelligence Research for Science (AIRS)

Python 629 72 Updated May 1, 2025

samgoldman97 / mist

Encoding MS/MS spectra using formula transformers for inferring molecular properties

Jupyter Notebook 58 14 Updated Jun 5, 2024

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,613 1,075 Updated May 28, 2025

Han Yang csyanghan

Highlights

Lists (6)

Front End

GNN

LLM Finetune

Molecule

PaperList

Traffic Forecasting

Starred repositories

Python