csyanghan (Han Yang) / Starred · GitHub
🎯
Focusing

Starred repositories

648 results for source starred repositories

Official implementation of the paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"

Python 67 5 Updated May 28, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

TypeScript 12,072 1,242 Updated May 28, 2025

Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"

Python 20 Updated May 22, 2025

A final sanity checklist to help your CS paper get accepted, not desk rejected.

1,079 116 Updated May 7, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,721 1,441 Updated May 22, 2025

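Many container images are hosted overseas (gcr, for example) and are slow to download from within China, so they need acceleration. This project aims to provide a stable, reliable, and secure container image service that connects the whole world.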

Shell 10,177 1,201 Updated May 28, 2025

MASS-SPEC ATTENDS TO DE NOVO MOLECULAR GENERATION

Python 4 Updated Mar 16, 2025

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,377 54 Updated Apr 18, 2025

🚀🚀 "Large Model": Train a 26M-parameter GPT completely from scratch in just 2h! 🌏

Python 21,362 2,517 Updated Apr 30, 2025

Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar.

JavaScript 85 5 Updated Apr 14, 2025

A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Python 192 9 Updated May 23, 2025

Unleashing the Power of Reinforcement Learning for Math and Code Reasoners

Python 579 38 Updated May 14, 2025

Reproduction of DeepSeek-R1

Python 231 23 Updated Apr 14, 2025

A small on-device multimodal model

Python 73 Updated Dec 30, 2024

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

604 16 Updated May 20, 2025

RLHF experiments on a single A100 40G GPU. Supports PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, and reproduction of DeepSeek R1-Zero.

Python 62 11 Updated Feb 19, 2025

A course on aligning smol models.

Jupyter Notebook 5,863 2,079 Updated Jan 24, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,032 234 Updated May 28, 2025

Web-based tool that converts GitHub repository contents into a single formatted text file

JavaScript 1,330 153 Updated Dec 6, 2024

Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"

Python 229 18 Updated May 12, 2025

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

408 11 Updated May 22, 2025

A very simple GRPO implementation for reproducing R1-like LLM thinking.

Python 1,069 86 Updated Apr 3, 2025

The Chemistry Development Kit

Java 528 169 Updated May 6, 2025

SIRIUS is a software for discovering a landscape of de-novo identification of metabolites using tandem mass spectrometry. This repository contains the code of the SIRIUS Software (GUI and CLI)

Java 105 29 Updated Jan 22, 2025

Maple Mono: Open source monospace font with round corners, ligatures, and Nerd-Font icons for IDE and terminal, a perfect 2:1 Chinese-to-English character width ratio, and fine-grained customization options.

Python 16,462 504 Updated May 28, 2025

Artificial Intelligence Research for Science (AIRS)

Python 629 72 Updated May 1, 2025

Encoding MS/MS spectra using formula transformers for inferring molecular properties

Jupyter Notebook 58 14 Updated Jun 5, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,613 1,075 Updated May 28, 2025