Lists (6)
Sort Name ascending (A-Z)
Starred repositories
Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"
A final sanity checklist to help your CS paper get accepted, not desk rejected.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
很多镜像都在国外。比如 gcr 。国内下载很慢,需要加速。致力于提供连接全世界的稳定可靠安全的容器镜像服务。
Implementing DeepSeek R1's GRPO algorithm from scratch
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar | 通过允许从 DBLP 和 Google Scholar 进行文章搜索和获取 BibTeX 来增强 Overleaf。
A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
Unleashing the Power of Reinforcement Learning for Math and Code Reasoners
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
RLHF experiments on a single A100 40G GPU. Support PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, DeepSeek R1-Zero reproducing.
A course on aligning smol models.
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
Web-based tool converts GitHub repository contents into a single formatted text file
Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
A very simple GRPO implement for reproducing r1-like LLM thinking.
SIRIUS is a software for discovering a landscape of de-novo identification of metabolites using tandem mass spectrometry. This repository contains the code of the SIRIUS Software (GUI and CLI)
Maple Mono: Open source monospace font with round corner, ligatures and Nerd-Font for IDE and terminal, fine-grained customization options. 带连字和控制台图标的圆角等宽字体,中英文宽度完美2:1,细粒度的自定义选项
Artificial Intelligence Research for Science (AIRS)
Encoding MS/MS spectra using formula transformers for inferring molecular properties
verl: Volcano Engine Reinforcement Learning for LLMs
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023