8000 AmourWaltz (Beyond Hsueh) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View AmourWaltz's full-sized avatar
🎯
Focusing
🎯
Focusing
  • The Chinese University of Hong Kong

Highlights

  • Pro

Block or report AmourWaltz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Jupyter Notebook 1 Updated May 21, 2025

Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models

106 4 Updated May 4, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,392 170 Updated May 23, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 7,668 845 Updated Apr 30, 2025

My learning notes/codes for ML SYS.

Python 2,282 142 Updated May 27, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,294 305 Updated May 13, 2025

Simple RL training for reasoning

Python 3,588 266 Updated Apr 10, 2025

Fully open reproduction of DeepSeek-R1

Python 24,561 2,264 Updated May 27, 2025

SOTA Math Opensource LLM

Python 332 20 Updated Dec 12, 2023

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,284 2,622 Updated Mar 4, 2025

Project of ALC 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"

Python 9 Updated Mar 25, 2025

The group website repo for CUHK MoE Lab of High Confidence Software Technologies

HTML 1 Updated Mar 14, 2025
1 Updated Oct 1, 2024

[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*

Jupyter Notebook 105 4 Updated Dec 10, 2024

A Survey on the Honesty of Large Language Models

57 2 Updated Dec 8, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,738 375 Updated May 27, 2025

Project of ACL 2025 MlingConf: A Comprehensive Investigation of Multilingual Confidence Estimation for Large Language Models

Python 2 Updated Jan 22, 2025

[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

Python 353 29 Updated Sep 6, 2024

An Easy-to-use, Scalable and High-performance 9110 RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)

Python 6,842 669 Updated May 27, 2025

Vite & Vue powered static site generator.

TypeScript 14,886 2,356 Updated May 26, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 50,907 6,152 Updated May 27, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 25,442 2,588 Updated May 23, 2025

雪之梦技术驿站,snowdreams1006搭建的 Gitbook 个人博客

HTML 91 61 Updated Feb 7, 2025

A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP

920 78 Updated Sep 22, 2024

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 12,663 1,347 Updated May 27, 2025
JavaScript 122 5 Updated Sep 10, 2024

本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/

Jupyter Notebook 8,278 937 Updated May 27, 2025

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 15,698 1,756 Updated May 8, 2025

This is the repo which record the evolution of LM-based dialogue system. More details can be found in our original survey paper: A Survey of the Evolution of Language Model-Based Dialogue Systems

61 3 Updated Apr 11, 2025

Existing Literature about Machine Unlearning

863 103 Updated Mar 21, 2025
Next
0