jweihe (jweihe) / Starred · GitHub

jweihe

🦙

Focusing

13 followers · 54 following

Institute of Computing Technology, CAS
Beijing

Achievements

Highlights

Pro

Starred repositories

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,585 266 Updated Apr 10, 2025

LC1332 / Chat-Haruhi-Suzumiya

Chat-Haruhi-Suzumiya (Chat凉宫春日), an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.

Jupyter Notebook 1,973 177 Updated Aug 13, 2024

jweihe / RLHF-book-Chinese

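RLHF Chinese handbook: a detailed walkthrough of the full RLHF optimization pipeline, covering instruction tuning, reward model training, and key techniques such as rejection sampling, reinforcement learning, and direct alignment algorithms.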

TeX 2 Updated May 7, 2025

TomBener / quarto-cn-tools

Quarto template for Chinese academic writing

Python 51 4 Updated May 24, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 13,890 1,906 Updated May 25, 2025

godweiyang / GrabGPU

A handy script for grabbing and occupying GPUs

Cuda 331 38 Updated Jan 20, 2025

modelscope / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 4,450 240 Updated May 25, 2025

nickscamara / open-deep-research

An open-source deep research clone: an AI agent that reasons over large amounts of web data extracted with Firecrawl

TypeScript 5,624 695 Updated May 7, 2025

langgenius / dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 98,915 14,865 Updated May 25, 2025

liaokongVFX / LangChain-Chinese-Getting-Started-Guide

A Chinese-language getting-started tutorial for LangChain

8,161 642 Updated Apr 19, 2025

yangjianxin1 / Firefly

Firefly: a large language model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

Python 6,406 578 Updated Oct 24, 2024

HCIILAB / M6Doc

126 5 Updated May 8, 2025

zankner / Hydra

Python 45 1 Updated Feb 19, 2024

pprp / Awesome-LLM-Prune

Awesome list for LLM pruning.

228 9 Updated Dec 15, 2024

yangyifei729 / LaCo

Official implementation for LaCo (EMNLP 2024 Findings)

Jupyter Notebook 16 4 Updated Oct 3, 2024

datawhalechina / agent-tutorial

294 32 Updated Mar 19, 2024

FoundationVision / VAR

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 7,986 493 Updated May 18, 2025

lz1oceani / verify_cot

Python 133 8 Updated Nov 3, 2023

ytyz1307zzh / RefAug

Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"

Python 54 3 Updated Oct 1, 2024

openai / prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,998 118 Updated Jun 1, 2023

percent4 / llm_math_solver

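This project covers dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large language models, along with notes on related articles.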

Python 87 8 Updated Sep 14, 2024

AIDC-AI / Ovis

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 919 57 Updated Mar 25, 2025

google-deepmind / alphageometry

Python 4,499 516 Updated Oct 25, 2024

daixiangzi / Awesome-Token-Compress

A paper list of recent work on token compression for ViT and VLM

486 23 Updated May 25, 2025

yaolinli / DeCo

36 1 Updated Jul 8, 2024

meta-math / MetaMath

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Python 432 39 Updated Feb 1, 2024

nlpxucan / WizardLM

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

Python 9,409 729 Updated Aug 5, 2024

mbzuai-oryx / LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 834 61 Updated Jul 10, 2024

labuladong / fucking-algorithm

Algorithm practice is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.

Markdown 128,007 23,367 Updated Jan 31, 2025

lupantech / dl4math

Resources of deep learning for mathematical reasoning (DL4MATH).

357 28 Updated Dec 22, 2023

Starred topics

Python
