8000 jweihe (jweihe) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View jweihe's full-sized avatar
🦙
Focusing
🦙
Focusing
  • Institute of Computing Technology, CAS
  • Beijing

Highlights

  • Pro

Block or report jweihe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Simple RL training for reasoning

Python 3,585 266 Updated Apr 10, 2025

Chat凉宫春日, An open sourced Role-Playing chatbot Cheng Li, Ziang Leng, and others.

Jupyter Notebook 1,973 177 Updated Aug 13, 2024

RLHF中文手册 - 详细解析RLHF全流程优化阶段,涵盖指令调优、奖励模型训练,以及拒绝采样、强化学习和直接对齐算法等关键技术。

TeX 2 Updated May 7, 2025

Quarto template for Chinese academic writing

Python 51 4 Updated May 24, 2025

Train transformer language models with reinforcement learning.

Python 13,890 1,906 Updated May 25, 2025

一款便捷的抢占显卡脚本

Cuda 331 38 Updated Jan 20, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 4,450 240 Updated May 25, 2025

An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl

TypeScript 5,624 695 Updated May 7, 2025

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 98,915 14,865 Updated May 25, 2025

LangChain 的中文入门教程

8,161 642 Updated Apr 19, 2025

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,406 578 Updated Oct 24, 2024
126 5 Updated May 8, 2025
Python 45 1 Updated Feb 19, 2024

Awesome list for LLM pruning.

228 9 Updated Dec 15, 2024

Official implementation for LaCo (EMNLP 2024 Findings)

Jupyter Notebook 16 4 Updated Oct 3, 2024

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 7,986 493 Updated May 18, 2025
Python 133 8 Updated Nov 3, 2023

Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"

Python 54 3 Updated Oct 1, 2024

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,998 118 Updated Jun 1, 2023

本项目用于大模型数学解题能力方面的数据集合成,模型训练及评测,相关文章记录。

Python 87 8 Updated Sep 14, 2024

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 919 57 Updated Mar 25, 2025

A paper list of some recent works about Token Compress for Vit and VLM

486 23 Updated May 25, 2025
36 1 Updated Jul 8, 2024

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Python 432 39 Updated Feb 1, 2024

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,409 729 Updated Aug 5, 2024

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 834 61 Updated Jul 10, 2024

刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.

Markdown 128,007 23,367 Updated Jan 31, 2025

Resources of deep learning for mathematical reasoning (DL4MATH).

357 28 Updated Dec 22, 2023
Next
0