qinb

8 followers · 5 following

LIMR Public
Forked from GAIR-NLP/LIMR

Python Updated Feb 20, 2025
unlock-deepseek Public
Forked from datawhalechina/unlock-deepseek

Interpretation, extension, and reproduction of the DeepSeek series of work.

Python Updated Feb 15, 2025
verifiers Public
Forked from willccbb/verifiers

Verifiers for LLM Reinforcement Learning

Python Updated Feb 15, 2025
PRIME Public
Forked from PRIME-RL/PRIME

Scalable RL solution for the advanced reasoning of language models

Python Apache License 2.0 Updated Feb 14, 2025
MM-self-improve-qwen2vl Public
Forked from Liac-li/MM-self-improve-qwen2vl

Python Apache License 2.0 Updated Feb 14, 2025
openr Public
Forked from openreasoner/openr

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python MIT License Updated Feb 14, 2025
simpleRL-reason Public
Forked from hkust-nlp/simpleRL-reason

This is a replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python MIT License Updated Feb 7, 2025
OpenRLHF Public
Forked from OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python Apache License 2.0 Updated Jan 20, 2025
trl Public
Forked from huggingface/trl

Train transformer language models with reinforcement learning.

Python Apache License 2.0 Updated Nov 14, 2024
LLM-Dojo Public
Forked from mst272/LLM-Dojo

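Welcome to LLM-Dojo, an open-source learning space for large models. It uses concise, readable code to build a model training framework (supporting mainstream models such as Qwen, Llama, and GLM), an RLHF framework (DPO/CPO/KTO/PPO), and other features. 👩‍🎓👨‍🎓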

Python Updated Nov 8, 2024
LLaVA-KD Public
Forked from Fantasyele/LLaVA-KD

Python Updated Oct 25, 2024
LLaVA-MoD Public
Forked from shufangxun/LLaVA-MoD

Making LLaVA Tiny via MoE-Knowledge Distillation

Python Apache License 2.0 Updated Oct 24, 2024
Self-Correcting-LLM--Reinforcement-Learning- Public
Forked from sanowl/Self-Correcting-LLM--Reinforcement-Learning-

This is my attempt to create a self-correcting LLM based on the paper "Training Language Models to Self-Correct via Reinforcement Learning" by Google

Python Updated Oct 16, 2024
TinyLLaVA_Factory Public
Forked from TinyLLaVA/TinyLLaVA_Factory

A Framework of Small-scale Large Multimodal Models

Python Apache License 2.0 Updated Oct 16, 2024
MathCoder2 Public
Forked from mathllm/MathCoder2

Python Updated Oct 16, 2024
SuperCorrect-llm Public
Forked from YangLing0818/SuperCorrect-llm

SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights

Python Updated Oct 14, 2024
g1 Public
Forked from bklieger-groq/g1

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python MIT License Updated Oct 7, 2024
Google_SCoRe Public
Forked from daje0601/Google_SCoRe

Paper reproduction of Google SCoRe (Training Language Models to Self-Correct via Reinforcement Learning)

Jupyter Notebook Apache License 2.0 Updated Sep 21, 2024
GOT-OCR2.0 Public
Forked from Ucas-HaoranWei/GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python Updated Sep 19, 2024
FineZip Public
Forked from fazalmittu/FineZip

Python Updated Sep 9, 2024
DynamicPose Public
Forked from dynamic-X-LAB/DynamicPose

DynamicPose, a simple and robust framework for animating human images.

Python Updated Aug 29, 2024
PhotoPoster Public
Forked from dynamic-X-LAB/PhotoPoster

To support and further research in the field of portrait animation, we are excited to launch PhotoPoster, an open project for pose-driven image generation.

Python Updated Aug 26, 2024
facefusion Public
Forked from facefusion/facefusion

Next generation face swapper and enhancer

Python Other Updated Aug 12, 2024
florence2-finetuning Public
Forked from andimarafioti/florence2-finetuning

Quick exploration into fine-tuning Florence-2

Jupyter Notebook MIT License Updated Jul 31, 2024
Moore-AnimateAnyone Public
Forked from MooreThreads/Moore-AnimateAnyone

Python Apache License 2.0 Updated Jul 31, 2024
AniTalker Public
Forked from X-LANCE/AniTalker

[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"

Python Apache License 2.0 Updated Jul 30, 2024
LayTextLLM Public
Forked from LayTextLLM/LayTextLLM

Python Updated Jul 24, 2024
UniAnimate Public
Forked from ali-vilab/UniAnimate

Code for the paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".

Python Updated Jul 23, 2024
EchoMimic Public
Forked from antgroup/echomimic

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python Apache License 2.0 Updated Jul 9, 2024
Math-LLaVA Public
Forked from HZQ950419/Math-LLaVA

Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models

Python Apache License 2.0 Updated Jun 28, 2024
