azshue (Manli Shu) / Starred · GitHub

Organizations

@judy-vscode


My learning notes/codes for ML SYS.

Python 2,117 129 Updated May 9, 2025

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,901 132 Updated Oct 30, 2024

Solve puzzles. Improve your pytorch.

Jupyter Notebook 3,556 324 Updated Jul 15, 2024

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Jupyter Notebook 348 28 Updated Dec 15, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 7,864 895 Updated May 12, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,731 1,483 Updated Apr 24, 2025

Build multimodal language agents for fast prototype and production

Python 2,478 271 Updated Mar 19, 2025

AllenAI's post-training codebase

Python 2,950 382 Updated May 12, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)

Python 6,641 646 Updated May 12, 2025
Python 55 3 Updated Feb 23, 2025

An instruction data generation system for multimodal language models.

Jupyter Notebook 32 Updated Jan 31, 2025

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Python 698 41 Updated Apr 10, 2024

MINT-1T: A one trillion token multimodal interleaved dataset.

811 19 Updated Jul 31, 2024

[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

Python 142 3 Updated Aug 23, 2024

Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)

Python 72 8 Updated Apr 3, 2024

Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives

Python 68 5 Updated Feb 22, 2024

An open-source framework for training large multimodal models.

Python 3,910 303 Updated Aug 31, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,533 1,026 Updated Nov 18, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 4,532 341 Updated May 7, 2025

Consistency Distilled Diff VAE

Python 2,187 76 Updated Nov 7, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 41,170 6,818 Updated Dec 9, 2024

Ongoing research training transformer models at scale

Python 12,322 2,754 Updated May 10, 2025

Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning

Python 395 19 Updated May 17, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 38,288 4,360 Updated May 10, 2025

Denoising Diffusion Implicit Models

Python 1,626 215 Updated Jul 26, 2024

GLIDE: a diffusion-based text-conditional image synthesis model

Python 3,621 507 Updated Mar 8, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 38,545 4,702 Updated Apr 12, 2025

A framework for few-shot evaluation of language models.

Python 8,900 2,367 Updated May 10, 2025

The official repository of the paper "On the Exploitability of Instruction Tuning".

Python 62 7 Updated Feb 5, 2024