Salesforce Research
Palo Alto
https://azshue.github.io/

Stars
My learning notes and code for ML systems.
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Solve puzzles. Improve your PyTorch.
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
verl: Volcano Engine Reinforcement Learning for LLMs
Minimal reproduction of DeepSeek R1-Zero
Build multimodal language agents for fast prototype and production
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)
An instruction data generation system for multimodal language models.
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
MINT-1T: A one trillion token multimodal interleaved dataset.
[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)
Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives
An open-source framework for training large multimodal models.
LAVIS - A One-stop Library for Language-Vision Intelligence
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Ongoing research training transformer models at scale
Official repository of NEFTune: Noisy Embeddings Improve Instruction Finetuning
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
GLIDE: a diffusion-based text-conditional image synthesis model
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
A framework for few-shot evaluation of language models.
The official repository of the paper "On the Exploitability of Instruction Tuning".