Understanding R1-Zero-Like Training: A Critical Perspective
V1: Toward Multimodal Reasoning by Designing Auxiliary Task
A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior.
AnchorAttention: Improved attention for long-context training of LLMs
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
Official PyTorch implementation for ICLR 2025 paper "Scaling up Masked Diffusion Models on Text"
[ICLR 2025] A Closer Look at Machine Unlearning for Large Language Models
[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
The official implementation of the paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.
[arXiv 2024] Denial-of-Service Poisoning Attacks on Large Language Models
[ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)
[NeurIPS 2024] The official implementation of the paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.
Official implementation of Bootstrapping Language Models via DPO Implicit Rewards
Improved techniques for optimization-based jailbreaking on large language models (ICLR 2025)
Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses (NeurIPS 2024)
[ICML 2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Code for the paper: Finetuning Text-to-Image Diffusion Models for Fairness
Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)
[TMLR 2025] On Memorization in Diffusion Models
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
Official code for "On Calibrating Diffusion Probabilistic Models"
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)
Flax is a neural network library for JAX that is designed for flexibility.