[ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception

University of Illinois at Urbana Champaign

Paper Link: Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception

Introduction

We envision generative models an important endeavor to develop unified vision models: all the tasks are modeled as pixel generation. In this paper, we analyze how to bridge the gaps between conventional diffusion models designed for generation and discriminative perception tasks. There are three main perspectives:

Uneven distribution of diffusion steps: How to reflect this in the training process?
Training-denoising distribution shift: How to simulate such drifts for training?
Interactivity: How to leverage classifier-free guidance and make diffusion models as agentic & interactive perception models?

These issues are analyzed as the following figure, where we use "image editing" as a unified interface for referring image segmentation:

Our solutions are simple changes to the by-default strategies in diffusion models, which are shown to be effective in our experiments.

Instructions

Code coming soon.

Citations

If you find this work useful for your research, please consider citing:

@inproceedings{pang2025aligning,
  title={Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception},
  author={Pang, Ziqi and Xu, Xin and Wang, Yu-Xiong},
  booktitle={International Conference on Learning Representations},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
assets		assets
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

[ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception

Introduction

Instructions

Citations

About

Uh oh!

Releases

Packages

Uh oh!

License

ziqipang/ADDP

Folders and files

Latest commit

History

Repository files navigation

[ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception

Introduction

Instructions

Citations

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Packages