DAAM with mu #40
Hey, thanks. I may be wrong, as I'm not too familiar with the InstructPix2Pix architecture, but I think focusing on the cross-attention heads between the text-derived key embeddings and the usual latent embeddings could work. If the attention key vectors are instead a concatenation of text embeddings and, say, image embeddings, then you could look at cross-attention restricted to the text dimensions. If the text and image embeddings are inseparable (e.g., multimodal fusion), then that would likely fall outside the scope of DAAM/cross-attention and require a separate set of techniques.
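The idea above (restricting attention to the text portion of concatenated keys) might be sketched roughly as follows. This is a hypothetical illustration, not code from DAAM or InstructPix2Pix; all names and shapes here are made up for the example.

```python
import torch

def text_restricted_attention_map(queries, text_keys, image_keys):
    """Attend over concatenated [text; image] keys, then keep only the
    attention mass attributed to the text tokens.

    queries:    (B, n_latent, d)  latent query vectors
    text_keys:  (B, n_text, d)    keys from the text embeddings
    image_keys: (B, n_image, d)   keys from the image embeddings
    returns:    (B, n_latent, n_text) text-restricted attention map
    """
    keys = torch.cat([text_keys, image_keys], dim=1)    # (B, n_text + n_image, d)
    d = queries.shape[-1]
    scores = queries @ keys.transpose(1, 2) / d ** 0.5  # scaled dot-product scores
    attn = scores.softmax(dim=-1)                       # softmax over ALL key positions
    n_text = text_keys.shape[1]
    return attn[:, :, :n_text]                          # slice out the text dimensions

# Toy shapes: batch 1, 64 latent positions, 8 text tokens, 16 image tokens, dim 32.
q = torch.randn(1, 64, 32)
kt = torch.randn(1, 8, 32)
ki = torch.randn(1, 16, 32)
maps = text_restricted_attention_map(q, kt, ki)
print(maps.shape)  # torch.Size([1, 64, 8])
```

Note that the softmax is taken over all key positions before slicing, so each latent position's text-attention weights sum to less than one; the remainder is the mass assigned to the image tokens.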
@andreemic Please let me know if you were able to generate cross-attention maps for IP2P or ControlNet. I am trying to visualize cross-attention maps for the Stable Diffusion image-to-image pipeline and am running into the same errors.
Hey! Great job on this repo! Very clean documentation and a useful idea.