Computer Science > Computer Vision and Pattern Recognition

arXiv:2212.06458 (cs)

[Submitted on 13 Dec 2022 (v1), last revised 3 Aug 2023 (this version, v3)]

Title:HS-Diffusion: Semantic-Mixing Diffusion for Head Swapping

Authors:Qinghe Wang, Lijie Liu, Miao Hua, Pengfei Zhu, Wangmeng Zuo, Qinghua Hu, Huchuan Lu, Bing Cao

View PDF

Abstract:Image-based head swapping task aims to stitch a source head to another source body flawlessly. This seldom-studied task faces two major challenges: 1) Preserving the head and body from various sources while generating a seamless transition region. 2) No paired head swapping dataset and benchmark so far. In this paper, we propose a semantic-mixing diffusion model for head swapping (HS-Diffusion) which consists of a latent diffusion model (LDM) and a semantic layout generator. We blend the semantic layouts of source head and source body, and then inpaint the transition region by the semantic layout generator, achieving a coarse-grained head swapping. Semantic-mixing LDM can further implement a fine-grained head swapping with the inpainted layout as condition by a progressive fusion process, while preserving head and body with high-quality reconstruction. To this end, we propose a semantic calibration strategy for natural inpainting and a neck alignment for geometric realism. Importantly, we construct a new image-based head swapping benchmark and design two tailor-designed metrics (Mask-FID and Focal-FID). Extensive experiments demonstrate the superiority of our framework. The code will be available: this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2212.06458 [cs.CV]
	(or arXiv:2212.06458v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2212.06458

Submission history

From: Qinghe Wang [view email]
[v1] Tue, 13 Dec 2022 10:04:01 UTC (15,133 KB)
[v2] Thu, 30 Mar 2023 11:38:34 UTC (12,671 KB)
[v3] Thu, 3 Aug 2023 07:32:30 UTC (12,392 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:HS-Diffusion: Semantic-Mixing Diffusion for Head Swapping

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:HS-Diffusion: Semantic-Mixing Diffusion for Head Swapping

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators