High-Fidelity Face Swap via Prompt-Driven Inpainting and Pixel-Level Background Preservation (텍스트 프롬프트와 픽셀 수준 블렌딩을 활용한 배경 보존형 고충실도 얼굴 변환)

Moonsung Kang (강문성); Jihoon Lee (이지훈); Seungwon Jang (장승원); Suin Kim (김수인); Doheun Cha (차도흔); Sangtae Ahn (안상태)

@article{ART003349280},
author={Moonsung Kang and Jihoon Lee and Seungwon Jang and Suin Kim and Doheun Cha and Sangtae Ahn},
title={High-Fidelity Face Swap via Prompt-Driven Inpainting and Pixel-Level Background Preservation},
journal={Journal of The Korea Society of Computer and Information},
issn={1598-849X},
year={2026},
volume={31},
number={6},
pages={67-80}

TY - JOUR
AU - Moonsung Kang
AU - Jihoon Lee
AU - Seungwon Jang
AU - Suin Kim
AU - Doheun Cha
AU - Sangtae Ahn
TI - High-Fidelity Face Swap via Prompt-Driven Inpainting and Pixel-Level Background Preservation
JO - Journal of The Korea Society of Computer and Information
PY - 2026
VL - 31
IS - 6
PB - The Korean Society Of Computer And Information
SP - 67
EP - 80
SN - 1598-849X
AB - In this paper, we propose a novel pipeline that integrates mask-weighted loss and a face-aware text adapter to address the unrealistic painterly textures and unstable text guidance inherent in existing latent diffusion models during high-resolution face swapping. To restore fine facial details and photorealistic textures, we first employ a fine-tuning strategy for the U-Net using a mask-weighted loss. While this optimization enhances visual fidelity, it often leads to a degradation of semantic information or unintended background distortions. To mitigate these issues, we introduce a face-aware text adapter that dynamically calibrates the intensity of text embeddings based on the spatial proportions of the facial region, ensuring robust semantic control. Furthermore, to circumvent the inherent background information loss caused by the variational autoencoder reconstruction process, we implement a pixel-level blending strategy that directly integrates the generated face with the original background in the pixel space. Experimental results demonstrate that our proposed model significantly outperforms baseline methods across key metrics, including FID, PSNR, LPIPS, and PickScore, successfully achieving both high-quality, prompt-driven face synthesis and perfect background preservation.
KW - Face swap;Latent diffusion model;Mask-weighted Loss;Face segmentation;Background preservation
DO -
UR -
ER -

Moonsung Kang, Jihoon Lee, Seungwon Jang, Suin Kim, Doheun Cha and Sangtae Ahn. (2026). High-Fidelity Face Swap via Prompt-Driven Inpainting and Pixel-Level Background Preservation. Journal of The Korea Society of Computer and Information, 31(6), 67-80.

Moonsung Kang, Jihoon Lee, Seungwon Jang, Suin Kim, Doheun Cha and Sangtae Ahn. 2026, "High-Fidelity Face Swap via Prompt-Driven Inpainting and Pixel-Level Background Preservation", Journal of The Korea Society of Computer and Information, vol.31, no.6 pp.67-80.

Moonsung Kang, Jihoon Lee, Seungwon Jang, Suin Kim, Doheun Cha, Sangtae Ahn "High-Fidelity Face Swap via Prompt-Driven Inpainting and Pixel-Level Background Preservation" Journal of The Korea Society of Computer and Information 31.6 pp.67-80 (2026) : 67.

Moonsung Kang, Jihoon Lee, Seungwon Jang, Suin Kim, Doheun Cha, Sangtae Ahn. High-Fidelity Face Swap via Prompt-Driven Inpainting and Pixel-Level Background Preservation. 2026; 31(6), 67-80.

Moonsung Kang, Jihoon Lee, Seungwon Jang, Suin Kim, Doheun Cha and Sangtae Ahn. "High-Fidelity Face Swap via Prompt-Driven Inpainting and Pixel-Level Background Preservation" Journal of The Korea Society of Computer and Information 31, no.6 (2026) : 67-80.

Moonsung Kang; Jihoon Lee; Seungwon Jang; Suin Kim; Doheun Cha; Sangtae Ahn. High-Fidelity Face Swap via Prompt-Driven Inpainting and Pixel-Level Background Preservation. Journal of The Korea Society of Computer and Information, 31(6), 67-80.

Moonsung Kang; Jihoon Lee; Seungwon Jang; Suin Kim; Doheun Cha; Sangtae Ahn. High-Fidelity Face Swap via Prompt-Driven Inpainting and Pixel-Level Background Preservation. Journal of The Korea Society of Computer and Information. 2026; 31(6) 67-80.

Moonsung Kang, Jihoon Lee, Seungwon Jang, Suin Kim, Doheun Cha, Sangtae Ahn. High-Fidelity Face Swap via Prompt-Driven Inpainting and Pixel-Level Background Preservation. 2026; 31(6), 67-80.

Moonsung Kang, Jihoon Lee, Seungwon Jang, Suin Kim, Doheun Cha and Sangtae Ahn. "High-Fidelity Face Swap via Prompt-Driven Inpainting and Pixel-Level Background Preservation" Journal of The Korea Society of Computer and Information 31, no.6 (2026) : 67-80.

KJCKorea
Journal Central

Journal of The Korea Society of Computer and Information 2024 KCI Impact Factor : 0.81

High-Fidelity Face Swap via Prompt-Driven Inpainting and Pixel-Level Background Preservation

ABSTRACT

KEYWORDS

Citation status

* References for papers published after 2024 are currently being built.

Journal of The Korea Society of Computer and Information 2024 KCI Impact Factor : 0.81

High-Fidelity Face Swap via Prompt-Driven Inpainting and Pixel-Level Background Preservation

ABSTRACT

KEYWORDS

Statistics

Tools

Issue List

Citation status

KCI Citation Counts (0)

REFERENCES (0) * References for papers published after 2024 are currently being built.

Search PDF

Citation

* References for papers published after 2024 are currently being built.