Optimization-Free Image Immunization Against Diffusion-Based Editing

1 University of Illinois at Urbana-Champaign, 2 University of Texas at Austin, 3 Boğaziçi University, 4 University of North Carolina at Chapel Hill
*Equal contribution

^Work done during an internship at UT Austin and UIUC
Immunization Examples

DiffVax is an optimization-free image immunization approach designed to protect images and videos from diffusion-based editing. DiffVax is robust across diverse content, protecting both (a) unseen in-the-wild images and (b) unseen video content, and it prevents edits across various editing methods, including inpainting (illustrated with a human foreground in the left column and a non-human foreground object in the right column) and instruction-based edits (right column).

Abstract

Current image immunization defense techniques against diffusion-based editing embed imperceptible noise in target images to disrupt editing models. However, these methods face scalability challenges, as they require time-consuming re-optimization for each image—taking hours for small batches. To address these challenges, we introduce DiffVax, a scalable, lightweight, and optimization-free framework for image immunization, specifically designed to prevent diffusion-based editing. Our approach enables effective generalization to unseen content, reducing computational costs and cutting immunization time from days to milliseconds—achieving a 250,000$\times$ speedup. This is achieved through a loss term that ensures the failure of editing attempts and the imperceptibility of the perturbations. Extensive qualitative and quantitative results demonstrate that our model is scalable, optimization-free, adaptable to various diffusion-based editing tools, robust against counter-attacks, and, for the first time, effectively protects video content from editing. Our code and qualitative results are provided in the supplementary.

Method

[Figure: DiffVax model overview]

Our process begins with the immunizer model $f(\cdot;\theta)$, which generates imperceptible noise $\epsilon_{im}$ to be applied to the original image $I$. This noise is applied to the masked region $M$ of the image, resulting in the immunized image $I_{im}$. The immunized image is then processed by a diffusion-based editing model $SD(\cdot)$ using a text prompt $P$ and the complementary mask $\sim M$ to edit the background of the original image. Training minimizes two loss terms, $\mathcal{L}_{noise}$ and $\mathcal{L}_{edit}$, which penalize the magnitude of the applied noise and the success of the edit, respectively. During training, the immunizer learns to generalize across diverse images, ensuring editing attempts fail while preserving visual fidelity. This end-to-end framework enables robust, scalable immunization against diffusion-based editing for both images and videos.
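The masking and two-term objective described above can be sketched as follows. This is a minimal NumPy illustration, not the authors' implementation: the function names (`immunize`, `diffvax_loss`), the noise budget `eps`, the weighting `lam`, and the use of a mean-squared "failure target" for $\mathcal{L}_{edit}$ are all assumptions made for illustration; the paper does not specify these exact forms here.

```python
import numpy as np

def immunize(image, mask, immunizer, eps=8 / 255):
    """Add immunizer noise only inside the masked (foreground) region.

    `immunizer` stands in for the trained network f(.; theta); here it is
    any callable mapping an image to a perturbation of the same shape.
    The perturbation is clipped to +/- eps to keep it imperceptible.
    """
    noise = np.clip(immunizer(image), -eps, eps)
    return image + mask * noise, noise

def diffvax_loss(edited, failure_target, noise, lam=1.0):
    """Combined training objective (illustrative form).

    L_edit pulls the diffusion model's edited output toward a chosen
    failure target (i.e., the edit should not succeed); L_noise penalizes
    the perturbation magnitude for imperceptibility.
    """
    l_edit = np.mean((edited - failure_target) ** 2)
    l_noise = np.mean(noise ** 2)
    return l_edit + lam * l_noise

# Toy usage: foreground mask covers the top half of a 4x4 RGB image.
image = np.zeros((4, 4, 3))
mask = np.zeros_like(image)
mask[:2] = 1.0
immunized, noise = immunize(image, mask, lambda x: np.full_like(x, 0.5))
# Pixels outside the mask are untouched; masked pixels shift by at most eps.
```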

DiffVax Immunization Results

Prompt                                     | Original Image | Edited Image | Edited Immunized Image
"Person in a prison"                       | [image]        | [image]      | [image]
"Geoffrey Hinton at a political protest"   | [image]        | [image]      | [image]
"An eagle sitting on a table in a library" | [image]        | [image]      | [image]
"add sunglasses"                           | [image]        | [image]      | [image]

Prompt    | Original Video | Edited Video | Immunized Edited Video
"in snow" | [video]        | [video]      | [video]

Comparisons

Prompt              | Original Image | Edited Image | Random Noise Edited Image | PhotoGuard-E [1] | PhotoGuard-D [1] | DiffVax (Ours)
"in a betting shop" | [image]        | [image]      | [image]                   | [image]          | [image]          | [image]

[1] Hadi Salman, Alaa Khaddaj, Guillaume Leclerc, Andrew Ilyas, and Aleksander Madry. Raising the Cost of Malicious AI-Powered Image Editing. In International Conference on Machine Learning (ICML), pages 29894--29918, 2023.