Unified Safe In-context Image Generation in Multimodal Diffusion Transformers via Restricting Unsafe Information Flows

2026-06-05 · Source: Computer Vision and Pattern Recognition · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cybersecurity & Data Privacy · Depth: Expert, quick

Summary

Unified Visual Safety Regulator (UVR) is a novel, training-free framework designed to enhance safety in multimodal Diffusion Transformers (DiTs) for image generation, particularly addressing limitations of existing mechanisms in image-to-image (I2I) editing tasks. Current safety solutions are often tailored for text-to-image (T2I) synthesis or U-Net architectures, proving ineffective for DiT-based systems. UVR operates by analyzing attention dynamics and information flow within multimodal attention (MM-Attn), identifying a critical "task-independent start-up stage" where unsafe semantics first appear. It then applies unified, targeted attention modulation and explicitly restricts harmful information flow over these identified unsafe output patches. Experimental results demonstrate UVR's state-of-the-art safety performance, achieving 91% and 77% erase rates in image synthesis and editing tasks, respectively, while maintaining visual quality.

Key takeaway

For Machine Learning Engineers developing multimodal Diffusion Transformers, UVR offers a training-free solution to mitigate unsafe content generation, especially in image-to-image editing. You should consider integrating UVR to achieve high erase rates (91% in synthesis, 77% in editing) without significant visual quality degradation. This approach provides a unified safety mechanism where existing T2I or U-Net-based methods fall short.

Key insights

UVR restricts unsafe information flow in DiT multimodal attention during an early "start-up stage" to prevent harmful image generation.

Principles

Unsafe semantics emerge early in DiT attention dynamics.
Harmful signals propagate and entangle with benign content.
Targeted attention modulation can restrict unsafe flow.

Method

UVR analyzes MM-Attn information flow to identify unsafe output patches during a "start-up stage," then applies unified, targeted attention modulation to explicitly restrict harmful information flow.

In practice

Apply UVR to DiT-based image synthesis.
Use UVR for safer I2I editing tasks.
Integrate UVR into existing DiT frameworks.

Topics

Diffusion Transformers
Multimodal Attention
Image Generation Safety
Image-to-Image Editing
Unsafe Content Mitigation
Attention Modulation

Code references

deng12yx/UVR

Best for: Computer Vision Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, AI Security Engineer

Related on AIssential

See Counsel's argued verdicts on the open AI decisions leaders are weighing →

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.