CVPR2025

Plug-and-Play Interpretable Responsible Text-to-Image Generation via Dual-Space Multi-facet Concept Control

Basim Azam, Naveed Akhtar

Abstract

Figure 1 . Our unique plug-and-play interpretable approach simultaneously controls a range of concepts for responsible and fair image generation with text-to-image pipelines. Our method enables control over both text embedding space and latent diffusion space. Shown examples compare unfair and unsafe generation for the given prompts by the Stable Diffusion (top), along with their responsible counterparts resulting from our approach influencing the text encoder (middle) and the diffusion model (bottom). We control individual concepts for diverse/safe generation in these examples while modeling them in continuous composite responsible semantic spaces.