CVPR2025
Plug-and-Play Interpretable Responsible Text-to-Image Generation via Dual-Space Multi-facet Concept Control
Basim Azam, Naveed Akhtar
Abstract
Figure 1 . Our unique plug-and-play interpretable approach simultaneously controls a range of concepts for responsible and fair image generation with text-to-image pipelines. Our method enables control over both text embedding space and latent diffusion space. Shown examples compare unfair and unsafe generation for the given prompts by the Stable Diffusion (top), along with their responsible counterparts resulting from our approach influencing the text encoder (middle) and the diffusion model (bottom). We control individual concepts for diverse/safe generation in these examples while modeling them in continuous composite responsible semantic spaces.