CVPR 2025

SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models

Jaerin Lee, Daniel Sungho Jung, Kanggeon Lee, Kyoung Mu Lee

Abstract

Figure 1. Overview. SEMANTICDRAW is a sub-second (0.64 seconds) solution for region-based text-to-image generation. This streaming architecture enables an interactive application framework, dubbed semantic palette, in which images are generated near-instantly from online user commands given as hand-drawn semantic masks.