CVPR2025

ShapeWords: Guiding Text-to-Image Synthesis with 3D Shape-Aware Prompts

Dmitry Petrov, Pradyumn Goyal, Divyansh Shivashok, Yuanming Tao, Melinos Averkiou, Evangelos Kalogerakis

Abstract

4 TU Crete Figure 1 . ShapeWords enables 3D shape-aware text-to-image generation via mapping of shape geometries into CLIP space. Given an input 3D shape and text prompts describing desired appearance and context, our method generates images that maintain both shape fidelity and text compliance. Unlike existing methods that use view-dependent guidance like depth maps, ShapeWords generalizes to compositional settings (top row) and allows for exploration of target geometries with stylistic deviations aligned with the prompt (bottom row).