ICML2025
UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation
Qin Guo, Ailing Zeng, Dongxu Yue, Ceyuan Yang, Yang Cao, Hanzhong Guo, Fei Shen, Wei Liu, Xihui Liu, Dan Xu
Abstract
UNIMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation ness of UNIMC, particularly in heavy occlusions and multi-class scenarios. Project page can be found at this link. (A) Raw image (B) Most condition form (C) Our condition form Class name: … Keypoint: … Bounding box: … Keypoint binding confusion Class binding confusion (A) Raw image (B) Most condition form (C) Our condition form Keypoint binding confusion Class binding confusion ☹ 🤗