CVPR 2024
GenTron: Diffusion Transformers for Image and Video Generation
Shoufa Chen, Mengmeng Xu, Jiawei Ren, Yuren Cong, Sen He, Yanping Xie, Animesh Sinha, Ping Luo, Tao Xiang, Juan-Manuel Pérez-Rúa
Abstract
Figure 1. GenTron: Transformer-based diffusion model for high-quality text-to-image/video generation.