CVPR2025

MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model

Chenjie Cao, Chaohui Yu, Shang Liu, Fan Wang, Xiangyang Xue, Yanwei Fu

摘要

%& '() +,-./01+/2 3"/4 5/6505(0+7"-/ !3# $%& '() 52(01+/2 +,5/)8(."5+(, !9#$ %& "9)(** ")3+5)"): 5")-/5 ",4 )/'/)/,9/ 1+/2* !"# $%$ &%'()" (&+", !"#& '()" (&+", Figure 1. The proposed MVGenMaster handles various NVS scenarios properly as a master, including (a) NVS based on single-view text-to-image conditions, (b) interpolation between two known views, and (c) flexible NVS with variable reference views and arbitrary target views. MVGenMaster enables all tasks above with a single forward process without sophisticated iterative inference and dataset updating.