CVPR2025

MET3R: Measuring Multi-View Consistency in Generated Images

Mohammad Asim, Christopher Wewer, Thomas Wimmer, Bernt Schiele, Jan Eric Lenssen

Abstract

Figure 1 . We introduce MEt3R, a metric for multi-view consistency between pairs of generated images, which is independent of image quality and content and does not require camera poses. Left: generated images from different generative models, conditioned on the first frame, with MEt3R score map (Cons. Error) indicating levels of inconsistencies between consecutive images i and i + 1. Right: pair-wise consistency scores, evaluated for consecutive frames in a sliding window, averaged over multiple sequences. The pattern in MV-LDM's consistency clearly shows artifacts from using anchor frames that are generated first, highlighting the high signal-to-noise ratio of MEt3R.