CVPR2023

Neural Congealing: Aligning Images to a Joint Semantic Atlas

Dolev Ofri-Amar, Michal Geyer, Yoni Kasten, Tali Dekel

摘要

Figure 1 . Given a set of input images, our method automatically detects and jointly aligns semantically-common content across the images. This is achieved through a test-time training approach that estimates a unified 2D atlas that represents the common semantic content, and dense mappings from the joint atlas to each of the input images. Our atlas and mappings are optimized per input set in a self-supervised manner by leveraging a pre-trained DINO-ViT model. Our method can be applied to diverse image sets, without requiring any additional training data, and allows us to automatically propagate an edit applied to a single image across the entire set.