CVPR2021
Convolutional Hough Matching Networks
Juhong Min, Minsu Cho
Abstract
Despite advances in feature representation, leveraging geometric relations is crucial for establishing reliable visual correspondences under large variations of images. In this work we introduce a Hough transform perspective on convolutional matching and propose an effective geometric matching algorithm, dubbed Convolutional Hough Matching (CHM). The method distributes similarities of candidate matches over a geometric transformation space and evaluate them in a convolutional manner. We cast it into a trainable neural layer with a semi-isotropic high-dimensional kernel, which learns non-rigid matching with a small number of interpretable parameters. To validate the effect, we develop the neural network with CHM layers that perform convolutional matching in the space of translation and scaling. Our method sets a new state of the art on standard benchmarks for semantic visual correspondence, proving its strong robustness to challenging intra-class variations. Predicted keypoints on ๐ผโฒ แ ๐ค๐ โฒ ๐=1 ๐ 6D correlation ๐ (1) โ โ ๐ปร๐ร๐ร๐ปร๐ร๐ High-dimensional correlation computation Convolutional Hough Matching Keypoint transfer & Loss Conv (๐ 3 ) Conv (๐ 2 ) Conv (๐ 1 ) Copy & interpolate ๐ผ Copy & interpolate ๐ผโฒ Conv (๐ 3 ) Conv (๐ 2 ) Conv (๐ 1 ) 0 (๐ psi 6D ) CHM 0 (๐ psi 4D ) CHM Non-linearity + Maxpool + Upsample Predicted correlation ๐ โ โ เดฅ ๐ปร เดฅ ๐ร เดฅ ๐ปร เดฅ ๐ Flow formation Keypoint transfer Keypoints on ๐ผ ๐ค๐ ๐=1 ๐ Keypoints on ๐ผโฒ ๐ค๐ โฒ ๐=1 ๐ Training objective โ ๐ ๐ โฒ ๐ =1 ๐ ๐ ๐ ๐ =1 ๐ 0 Scale-space maxpool Upsample 4D Sigmoid