CVPR2021

Convolutional Hough Matching Networks

Juhong Min, Minsu Cho

摘要

Despite advances in feature representation, leveraging geometric relations is crucial for establishing reliable visual correspondences under large variations of images. In this work we introduce a Hough transform perspective on convolutional matching and propose an effective geometric matching algorithm, dubbed Convolutional Hough Matching (CHM). The method distributes similarities of candidate matches over a geometric transformation space and evaluate them in a convolutional manner. We cast it into a trainable neural layer with a semi-isotropic high-dimensional kernel, which learns non-rigid matching with a small number of interpretable parameters. To validate the effect, we develop the neural network with CHM layers that perform convolutional matching in the space of translation and scaling. Our method sets a new state of the art on standard benchmarks for semantic visual correspondence, proving its strong robustness to challenging intra-class variations. Predicted keypoints on 𝐼′ መ 𝐤𝑚 ′ 𝑚=1 𝑀 6D correlation 𝐂 (1) ∈ ℝ 𝐻×𝑊×𝑆×𝐻×𝑊×𝑆 High-dimensional correlation computation Convolutional Hough Matching Keypoint transfer & Loss Conv (𝜃 3 ) Conv (𝜃 2 ) Conv (𝜃 1 ) Copy & interpolate 𝐼 Copy & interpolate 𝐼′ Conv (𝜃 3 ) Conv (𝜃 2 ) Conv (𝜃 1 ) 0 (𝑘 psi 6D ) CHM 0 (𝑘 psi 4D ) CHM Non-linearity + Maxpool + Upsample Predicted correlation 𝐂 ∈ ℝ ഥ 𝐻× ഥ 𝑊× ഥ 𝐻× ഥ 𝑊 Flow formation Keypoint transfer Keypoints on 𝐼 𝐤𝑚 𝑚=1 𝑀 Keypoints on 𝐼′ 𝐤𝑚 ′ 𝑚=1 𝑀 Training objective ℒ 𝐅𝑠 ′ 𝑠=1 𝑆 𝐅𝑠 𝑠=1 𝑆 0 Scale-space maxpool Upsample 4D Sigmoid