UR2KiD: Unifying Retrieval, Keypoint Detection, and Keypoint Description without Local Correspondence Supervision

Oct 29, 2021

UR2KiD: Unifying Retrieval, Keypoint Detection, and Keypoint Description without Local Correspondence Supervision

Contents

Introduction Proposed Method Experiment Conclusions

Introduction

이 논문에서는 keypoint detection, description, 그리고 image retrieval까지 한번에 해결할 수 있는 framework을 제안합니다. 특히, 기존의 local matching 알고리즘은 pointwise/pixelwise 매칭 ground-truth가 주어져야 학습이 가능했지만 제안하는 알고리즘은 image pair만 가지고도 학습할 수 있습니다.

Proposed Method

Local Keypoint and description

이미지로부터 feature를 추출하는 과정에는 ImageNet에 pretrain된 ResNet-101을 사용했습니다.

이 네트워크에서 나온 feature로부터 matching affinity matrix를 계산합니다.

a는 query image, p는 매칭이 되는 positive sample을 의미합니다.

이 matrix의 column, row별 최댓값을 사용하여 average score를 계산합니다.

Score를 사용하여 margin loss를 계산합니다.

Matching loss만 사용하면 low-dimensional descriptor는 high-dimensional descriptor의 정보를 가지지 못합니다. High-dimension 정보를 low-dimension으로 흘려보내기 위하여 distilling 방법을 제안합니다.