CVPR2020

Select, Supplement and Focus for RGB-D Saliency Detection

Miao Zhang, Weisong Ren, Yongri Piao, Zhengkun Rong, Huchuan Lu

Abstract

Depth data containing a preponderance of discriminative power in location have been proven beneficial for accurate saliency prediction. However, RGB-D saliency detection methods are also negatively influenced by randomly distributed erroneous or missing regions on the depth map or along the object boundaries. This offers the possibility of achieving more effective inference by well designed models. In this paper, we propose a new framework for accurate RGB-D saliency detection taking account of global location and local detail complementarities from two modalities. This is achieved by designing a complimentary interaction module (CIM) to discriminatively select useful representation from the RGB and depth data, and effectively integrate cross-modal features. Benefiting from the proposed CIM, the fused features can accurately locate salient objects with fine edge details. Moreover, we propose a compensationaware loss to improve the network's confidence in detecting hard samples. Comprehensive experiments on six public datasets demonstrate that our method outperforms 18 stateof-the-art methods. * Equal Contributions † Corresponding Author Image depth map DMRA CPFP OURS GT 深度图 * CTMF Tcyb17 .