CVJul 16, 2023

CalibNet: Dual-branch Cross-modal Calibration for RGB-D Salient Instance Segmentation

arXiv:2307.08098v227 citationsh-index: 112Has Code
AI Analysis

This addresses the problem of improving instance segmentation accuracy in RGB-D images for computer vision applications, representing an incremental advance with a new dataset.

The paper tackles RGB-D salient instance segmentation by proposing CalibNet, a dual-branch cross-modal calibration architecture that achieves 58.0% AP on the COME15K-N test set.

We propose a novel approach for RGB-D salient instance segmentation using a dual-branch cross-modal feature calibration architecture called CalibNet. Our method simultaneously calibrates depth and RGB features in the kernel and mask branches to generate instance-aware kernels and mask features. CalibNet consists of three simple modules, a dynamic interactive kernel (DIK) and a weight-sharing fusion (WSF), which work together to generate effective instance-aware kernels and integrate cross-modal features. To improve the quality of depth features, we incorporate a depth similarity assessment (DSA) module prior to DIK and WSF. In addition, we further contribute a new DSIS dataset, which contains 1,940 images with elaborate instance-level annotations. Extensive experiments on three challenging benchmarks show that CalibNet yields a promising result, i.e., 58.0% AP with 320*480 input size on the COME15K-N test set, which significantly surpasses the alternative frameworks. Our code and dataset are available at: https://github.com/PJLallen/CalibNet.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes