LGJul 9, 2024

Multiple Instance Verification

arXiv:2407.06544v22 citationsh-index: 14Has Code
Originality Incremental advance
AI Analysis

This work addresses a verification challenge in machine learning for applications like medical imaging or security, but it is incremental as it builds on existing attention-based and Siamese network methods.

The paper tackles the problem of multiple instance verification, where a query instance is verified against a bag of target instances with unknown relevancy, and introduces cross-attention pooling (CAP) to outperform state-of-the-art methods by substantial margins in classification accuracy and key instance detection across three verification tasks.

We explore multiple instance verification, a problem setting in which a query instance is verified against a bag of target instances with heterogeneous, unknown relevancy. We show that naive adaptations of attention-based multiple instance learning (MIL) methods and standard verification methods like Siamese neural networks are unsuitable for this setting: directly combining state-of-the-art (SOTA) MIL methods and Siamese networks is shown to be no better, and sometimes significantly worse, than a simple baseline model. Postulating that this may be caused by the failure of the representation of the target bag to incorporate the query instance, we introduce a new pooling approach named "cross-attention pooling" (CAP). Under the CAP framework, we propose two novel attention functions to address the challenge of distinguishing between highly similar instances in a target bag. Through empirical studies on three different verification tasks, we demonstrate that CAP outperforms adaptations of SOTA MIL methods and the baseline by substantial margins, in terms of both classification accuracy and the ability to detect key instances. The superior ability to identify key instances is attributed to the new attention functions by ablation studies. We share our code at https://github.com/xxweka/MIV.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes