LGCVJun 25, 2024

The Balanced-Pairwise-Affinities Feature Transform

arXiv:2407.01467v16 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses the need for better feature representations in computer vision tasks such as classification and clustering, though it appears incremental as it adapts existing Sinkhorn optimal transport methods in a novel way.

The paper tackles the problem of improving feature representations for matching or grouping tasks by introducing the Balanced-Pairwise-Affinities (BPA) feature transform, which encodes high-order relations between input features using an optimal transport optimization, and it demonstrates state-of-the-art results in tasks like few-shot classification, unsupervised image clustering, and person re-identification.

The Balanced-Pairwise-Affinities (BPA) feature transform is designed to upgrade the features of a set of input items to facilitate downstream matching or grouping related tasks. The transformed set encodes a rich representation of high order relations between the input features. A particular min-cost-max-flow fractional matching problem, whose entropy regularized version can be approximated by an optimal transport (OT) optimization, leads to a transform which is efficient, differentiable, equivariant, parameterless and probabilistically interpretable. While the Sinkhorn OT solver has been adapted extensively in many contexts, we use it differently by minimizing the cost between a set of features to $itself$ and using the transport plan's $rows$ as the new representation. Empirically, the transform is highly effective and flexible in its use and consistently improves networks it is inserted into, in a variety of tasks and training schemes. We demonstrate state-of-the-art results in few-shot classification, unsupervised image clustering and person re-identification. Code is available at \url{github.com/DanielShalam/BPA}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes