CVOct 6, 2022

Neural Matching Fields: Implicit Representation of Matching Fields for Visual Correspondence

NVIDIAU of Toronto
arXiv:2210.02689v133 citationsh-index: 37
Originality Incremental advance
AI Analysis

This addresses a bottleneck in visual correspondence for computer vision applications, but it is incremental as it builds on existing implicit representation and PatchMatch techniques.

The paper tackles the problem of low-resolution matching fields in semantic correspondence pipelines by introducing Neural Matching Field (NeMF), which uses implicit neural representation to establish high-precision correspondences, achieving competitive results on standard benchmarks.

Existing pipelines of semantic correspondence commonly include extracting high-level semantic features for the invariance against intra-class variations and background clutters. This architecture, however, inevitably results in a low-resolution matching field that additionally requires an ad-hoc interpolation process as a post-processing for converting it into a high-resolution one, certainly limiting the overall performance of matching results. To overcome this, inspired by recent success of implicit neural representation, we present a novel method for semantic correspondence, called Neural Matching Field (NeMF). However, complicacy and high-dimensionality of a 4D matching field are the major hindrances, which we propose a cost embedding network to process a coarse cost volume to use as a guidance for establishing high-precision matching field through the following fully-connected network. Nevertheless, learning a high-dimensional matching field remains challenging mainly due to computational complexity, since a naive exhaustive inference would require querying from all pixels in the 4D space to infer pixel-wise correspondences. To overcome this, we propose adequate training and inference procedures, which in the training phase, we randomly sample matching candidates and in the inference phase, we iteratively performs PatchMatch-based inference and coordinate optimization at test time. With these combined, competitive results are attained on several standard benchmarks for semantic correspondence. Code and pre-trained weights are available at https://ku-cvlab.github.io/NeMF/.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes