DCTM: Discrete-Continuous Transformation Matching for Semantic Flow
This addresses a key limitation in computer vision for applications like image alignment and object recognition, though it appears incremental as it builds on existing methods for geometric variations.
The paper tackles the problem of dense semantic correspondence under complex affine deformations by proposing a discrete-continuous transformation matching framework, which outperforms state-of-the-art methods on various benchmarks.
Techniques for dense semantic correspondence have provided limited ability to deal with the geometric variations that commonly exist between semantically similar images. While variations due to scale and rotation have been examined, there lack practical solutions for more complex deformations such as affine transformations because of the tremendous size of the associated solution space. To address this problem, we present a discrete-continuous transformation matching (DCTM) framework where dense affine transformation fields are inferred through a discrete label optimization in which the labels are iteratively updated via continuous regularization. In this way, our approach draws solutions from the continuous space of affine transformations in a manner that can be computed efficiently through constant-time edge-aware filtering and a proposed affine-varying CNN-based descriptor. Experimental results show that this model outperforms the state-of-the-art methods for dense semantic correspondence on various benchmarks.