CVMar 3, 2021

ID-Unet: Iterative Soft and Hard Deformation for View Synthesis

arXiv:2103.02264v55 citations
Originality Incremental advance
AI Analysis

This work addresses view synthesis for computer vision applications, but it is incremental as it builds on existing autoencoder and Unet approaches with iterative deformation modules.

The paper tackles the problem of preserving source content while achieving view conformity in view synthesis by proposing ID-Unet, an architecture that iteratively deforms encoder features using soft and hard modules in a coarse-to-fine manner, resulting in improved performance on two datasets as shown in ablation studies.

View synthesis is usually done by an autoencoder, in which the encoder maps a source view image into a latent content code, and the decoder transforms it into a target view image according to the condition. However, the source contents are often not well kept in this setting, which leads to unnecessary changes during the view translation. Although adding skipped connections, like Unet, alleviates the problem, but it often causes the failure on the view conformity. This paper proposes a new architecture by performing the source-to-target deformation in an iterative way. Instead of simply incorporating the features from multiple layers of the encoder, we design soft and hard deformation modules, which warp the encoder features to the target view at different resolutions, and give results to the decoder to complement the details. Particularly, the current warping flow is not only used to align the feature of the same resolution, but also as an approximation to coarsely deform the high resolution feature. Then the residual flow is estimated and applied in the high resolution, so that the deformation is built up in the coarse-to-fine fashion. To better constrain the model, we synthesize a rough target view image based on the intermediate flows and their warped features. The extensive ablation studies and the final results on two different data sets show the effectiveness of the proposed model.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes