CVAIGRApr 10, 2019

Predicting Novel Views Using Generative Adversarial Query Network

arXiv:1904.05124v14 citations
Originality Incremental advance
AI Analysis

This work addresses a challenging computer vision problem for applications like robotics and virtual reality, but it is incremental as it builds on existing GQN and GAN frameworks.

The paper tackles the problem of novel view synthesis from arbitrary observations by introducing the Generative Adversarial Query Network (GAQN), which combines GQN and GANs to improve visual quality and convergence speed, resulting in high-quality outputs and faster training compared to conventional methods.

The problem of predicting a novel view of the scene using an arbitrary number of observations is a challenging problem for computers as well as for humans. This paper introduces the Generative Adversarial Query Network (GAQN), a general learning framework for novel view synthesis that combines Generative Query Network (GQN) and Generative Adversarial Networks (GANs). The conventional GQN encodes input views into a latent representation that is used to generate a new view through a recurrent variational decoder. The proposed GAQN builds on this work by adding two novel aspects: First, we extend the current GQN architecture with an adversarial loss function for improving the visual quality and convergence speed. Second, we introduce a feature-matching loss function for stabilizing the training procedure. The experiments demonstrate that GAQN is able to produce high-quality results and faster convergence compared to the conventional approach.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes