CVAug 18, 2020

Multiple View Generation and Classification of Mid-wave Infrared Images using Deep Learning

arXiv:2008.07714v13 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of blurry and distorted outputs in infrared image generation, which is incremental as it builds on existing deep learning methods for a specific domain.

The authors tackled the problem of generating unseen arbitrary viewpoints for mid-wave infrared images, achieving good 3D representations by learning semantic information in a Riemannian subspace rather than Euclidean transformations.

We propose a novel study of generating unseen arbitrary viewpoints for infrared imagery in the non-linear feature subspace . Current methods use synthetic images and often result in blurry and distorted outputs. Our approach on the contrary understands the semantic information in natural images and encapsulates it such that our predicted unseen views possess good 3D representations. We further explore the non-linear feature subspace and conclude that our network does not operate in the Euclidean subspace but rather in the Riemannian subspace. It does not learn the geometric transformation for predicting the position of the pixel in the new image but rather learns the manifold. To this end, we use t-SNE visualisations to conduct a detailed analysis of our network and perform classification of generated images as a low-shot learning task.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes