CVAINov 26, 2018

Unsupervised learning with sparse space-and-time autoencoders

arXiv:1811.10355v11 citations
Originality Synthesis-oriented
AI Analysis

This work addresses unsupervised learning for sparse data in computer vision and motion capture, but appears incremental as it applies known autoencoder methods to new dimensional contexts.

The paper tackled modeling sparse structures in 2D, 3D, and 4D space-time using spatially-sparse convolutional autoencoders, and evaluated their latent spaces on downstream tasks like handwriting recognition and segmentation, achieving unspecified results without concrete numbers.

We use spatially-sparse two, three and four dimensional convolutional autoencoder networks to model sparse structures in 2D space, 3D space, and 3+1=4 dimensional space-time. We evaluate the resulting latent spaces by testing their usefulness for downstream tasks. Applications are to handwriting recognition in 2D, segmentation for parts in 3D objects, segmentation for objects in 3D scenes, and body-part segmentation for 4D wire-frame models generated from motion capture data.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes