CVAIApr 12, 2016

Volumetric and Multi-View CNNs for Object Classification on 3D Data

arXiv:1604.03265v21645 citations
Originality Incremental advance
AI Analysis

This work addresses the gap in performance between volumetric and multi-view CNNs for 3D object classification, which is incremental as it builds upon existing methods to provide better understanding and improvements.

The paper tackled the problem of improving object classification on 3D data by enhancing both volumetric and multi-view CNN architectures, resulting in outperforming current state-of-the-art methods for both approaches.

3D shape models are becoming widely available and easier to capture, making available 3D information crucial for progress in object classification. Current state-of-the-art methods rely on CNNs to address this problem. Recently, we witness two types of CNNs being developed: CNNs based upon volumetric representations versus CNNs based upon multi-view representations. Empirical results from these two types of CNNs exhibit a large gap, indicating that existing volumetric CNN architectures and approaches are unable to fully exploit the power of 3D representations. In this paper, we aim to improve both volumetric CNNs and multi-view CNNs according to extensive analysis of existing approaches. To this end, we introduce two distinct network architectures of volumetric CNNs. In addition, we examine multi-view CNNs, where we introduce multi-resolution filtering in 3D. Overall, we are able to outperform current state-of-the-art methods for both volumetric CNNs and multi-view CNNs. We provide extensive experiments designed to evaluate underlying design choices, thus providing a better understanding of the space of methods available for object classification on 3D data.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes