CVApr 7, 2020

Feature Pyramid Grids

arXiv:2004.03580v160 citations
AI Analysis

This work addresses scale handling in object detection, offering a uniform and effective alternative to complex neural architecture search methods, though it is incremental in nature.

The paper tackles the problem of scale variation in object detection by introducing Feature Pyramid Grids (FPG), a deep multi-pathway feature pyramid that improves performance over single-pathway networks at similar computational cost, with significant gains reported.

Feature pyramid networks have been widely adopted in the object detection literature to improve feature representations for better handling of variations in scale. In this paper, we present Feature Pyramid Grids (FPG), a deep multi-pathway feature pyramid, that represents the feature scale-space as a regular grid of parallel bottom-up pathways which are fused by multi-directional lateral connections. FPG can improve single-pathway feature pyramid networks by significantly increasing its performance at similar computation cost, highlighting importance of deep pyramid representations. In addition to its general and uniform structure, over complicated structures that have been found with neural architecture search, it also compares favorably against such approaches without relying on search. We hope that FPG with its uniform and effective nature can serve as a strong component for future work in object recognition.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes