CVAug 10, 2020

Locating Cephalometric X-Ray Landmarks with Foveated Pyramid Attention

arXiv:2008.04428v125 citations
Originality Incremental advance
AI Analysis

This addresses memory and computation bottlenecks for medical imaging tasks, offering a domain-specific solution.

The paper tackled the problem of training CNNs on very large images by proposing an image pyramid approach with narrow glimpses and spatialized features, achieving state-of-the-art results in Cephalometric X-ray Landmark Detection.

CNNs, initially inspired by human vision, differ in a key way: they sample uniformly, rather than with highest density in a focal point. For very large images, this makes training untenable, as the memory and computation required for activation maps scales quadratically with the side length of an image. We propose an image pyramid based approach that extracts narrow glimpses of the of the input image and iteratively refines them to accomplish regression tasks. To assist with high-accuracy regression, we introduce a novel intermediate representation we call 'spatialized features'. Our approach scales logarithmically with the side length, so it works with very large images. We apply our method to Cephalometric X-ray Landmark Detection and get state-of-the-art results.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes