CVAILGApr 15, 2024

Adaptive Patching for High-resolution Image Segmentation with Transformers

arXiv:2404.09707v17 citationsh-index: 21Int Conf High Perform Comput Netw Storage Anal
Originality Highly original
AI Analysis

This addresses the problem of scalability for researchers and practitioners using transformers on high-resolution images like pathology data, offering a plug-and-play solution with significant performance gains.

The paper tackles the high computational cost of attention-based models for high-resolution image segmentation by introducing adaptive patching as a pre-processing step, which reduces the number of patches by orders of magnitude, resulting in a 6.9x geomean speedup and superior segmentation quality on pathology datasets.

Attention-based models are proliferating in the space of image analytics, including segmentation. The standard method of feeding images to transformer encoders is to divide the images into patches and then feed the patches to the model as a linear sequence of tokens. For high-resolution images, e.g. microscopic pathology images, the quadratic compute and memory cost prohibits the use of an attention-based model, if we are to use smaller patch sizes that are favorable in segmentation. The solution is to either use custom complex multi-resolution models or approximate attention schemes. We take inspiration from Adapative Mesh Refinement (AMR) methods in HPC by adaptively patching the images, as a pre-processing step, based on the image details to reduce the number of patches being fed to the model, by orders of magnitude. This method has a negligible overhead, and works seamlessly with any attention-based model, i.e. it is a pre-processing step that can be adopted by any attention-based model without friction. We demonstrate superior segmentation quality over SoTA segmentation models for real-world pathology datasets while gaining a geomean speedup of $6.9\times$ for resolutions up to $64K^2$, on up to $2,048$ GPUs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes