CVAug 29, 2023

Learning to Upsample by Learning to Sample

arXiv:2308.15085v1535 citationsh-index: 36Has Code
Originality Highly original
AI Analysis

This addresses the problem of high computational workload in upsampling for computer vision researchers and practitioners, offering a more efficient alternative to existing methods.

The paper tackles the computational inefficiency of kernel-based dynamic upsamplers by introducing DySample, a lightweight upsampler that uses point sampling instead of dynamic convolution, achieving superior performance across five dense prediction tasks with fewer parameters, FLOPs, and latency.

We present DySample, an ultra-lightweight and effective dynamic upsampler. While impressive performance gains have been witnessed from recent kernel-based dynamic upsamplers such as CARAFE, FADE, and SAPA, they introduce much workload, mostly due to the time-consuming dynamic convolution and the additional sub-network used to generate dynamic kernels. Further, the need for high-res feature guidance of FADE and SAPA somehow limits their application scenarios. To address these concerns, we bypass dynamic convolution and formulate upsampling from the perspective of point sampling, which is more resource-efficient and can be easily implemented with the standard built-in function in PyTorch. We first showcase a naive design, and then demonstrate how to strengthen its upsampling behavior step by step towards our new upsampler, DySample. Compared with former kernel-based dynamic upsamplers, DySample requires no customized CUDA package and has much fewer parameters, FLOPs, GPU memory, and latency. Besides the light-weight characteristics, DySample outperforms other upsamplers across five dense prediction tasks, including semantic segmentation, object detection, instance segmentation, panoptic segmentation, and monocular depth estimation. Code is available at https://github.com/tiny-smart/dysample.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes