High Quality Segmentation for Ultra High-resolution Images
This addresses a domain-specific challenge in computer vision for applications requiring high-resolution image segmentation, with incremental improvements over existing strategies.
The paper tackles the problem of segmenting ultra high-resolution images (4K or 6K) by proposing the Continuous Refinement Model (CRM) to balance accuracy and computation cost, showing it is fast and effective with quantitative performance evaluation.
To segment 4K or 6K ultra high-resolution images needs extra computation consideration in image segmentation. Common strategies, such as down-sampling, patch cropping, and cascade model, cannot address well the balance issue between accuracy and computation cost. Motivated by the fact that humans distinguish among objects continuously from coarse to precise levels, we propose the Continuous Refinement Model~(CRM) for the ultra high-resolution segmentation refinement task. CRM continuously aligns the feature map with the refinement target and aggregates features to reconstruct these images' details. Besides, our CRM shows its significant generalization ability to fill the resolution gap between low-resolution training images and ultra high-resolution testing ones. We present quantitative performance evaluation and visualization to show that our proposed method is fast and effective on image segmentation refinement. Code will be released at https://github.com/dvlab-research/Entity.