CV · May 18, 2023

Ultra-High Resolution Segmentation with Ultra-Rich Context: A Novel Benchmark

arXiv:2305.10899v1 · 48 citations · Has Code
Originality: Synthesis-oriented
AI Analysis

The paper provides a comprehensive benchmark for computer vision researchers working on Ultra-High Resolution (UHR) segmentation, though its contribution is incremental, since it builds on existing datasets and methods.

The authors address the lack of a large-scale benchmark for Ultra-High Resolution segmentation by introducing the URUR dataset, with 3,008 high-resolution images and about 80 billion manually annotated pixels, and propose WSDNet, which achieves state-of-the-art performance on several UHR datasets.
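The pixel count can be sanity-checked from the figures quoted on this page. The sketch below assumes every one of the 3,008 images has the 5,120x5,120 size given in the abstract; the variable names are illustrative only.

```python
# Sanity check of the "about 80 billion pixels" figure.
# Assumption: all 3,008 images are exactly 5,120 x 5,120 pixels.
num_images = 3008
side = 5120

total_pixels = num_images * side * side
print(f"{total_pixels:,}")            # 78,852,915,200
print(round(total_pixels / 1e9, 1))   # 78.9 (billions)
```

Under that assumption the total is roughly 78.9 billion pixels, consistent with the rounded figure of about 80 billion.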

With the increasing interest and rapid development of methods for Ultra-High Resolution (UHR) segmentation, a large-scale benchmark covering a wide range of scenes with full fine-grained dense annotations is urgently needed to facilitate the field. To this end, the URUR dataset is introduced, whose name stands for Ultra-High Resolution dataset with Ultra-Rich Context. As the name suggests, URUR contains a large number of images with high enough resolution (3,008 images of size 5,120x5,120), a wide range of complex scenes (from 63 cities), rich-enough context (1 million instances with 8 categories) and fine-grained annotations (about 80 billion manually annotated pixels), which is far superior to all the existing UHR datasets including DeepGlobe, Inria Aerial, UDD, etc. Moreover, we also propose WSDNet, a more efficient and effective framework for UHR segmentation, especially with ultra-rich context. Specifically, a multi-level Discrete Wavelet Transform (DWT) is naturally integrated to reduce the computational burden while preserving more spatial details, along with a Wavelet Smooth Loss (WSL) to reconstruct the original structured context and texture with a smoothness constraint. Experiments on several UHR datasets demonstrate its state-of-the-art performance. The dataset is available at https://github.com/jankyee/URUR.
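To illustrate why a multi-level DWT can shrink the input without discarding spatial detail, here is a minimal sketch of a 2D Haar transform in plain Python. It is not WSDNet's implementation: the choice of the Haar wavelet, the list-of-lists image format, and all function names (`haar_dwt2`, `haar_idwt2`, `multilevel_haar_dwt2`, `multilevel_haar_idwt2`) are assumptions made for illustration, since the abstract does not specify them.

```python
def haar_dwt2(x):
    """One level of an orthonormal 2D Haar DWT (illustrative, not WSDNet code).

    x is a list of rows with even height and width. Returns
    (ll, (d_rows, d_cols, d_diag)); each band is half the size of x per axis.
    """
    h, w = len(x), len(x[0])
    if h % 2 or w % 2:
        raise ValueError("height and width must be even")
    out = [[], [], [], []]
    for i in range(0, h, 2):
        rows = [[], [], [], []]
        for j in range(0, w, 2):
            a, b = x[i][j], x[i][j + 1]          # top row of the 2x2 block
            c, d = x[i + 1][j], x[i + 1][j + 1]  # bottom row of the 2x2 block
            rows[0].append((a + b + c + d) / 2)  # low-frequency (scaled average)
            rows[1].append((a + b - c - d) / 2)  # top row minus bottom row
            rows[2].append((a - b + c - d) / 2)  # left column minus right column
            rows[3].append((a - b - c + d) / 2)  # diagonal difference
        for band, row in zip(out, rows):
            band.append(row)
    ll, d_rows, d_cols, d_diag = out
    return ll, (d_rows, d_cols, d_diag)


def haar_idwt2(ll, details):
    """Invert haar_dwt2, rebuilding each 2x2 block from its four coefficients."""
    d_rows, d_cols, d_diag = details
    h, w = len(ll), len(ll[0])
    x = [[0.0] * (2 * w) for _ in range(2 * h)]
    for i in range(h):
        for j in range(w):
            s, r, c, g = ll[i][j], d_rows[i][j], d_cols[i][j], d_diag[i][j]
            x[2 * i][2 * j] = (s + r + c + g) / 2
            x[2 * i][2 * j + 1] = (s + r - c - g) / 2
            x[2 * i + 1][2 * j] = (s - r + c - g) / 2
            x[2 * i + 1][2 * j + 1] = (s - r - c + g) / 2
    return x


def multilevel_haar_dwt2(x, levels):
    """Apply haar_dwt2 repeatedly to the low-frequency band."""
    ll, pyramid = x, []
    for _ in range(levels):
        ll, details = haar_dwt2(ll)
        pyramid.append(details)
    return ll, pyramid


def multilevel_haar_idwt2(ll, pyramid):
    """Undo multilevel_haar_dwt2, coarsest level first."""
    for details in reversed(pyramid):
        ll = haar_idwt2(ll, details)
    return ll


# Toy 8x8 "image" (hypothetical data, far smaller than a URUR tile).
image = [[float((3 * i + 5 * j) % 7) for j in range(8)] for i in range(8)]
ll, pyramid = multilevel_haar_dwt2(image, levels=2)
print(len(ll), len(ll[0]))  # 2 2

restored = multilevel_haar_idwt2(ll, pyramid)
max_err = max(abs(p - q) for r1, r2 in zip(restored, image) for p, q in zip(r1, r2))
print(max_err < 1e-12)  # True
```

Each level halves the side length of the low-frequency band, so a network operating on that band sees a 4x smaller input per level, while the detail bands keep enough information to reconstruct the original exactly. For a 5,120x5,120 image, two levels would give a 1,280x1,280 low-frequency band; how many levels WSDNet actually uses is not stated in the abstract.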

Code Implementations: 1 repo
