CVJul 12, 2025

360-Degree Full-view Image Segmentation by Spherical Convolution compatible with Large-scale Planar Pre-trained Models

arXiv:2507.09216v1h-index: 72025 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)
Originality Synthesis-oriented
AI Analysis

This addresses the challenge of applying standard image models to panoramic images with distortions, though it appears incremental as it adapts existing methods rather than creating new ones.

The paper tackles the problem of panoramic image segmentation by introducing a spherical sampling method that enables direct use of existing 2D pre-trained models, achieving commendable results on the Stanford2D3D indoor dataset.

Due to the current lack of large-scale datasets at the million-scale level, tasks involving panoramic images predominantly rely on existing two-dimensional pre-trained image benchmark models as backbone networks. However, these networks are not equipped to recognize the distortions and discontinuities inherent in panoramic images, which adversely affects their performance in such tasks. In this paper, we introduce a novel spherical sampling method for panoramic images that enables the direct utilization of existing pre-trained models developed for two-dimensional images. Our method employs spherical discrete sampling based on the weights of the pre-trained models, effectively mitigating distortions while achieving favorable initial training values. Additionally, we apply the proposed sampling method to panoramic image segmentation, utilizing features obtained from the spherical model as masks for specific channel attentions, which yields commendable results on commonly used indoor datasets, Stanford2D3D.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes