CVNov 22, 2023

Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models

Mengyang Feng, Jinlin Liu, Miaomiao Cui, Xuansong Xie

arXiv:2311.13141v122.063 citationsh-index: 15Has Code

Originality Synthesis-oriented

AI Analysis

This work addresses a domain-specific challenge in panoramic image generation for applications like virtual reality, but it is incremental as it adapts existing diffusion models.

The paper tackles the problem of generating seamless 360-degree panoramic images using diffusion models, proposing a circular blending strategy that achieves state-of-the-art performance with improved geometry continuity.

This is a technical report on the 360-degree panoramic image generation task based on diffusion models. Unlike ordinary 2D images, 360-degree panoramic images capture the entire $360^\circ\times 180^\circ$ field of view. So the rightmost and the leftmost sides of the 360 panoramic image should be continued, which is the main challenge in this field. However, the current diffusion pipeline is not appropriate for generating such a seamless 360-degree panoramic image. To this end, we propose a circular blending strategy on both the denoising and VAE decoding stages to maintain the geometry continuity. Based on this, we present two models for \textbf{Text-to-360-panoramas} and \textbf{Single-Image-to-360-panoramas} tasks. The code has been released as an open-source project at \href{https://github.com/ArcherFMY/SD-T2I-360PanoImage}{https://github.com/ArcherFMY/SD-T2I-360PanoImage} and \href{https://www.modelscope.cn/models/damo/cv_diffusion_text-to-360panorama-image_generation/summary}{ModelScope}

View on arXiv PDF Code

Similar