CVIVMar 14, 2025

UStyle: Waterbody Style Transfer of Underwater Scenes by Depth-Guided Feature Synthesis

arXiv:2503.11893v21 citationsh-index: 5Has CodeIEEE J Ocean Eng
Originality Incremental advance
AI Analysis

This addresses the need for robust style transfer in underwater imaging, enabling applications in marine research and robotics, though it is incremental as it builds on existing style transfer techniques with domain-specific adaptations.

The paper tackles the problem of waterbody style transfer in underwater images, which is challenging due to scattering and depth-dependent artifacts, by introducing UStyle, a data-driven framework that achieves perceptually consistent stylization without reference images, surpassing state-of-the-art methods.

The concept of waterbody style transfer remains largely unexplored in the underwater imaging and vision literature. Traditional image style transfer (STx) methods primarily focus on artistic and photorealistic blending, often failing to preserve object and scene geometry in images captured in high-scattering mediums such as underwater. The wavelength-dependent nonlinear attenuation and depth-dependent backscattering artifacts further complicate learning underwater image STx from unpaired data. This paper introduces UStyle, the first data-driven learning framework for transferring waterbody styles across underwater images without requiring prior reference images or scene information. We propose a novel depth-aware whitening and coloring transform (DA-WCT) mechanism that integrates physics-based waterbody synthesis to ensure perceptually consistent stylization while preserving scene structure. To enhance style transfer quality, we incorporate carefully designed loss functions that guide UStyle to maintain colorfulness, lightness, structural integrity, and frequency-domain characteristics, as well as high-level content in VGG and CLIP (contrastive language-image pretraining) feature spaces. By addressing domain-specific challenges, UStyle provides a robust framework for no-reference underwater image STx, surpassing state-of-the-art (SOTA) methods that rely solely on end-to-end reconstruction loss. Furthermore, we introduce the UF7D dataset, a curated collection of high-resolution underwater images spanning seven distinct waterbody styles, establishing a benchmark to support future research in underwater image STx. The UStyle inference pipeline and UF7D dataset are released at: https://github.com/uf-robopi/UStyle.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes