CVAIIVJun 5, 2025

Physics Informed Capsule Enhanced Variational AutoEncoder for Underwater Image Enhancement

arXiv:2506.04753v13 citationsh-index: 26Has Code
Originality Highly original
AI Analysis

This addresses the problem of enhancing degraded underwater images for applications like marine exploration, with incremental improvements in performance and efficiency.

The paper tackles underwater image enhancement by integrating a physical model with capsule clustering in a dual-stream architecture, achieving state-of-the-art results with improvements of +0.5dB PSNR over existing methods and one-third the computational complexity.

We present a novel dual-stream architecture that achieves state-of-the-art underwater image enhancement by explicitly integrating the Jaffe-McGlamery physical model with capsule clustering-based feature representation learning. Our method simultaneously estimates transmission maps and spatially-varying background light through a dedicated physics estimator while extracting entity-level features via capsule clustering in a parallel stream. This physics-guided approach enables parameter-free enhancement that respects underwater formation constraints while preserving semantic structures and fine-grained details. Our approach also features a novel optimization objective ensuring both physical adherence and perceptual quality across multiple spatial frequencies. To validate our approach, we conducted extensive experiments across six challenging benchmarks. Results demonstrate consistent improvements of $+0.5$dB PSNR over the best existing methods while requiring only one-third of their computational complexity (FLOPs), or alternatively, more than $+1$dB PSNR improvement when compared to methods with similar computational budgets. Code and data \textit{will} be available at https://github.com/iN1k1/.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes