CVIVMay 20, 2025

Towards Generating Realistic Underwater Images

arXiv:2505.14296v1
Originality Incremental advance
AI Analysis

This work addresses the problem of creating realistic underwater imagery for applications like marine research or robotics, but it is incremental as it builds on existing methods like GANs and contrastive learning.

This paper tackled generating realistic underwater images from synthetic ones using various image translation models, finding that incorporating depth information into CUT achieved the lowest FID score for enhanced realism, though with a slight decrease in SSIM.

This paper explores the use of contrastive learning and generative adversarial networks for generating realistic underwater images from synthetic images with uniform lighting. We investigate the performance of image translation models for generating realistic underwater images using the VAROS dataset. Two key evaluation metrics, Fréchet Inception Distance (FID) and Structural Similarity Index Measure (SSIM), provide insights into the trade-offs between perceptual quality and structural preservation. For paired image translation, pix2pix achieves the best FID scores due to its paired supervision and PatchGAN discriminator, while the autoencoder model attains the highest SSIM, suggesting better structural fidelity despite producing blurrier outputs. Among unpaired methods, CycleGAN achieves a competitive FID score by leveraging cycle-consistency loss, whereas CUT, which replaces cycle-consistency with contrastive learning, attains higher SSIM, indicating improved spatial similarity retention. Notably, incorporating depth information into CUT results in the lowest overall FID score, demonstrating that depth cues enhance realism. However, the slight decrease in SSIM suggests that depth-aware learning may introduce structural variations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes