CVLGJan 24

ReflexSplit: Single Image Reflection Separation via Layer Fusion-Separation

arXiv:2601.174681 citationsh-index: 10Has Code
AI Analysis

This work solves the problem of separating reflections from images for computer vision applications, representing an incremental improvement over existing methods.

The paper tackles the problem of single image reflection separation (SIRS) by proposing ReflexSplit, a dual-stream framework that addresses transmission-reflection confusion under nonlinear mixing. It achieves state-of-the-art performance on synthetic and real-world benchmarks with superior perceptual quality and robust generalization.

Single Image Reflection Separation (SIRS) disentangles mixed images into transmission and reflection layers. Existing methods suffer from transmission-reflection confusion under nonlinear mixing, particularly in deep decoder layers, due to implicit fusion mechanisms and inadequate multi-scale coordination. We propose ReflexSplit, a dual-stream framework with three key innovations. (1) Cross-scale Gated Fusion (CrGF) adaptively aggregates semantic priors, texture details, and decoder context across hierarchical depths, stabilizing gradient flow and maintaining feature consistency. (2) Layer Fusion-Separation Blocks (LFSB) alternate between fusion for shared structure extraction and differential separation for layer-specific disentanglement. Inspired by Differential Transformer, we extend attention cancellation to dual-stream separation via cross-stream subtraction. (3) Curriculum training progressively strengthens differential separation through depth-dependent initialization and epoch-wise warmup. Extensive experiments on synthetic and real-world benchmarks demonstrate state-of-the-art performance with superior perceptual quality and robust generalization. Our code is available at https://github.com/wuw2135/ReflexSplit.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes