CVMay 5

RPBA-Net: An Interpretable Residual Pyramid Bilateral Affine Network for RAW-Domain ISP Enhancement

arXiv:2605.0362641.9
Predicted impact top 77% in CV · last 90 daysOriginality Incremental advance
AI Analysis

This work addresses the fragmentation and lack of interpretability in RAW-domain image signal processing for mobile and embedded platforms.

RPBA-Net proposes an interpretable residual pyramid bilateral affine network for RAW-domain ISP enhancement, unifying demosaicing, color correction, and detail enhancement. It achieves state-of-the-art reconstruction fidelity and perceptual quality with low model complexity, suitable for mobile and embedded platforms.

To address module fragmentation, uninterpretable mappings, and deployment constraints in RAW-domain demosaicing, color correction, and detail enhancement, this paper proposes RPBA-Net, an interpretable residual pyramid bilateral affine network for RAW-domain ISP enhancement. Given packed RAW as input, the method performs residual affine base reconstruction by estimating a base RGB representation and learning identity-guided residual affine corrections, thereby unifying demosaicing and enhancement. It further builds pyramid bilateral affine grids and combines guide-driven autoregressive adaptive slicing with adaptive cross-layer fusion to hierarchically model global tone restoration and local texture enhancement. In addition, smoothness, cross-scale consistency, and magnitude regularization terms are introduced to improve model stability, controllability, and structural interpretability. Extensive experiments demonstrate that RPBA-Net surpasses representative RAW-to-sRGB methods and achieves state-of-the-art performance in reconstruction fidelity and perceptual quality, while maintaining low model complexity and strong deployment potential for mobile and embedded platforms.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes