CVMar 18, 2025

Intra and Inter Parser-Prompted Transformers for Effective Image Restoration

arXiv:2503.14037v17 citationsh-index: 5AAAI
Originality Incremental advance
AI Analysis

This work addresses image quality enhancement for computer vision applications, presenting an incremental improvement by combining parser-prompted attention mechanisms with existing transformer architectures.

The paper tackles image restoration tasks like deraining and deblurring by proposing PPTformer, which integrates parser features from visual foundation models to boost performance, achieving state-of-the-art results across multiple benchmarks.

We propose Intra and Inter Parser-Prompted Transformers (PPTformer) that explore useful features from visual foundation models for image restoration. Specifically, PPTformer contains two parts: an Image Restoration Network (IRNet) for restoring images from degraded observations and a Parser-Prompted Feature Generation Network (PPFGNet) for providing IRNet with reliable parser information to boost restoration. To enhance the integration of the parser within IRNet, we propose Intra Parser-Prompted Attention (IntraPPA) and Inter Parser-Prompted Attention (InterPPA) to implicitly and explicitly learn useful parser features to facilitate restoration. The IntraPPA re-considers cross attention between parser and restoration features, enabling implicit perception of the parser from a long-range and intra-layer perspective. Conversely, the InterPPA initially fuses restoration features with those of the parser, followed by formulating these fused features within an attention mechanism to explicitly perceive parser information. Further, we propose a parser-prompted feed-forward network to guide restoration within pixel-wise gating modulation. Experimental results show that PPTformer achieves state-of-the-art performance on image deraining, defocus deblurring, desnowing, and low-light enhancement.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes