IV CV LGMar 30, 2025

A Lightweight Image Super-Resolution Transformer Trained on Low-Resolution Images Only

Björn Möller, Lucas Görnhardt, Tim Fingscheidt

arXiv:2503.23265v15.13 citationsh-index: 4Has CodeProcedia Computer Science

Originality Synthesis-oriented

AI Analysis

This addresses a practical limitation for real-world super-resolution applications where high-quality training data is scarce, though it is incremental as it adapts existing methods to new data.

The paper tackles the problem of training image super-resolution models without high-resolution images by proposing a lightweight transformer trained only on low-resolution data, achieving superior performance over state-of-the-art methods on benchmark datasets like Set5 and Urban100.

Transformer architectures prominently lead single-image super-resolution (SISR) benchmarks, reconstructing high-resolution (HR) images from their low-resolution (LR) counterparts. Their strong representative power, however, comes with a higher demand for training data compared to convolutional neural networks (CNNs). For many real-world SR applications, the availability of high-quality HR training images is not given, sparking interest in LR-only training methods. The LR-only SISR benchmark mimics this condition by allowing only low-resolution (LR) images for model training. For a 4x super-resolution, this effectively reduces the amount of available training data to 6.25% of the HR image pixels, which puts the employment of a data-hungry transformer model into question. In this work, we are the first to utilize a lightweight vision transformer model with LR-only training methods addressing the unsupervised SISR LR-only benchmark. We adopt and configure a recent LR-only training method from microscopy image super-resolution to macroscopic real-world data, resulting in our multi-scale training method for bicubic degradation (MSTbic). Furthermore, we compare it with reference methods and prove its effectiveness both for a transformer and a CNN model. We evaluate on the classic SR benchmark datasets Set5, Set14, BSD100, Urban100, and Manga109, and show superior performance over state-of-the-art (so far: CNN-based) LR-only SISR methods. The code is available on GitHub: https://github.com/ifnspaml/SuperResolutionMultiscaleTraining.

View on arXiv PDF Code

Similar