CV | Oct 18, 2023

HSTR-Net: Reference Based Video Super-resolution with Dual Cameras

arXiv:2310.12092v2 | 1 citation | h-index: 24
AI Analysis

This work addresses the high cost of HSTR video recording for applications like aerial monitoring, though it is incremental because it builds on existing RefSR techniques.

The paper tackles generating high-spatio-temporal resolution video using a dual-camera system with reference-based super-resolution, achieving significant improvements in PSNR and SSIM metrics over existing methods and sufficient FPS for drone deployment.

High-spatio-temporal resolution (HSTR) video recording plays a crucial role in enhancing various imagery tasks that require fine-detailed information. State-of-the-art cameras provide this required high frame-rate and high spatial resolution together, albeit at a high cost. To alleviate this issue, this paper proposes a dual camera system for the generation of HSTR video using reference-based super-resolution (RefSR). One camera captures high spatial resolution low frame rate (HSLF) video while the other captures low spatial resolution high frame rate (LSHF) video simultaneously for the same scene. A novel deep learning architecture is proposed to fuse HSLF and LSHF video feeds and synthesize HSTR video frames. The proposed model combines optical flow estimation and (channel-wise and spatial) attention mechanisms to capture the fine motion and complex dependencies between frames of the two video feeds. Simulations show that the proposed model provides significant improvement over existing reference-based SR techniques in terms of PSNR and SSIM metrics. The method also exhibits sufficient frames per second (FPS) for aerial monitoring when deployed on a power-constrained drone equipped with dual cameras.
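
To make the dual-camera setup above concrete, here is a minimal sketch of how the two feeds could be simulated from a ground-truth HSTR clip and how a PSNR score could be computed. It is an illustration under stated assumptions, not the authors' pipeline: the function names, the 4x spatial scale, the temporal stride of 4, and the block-average downsampling are all hypothetical choices, and frames are plain 2D lists of grayscale values.

```python
import math


def downsample_frame(frame, scale):
    """Average non-overlapping scale x scale blocks of a 2D grayscale frame.

    Block averaging is an assumed stand-in for the low-resolution camera.
    """
    h, w = len(frame), len(frame[0])
    out = []
    for y in range(0, h - h % scale, scale):
        row = []
        for x in range(0, w - w % scale, scale):
            block = [frame[y + dy][x + dx]
                     for dy in range(scale) for dx in range(scale)]
            row.append(sum(block) / len(block))
        out.append(row)
    return out


def make_dual_feeds(hstr_video, spatial_scale=4, temporal_stride=4):
    """Split a ground-truth HSTR clip into simulated HSLF and LSHF feeds."""
    # HSLF: full-resolution frames, but only every temporal_stride-th one,
    # keyed by their timestamp in the original clip.
    hslf = {t: hstr_video[t]
            for t in range(0, len(hstr_video), temporal_stride)}
    # LSHF: every frame is kept, but spatially downsampled.
    lshf = [downsample_frame(f, spatial_scale) for f in hstr_video]
    return hslf, lshf


def nearest_reference(hslf, t):
    """Timestamp of the HSLF frame closest in time to LSHF frame t."""
    return min(hslf, key=lambda ref_t: abs(ref_t - t))


def psnr(a, b, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized frames."""
    n = len(a) * len(a[0])
    mse = sum((pa - pb) ** 2
              for ra, rb in zip(a, b) for pa, pb in zip(ra, rb)) / n
    return math.inf if mse == 0 else 10 * math.log10(peak ** 2 / mse)


# Usage: an 8-frame, 8x8 synthetic clip.
video = [[[float(t + x + y) for x in range(8)] for y in range(8)]
         for t in range(8)]
hslf, lshf = make_dual_feeds(video)
ref_t = nearest_reference(hslf, 3)  # reference frame for LSHF frame 3
```

A RefSR model would take an LSHF frame together with its nearest HSLF reference and try to reconstruct the corresponding full-resolution frame; `psnr` then scores that reconstruction against the ground truth.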

