CVMay 15

On RGB-TIR Stereo Calibration under Extreme Resolution Asymmetry

arXiv:2605.158608.0
AI Analysis

Enables accurate multimodal calibration for building energy assessment using low-cost thermal sensors, addressing a practical bottleneck in extreme resolution asymmetry.

The paper presents a stereo calibration framework for an RGB camera (2028x1520 px) and a low-resolution TIR camera (80x62 px) with a 1:625 pixel ratio, achieving a baseline of 32.7 mm and reprojection error of 0.382 px, validated on a building mock-up.

Accurate geometric calibration of RGB-thermal infrared (TIR) stereo camera systems is essential for multimodal building envelope analysis, yet remains challenging when low-cost thermal sensors with very low spatial resolution are employed. This paper presents a practical stereo calibration framework for an RGB camera (2028 x 1520 px) paired with a TIR camera operating at only 80 x 62 px - a pixel-count ratio of approximately 1:625. An active OLED screen dynamically switches modality-specific patterns (checkerboard for TIR, ChArUco for RGB) on a single physical surface, providing controlled and repeatable thermal contrast. A dedicated corner detection algorithm combining perspective rectification, Hessian saddle-point analysis, and Mean Shift localisation achieves reliable checkerboard detection at 80 x 62 px without per-frame parameter tuning. A baseline-constrained bundle adjustment enforces physically consistent rig geometry under the planar-calibration-object degeneracy, yielding a stereo baseline of 32.7 mm (nominal 30 mm) with an overall reprojection error of 0.382 px. The system is validated on a thermally active building mock-up using constant-depth and per-pixel depth estimation, demonstrating consistent TIR-to-RGB projection suitable for building energy performance assessment.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes