CVIVOct 10, 2025

3D Reconstruction from Transient Measurements with Time-Resolved Transformer

arXiv:2510.09205v1h-index: 5Has Code
Originality Incremental advance
AI Analysis

This work addresses the problem of low quantum efficiency and high noise in 3D reconstruction for photon-efficient imaging, offering a novel method that is incremental in its adaptation of transformers to spatio-temporal data.

The paper tackles 3D reconstruction from transient measurements in photon-efficient imaging by proposing a Time-Resolved Transformer (TRT) architecture, which significantly outperforms existing methods on synthetic and real-world data, as demonstrated in extensive experiments.

Transient measurements, captured by the timeresolved systems, are widely employed in photon-efficient reconstruction tasks, including line-of-sight (LOS) and non-line-of-sight (NLOS) imaging. However, challenges persist in their 3D reconstruction due to the low quantum efficiency of sensors and the high noise levels, particularly for long-range or complex scenes. To boost the 3D reconstruction performance in photon-efficient imaging, we propose a generic Time-Resolved Transformer (TRT) architecture. Different from existing transformers designed for high-dimensional data, TRT has two elaborate attention designs tailored for the spatio-temporal transient measurements. Specifically, the spatio-temporal self-attention encoders explore both local and global correlations within transient data by splitting or downsampling input features into different scales. Then, the spatio-temporal cross attention decoders integrate the local and global features in the token space, resulting in deep features with high representation capabilities. Building on TRT, we develop two task-specific embodiments: TRT-LOS for LOS imaging and TRT-NLOS for NLOS imaging. Extensive experiments demonstrate that both embodiments significantly outperform existing methods on synthetic data and real-world data captured by different imaging systems. In addition, we contribute a large-scale, high-resolution synthetic LOS dataset with various noise levels and capture a set of real-world NLOS measurements using a custom-built imaging system, enhancing the data diversity in this field. Code and datasets are available at https://github.com/Depth2World/TRT.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes