IVAICVJun 24, 2025

NeRF-based CBCT Reconstruction needs Normalization and Initialization

arXiv:2506.19742v11 citationsh-index: 11Has CodeMICCAI
Originality Incremental advance
AI Analysis

This addresses a specific bottleneck in medical imaging for CBCT reconstruction, offering an incremental improvement to existing NeRF-based methods.

The paper tackles the problem of unstable training and degraded reconstruction quality in NeRF-based CBCT reconstruction due to a local-global optimization mismatch between the hash encoder and neural network, achieving improved stability, faster convergence, and enhanced performance on 128 CT cases from 4 datasets.

Cone Beam Computed Tomography (CBCT) is widely used in medical imaging. However, the limited number and intensity of X-ray projections make reconstruction an ill-posed problem with severe artifacts. NeRF-based methods have achieved great success in this task. However, they suffer from a local-global training mismatch between their two key components: the hash encoder and the neural network. Specifically, in each training step, only a subset of the hash encoder's parameters is used (local sparse), whereas all parameters in the neural network participate (global dense). Consequently, hash features generated in each step are highly misaligned, as they come from different subsets of the hash encoder. These misalignments from different training steps are then fed into the neural network, causing repeated inconsistent global updates in training, which leads to unstable training, slower convergence, and degraded reconstruction quality. Aiming to alleviate the impact of this local-global optimization mismatch, we introduce a Normalized Hash Encoder, which enhances feature consistency and mitigates the mismatch. Additionally, we propose a Mapping Consistency Initialization(MCI) strategy that initializes the neural network before training by leveraging the global mapping property from a well-trained model. The initialized neural network exhibits improved stability during early training, enabling faster convergence and enhanced reconstruction performance. Our method is simple yet effective, requiring only a few lines of code while substantially improving training efficiency on 128 CT cases collected from 4 different datasets, covering 7 distinct anatomical regions.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes