GaRField++: Reinforced Gaussian Radiance Fields for Large-Scale 3D Scene Reconstruction
This work addresses the problem of reconstructing large-scale 3D scenes for applications like novel view synthesis, though it appears incremental as it builds upon 3D Gaussian splatting with specific improvements.
The paper tackles scalability and accuracy challenges in large-scale 3D scene reconstruction by proposing a reinforced Gaussian radiance field framework, achieving more high-fidelity rendering results than state-of-the-art methods on datasets like Mill19, Urban3D, and MatrixCity.
This paper proposes a novel framework for large-scale scene reconstruction based on 3D Gaussian splatting (3DGS) and aims to address the scalability and accuracy challenges faced by existing methods. For tackling the scalability issue, we split the large scene into multiple cells, and the candidate point-cloud and camera views of each cell are correlated through a visibility-based camera selection and a progressive point-cloud extension. To reinforce the rendering quality, three highlighted improvements are made in comparison with vanilla 3DGS, which are a strategy of the ray-Gaussian intersection and the novel Gaussians density control for learning efficiency, an appearance decoupling module based on ConvKAN network to solve uneven lighting conditions in large-scale scenes, and a refined final loss with the color loss, the depth distortion loss, and the normal consistency loss. Finally, the seamless stitching procedure is executed to merge the individual Gaussian radiance field for novel view synthesis across different cells. Evaluation of Mill19, Urban3D, and MatrixCity datasets shows that our method consistently generates more high-fidelity rendering results than state-of-the-art methods of large-scale scene reconstruction. We further validate the generalizability of the proposed approach by rendering on self-collected video clips recorded by a commercial drone.