CVSep 22, 2025

Neural-MMGS: Multi-modal Neural Gaussian Splats for Large-Scale Scene Reconstruction

arXiv:2509.17762v1h-index: 4
Originality Incremental advance
AI Analysis

This addresses multimodal reconstruction for robotics or autonomous systems, but is incremental as it builds on existing neural 3DGS frameworks.

The paper tackles large-scale scene reconstruction by fusing image, LiDAR, and semantic data into a compact embedding per Gaussian, achieving higher-quality reconstructions on Oxford Spires and competitive results with less storage on KITTI-360.

This paper proposes Neural-MMGS, a novel neural 3DGS framework for multimodal large-scale scene reconstruction that fuses multiple sensing modalities in a per-gaussian compact, learnable embedding. While recent works focusing on large-scale scene reconstruction have incorporated LiDAR data to provide more accurate geometric constraints, we argue that LiDAR's rich physical properties remain underexplored. Similarly, semantic information has been used for object retrieval, but could provide valuable high-level context for scene reconstruction. Traditional approaches append these properties to Gaussians as separate parameters, increasing memory usage and limiting information exchange across modalities. Instead, our approach fuses all modalities -- image, LiDAR, and semantics -- into a compact, learnable embedding that implicitly encodes optical, physical, and semantic features in each Gaussian. We then train lightweight neural decoders to map these embeddings to Gaussian parameters, enabling the reconstruction of each sensing modality with lower memory overhead and improved scalability. We evaluate Neural-MMGS on the Oxford Spires and KITTI-360 datasets. On Oxford Spires, we achieve higher-quality reconstructions, while on KITTI-360, our method reaches competitive results with less storage consumption compared with current approaches in LiDAR-based novel-view synthesis.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes