GS-Net: Generalizable Plug-and-Play 3D Gaussian Splatting Module
This addresses the generalization and practicality limitations of 3DGS for 3D reconstruction and rendering, though it is incremental as it builds on existing 3DGS methods.
The paper tackles the problem of 3D Gaussian Splatting (3DGS) models overfitting to single-scene training and being sensitive to initialization, proposing GS-Net as a generalizable plug-and-play module that densifies Gaussian ellipsoids from sparse point clouds, resulting in PSNR improvements of 2.08 dB for conventional viewpoints and 1.86 dB for novel viewpoints.
3D Gaussian Splatting (3DGS) integrates the strengths of primitive-based representations and volumetric rendering techniques, enabling real-time, high-quality rendering. However, 3DGS models typically overfit to single-scene training and are highly sensitive to the initialization of Gaussian ellipsoids, heuristically derived from Structure from Motion (SfM) point clouds, which limits both generalization and practicality. To address these limitations, we propose GS-Net, a generalizable, plug-and-play 3DGS module that densifies Gaussian ellipsoids from sparse SfM point clouds, enhancing geometric structure representation. To the best of our knowledge, GS-Net is the first plug-and-play 3DGS module with cross-scene generalization capabilities. Additionally, we introduce the CARLA-NVS dataset, which incorporates additional camera viewpoints to thoroughly evaluate reconstruction and rendering quality. Extensive experiments demonstrate that applying GS-Net to 3DGS yields a PSNR improvement of 2.08 dB for conventional viewpoints and 1.86 dB for novel viewpoints, confirming the method's effectiveness and robustness.