GaussianFocus: Constrained Attention Focus for 3D Gaussian Splatting
This addresses rendering quality and scalability challenges for 3D reconstruction and neural rendering applications, representing an incremental improvement over existing 3D Gaussian Splatting techniques.
The paper tackles the problem of excessive redundant noisy Gaussians and scalability issues in 3D Gaussian Splatting for 3D scene rendering, resulting in significantly reduced unnecessary Gaussians and enhanced rendering quality that surpasses state-of-the-art methods, with demonstrated capability to manage large scenes like urban environments while maintaining high fidelity.
Recent developments in 3D reconstruction and neural rendering have significantly propelled the capabilities of photo-realistic 3D scene rendering across various academic and industrial fields. The 3D Gaussian Splatting technique, alongside its derivatives, integrates the advantages of primitive-based and volumetric representations to deliver top-tier rendering quality and efficiency. Despite these advancements, the method tends to generate excessive redundant noisy Gaussians overfitted to every training view, which degrades the rendering quality. Additionally, while 3D Gaussian Splatting excels in small-scale and object-centric scenes, its application to larger scenes is hindered by constraints such as limited video memory, excessive optimization duration, and variable appearance across views. To address these challenges, we introduce GaussianFocus, an innovative approach that incorporates a patch attention algorithm to refine rendering quality and implements a Gaussian constraints strategy to minimize redundancy. Moreover, we propose a subdivision reconstruction strategy for large-scale scenes, dividing them into smaller, manageable blocks for individual training. Our results indicate that GaussianFocus significantly reduces unnecessary Gaussians and enhances rendering quality, surpassing existing State-of-The-Art (SoTA) methods. Furthermore, we demonstrate the capability of our approach to effectively manage and render large scenes, such as urban environments, whilst maintaining high fidelity in the visual output.