
Point Cloud Densification for 3D Gaussian Splatting from Sparse Input Views

Published: 28 October 2024

Abstract

The technique of 3D Gaussian splatting (3DGS) has demonstrated its effectiveness and efficiency in rendering photo-realistic images for novel view synthesis. However, 3DGS requires dense camera coverage, and its performance inevitably degrades with sparse training views, which significantly limits its applicability in real-world scenarios. In recent years, many researchers have explored the use of depth information to alleviate this problem, but the performance of their methods remains sensitive to the accuracy of depth estimation. To this end, we propose an efficient method to enhance the performance of 3DGS with sparse training views. Specifically, instead of applying depth maps for regularization, we propose a densification method that generates high-quality point clouds, providing a superior initialization for the 3D Gaussians. Furthermore, we propose Systematically Angle of View Sampling (SAOVS), which employs Spherical Linear Interpolation (SLERP) and linear interpolation for side view sampling, to determine unseen views outside the training data for semantic pseudo-label regularization. Experiments show that our proposed method significantly outperforms other leading 3D rendering models on the ScanNet and LLFF datasets. In particular, compared with the conventional 3DGS method, it achieves performance gains of up to 1.71 dB in PSNR and 0.07 in SSIM. In addition, the novel views synthesized by our method exhibit the highest visual quality with minimal distortion.
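
As a concrete illustration of the view-sampling idea, the sketch below interpolates an unseen side view between two training cameras: SLERP on the rotations (as unit quaternions) and linear interpolation on the camera centers, matching the SAOVS description above. This is a minimal sketch under assumed pose conventions, not the authors' implementation; the function names `slerp` and `sample_side_view` are illustrative.

```python
import numpy as np

def slerp(q0, q1, t):
    """Spherical linear interpolation between unit quaternions q0 and q1."""
    q0, q1 = q0 / np.linalg.norm(q0), q1 / np.linalg.norm(q1)
    dot = float(np.dot(q0, q1))
    if dot < 0.0:            # flip one quaternion to take the shorter arc
        q1, dot = -q1, -dot
    if dot > 0.9995:         # nearly parallel: plain lerp is numerically stable
        q = q0 + t * (q1 - q0)
        return q / np.linalg.norm(q)
    theta = np.arccos(np.clip(dot, -1.0, 1.0))
    return (np.sin((1 - t) * theta) * q0 + np.sin(t * theta) * q1) / np.sin(theta)

def sample_side_view(q0, c0, q1, c1, t=0.5):
    """Interpolated camera pose: SLERP on rotation, lerp on camera center."""
    return slerp(q0, q1, t), (1 - t) * c0 + t * c1
```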

Supplemental Material

MP4 File - 4055-video.mp4
Novel view synthesis has gained increasing importance due to the widespread availability of VR devices. Previous methods for sparse input data introduced various regularization techniques into the optimization process but encountered significant challenges. In our approach, we propose a point cloud densification method. Specifically, we use estimated depth information to unproject RGB images into 3D world space, resulting in a dense point cloud. This dense point cloud is then used to initialize the 3DGS model. We introduce semantic constraints into the optimization process to ensure that 3D objects maintain consistent structure and detail across different views. To support this, we propose Systematically Angle of View Sampling (SAOVS) for generating coherent side views. Our method synthesizes images with fewer artifacts while preserving rich, detailed textures. Notably, it produces sharper object appearances and achieves superior performance across various evaluation metrics.
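
The densification step can be made concrete with a short sketch: each pixel is back-projected along its camera ray using the estimated depth, giving a colored point in world space. This is a minimal sketch assuming pinhole intrinsics K and world-to-camera extrinsics [R | t]; it is illustrative, not the authors' code.

```python
import numpy as np

def unproject_to_world(rgb, depth, K, R, t):
    """Back-project every pixel (u, v, depth) into 3D world coordinates.

    rgb:   (H, W, 3) image; depth: (H, W) estimated depth map;
    K:     (3, 3) pinhole intrinsics; R, t: world-to-camera extrinsics.
    Returns (H*W, 3) world-space points and their (H*W, 3) colors.
    """
    H, W = depth.shape
    u, v = np.meshgrid(np.arange(W), np.arange(H))
    pix = np.stack([u, v, np.ones_like(u)], axis=-1).reshape(-1, 3).T  # (3, H*W)
    cam = (np.linalg.inv(K) @ pix) * depth.reshape(1, -1)              # camera frame
    world = R.T @ (cam - t.reshape(3, 1))                              # world frame
    return world.T, rgb.reshape(-1, 3)
```

Merging the per-view point clouds then provides the dense initialization for the 3D Gaussians described above.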

    Published In

    MM '24: Proceedings of the 32nd ACM International Conference on Multimedia
    October 2024
    11,719 pages
    ISBN: 9798400706868
    DOI: 10.1145/3664647
    This work is licensed under a Creative Commons Attribution 4.0 International License.

    Publisher

    Association for Computing Machinery, New York, NY, United States

    Author Tags

    1. 3d gaussian splatting
    2. semantic knowledge prior
    3. sparse input views

    Qualifiers

    • Research-article

    Conference

    MM '24: The 32nd ACM International Conference on Multimedia
    October 28 - November 1, 2024
    Melbourne, VIC, Australia

    Acceptance Rates

    MM '24 paper acceptance rate: 1,150 of 4,385 submissions (26%)
    Overall acceptance rate: 2,145 of 8,556 submissions (25%)
