InstantSplat: Sparse-View SfM-Free Gaussian Splatting in Seconds
InstantSplat is a new framework for novel view synthesis from sparse images, reducing training time significantly and improving 3D scene reconstruction efficiency without relying on traditional Structure-from-Motion methods.
Read original articleInstantSplat is a new framework designed to enhance novel view synthesis (NVS) from sparse images without relying on the traditional Structure-from-Motion (SfM) process, which can be time-consuming and unreliable in scenarios with limited matched features. The method integrates dense stereo predictions with point-based representations to create 3D Gaussians from sparse-view data in a matter of seconds. It begins by generating a pixel-wise multi-view stereo dense point cloud and computing initial camera poses. To optimize the representation, InstantSplat employs a grid-based, confidence-aware Farthest Point Sampling technique to strategically position point primitives, minimizing redundancy and ensuring uniform sampling. This approach allows for the initialization of densely populated Gaussians and a streamlined optimization process that adjusts Gaussian and camera parameters efficiently, avoiding the complexities of previous methods. InstantSplat significantly reduces training time from hours to seconds and demonstrates robust performance across various datasets and view counts. The results indicate that this method can effectively handle the challenges posed by sparse-view scenarios, providing a more efficient and reliable solution for 3D scene reconstruction. The framework's performance has been validated against benchmarks, showcasing its advantages over existing techniques.
Related
Mip-Splatting: Alias-Free 3D Gaussian Splatting
The paper introduces Mip-Splatting, enhancing 3D Gaussian Splatting by addressing artifacts with a 3D smoothing filter and a 2D Mip filter, achieving alias-free renderings and improved image fidelity in 3D rendering applications.
MASt3R – Matching and Stereo 3D Reconstruction
MASt3R, a model within the DUSt3R framework, excels in 3D reconstruction and feature mapping for image collections. It enhances depth perception, reduces errors, and revolutionizes spatial awareness across industries.
WildGaussians: 3D Gaussian Splatting in the Wild
A novel method, WildGaussians, enhances 3D scene reconstruction for in-the-wild data by combining DINO features and appearance modeling with 3D Gaussian Splatting. It outperforms NeRFs and 3DGS in handling dynamic scenes.
New Gaussian Splatting viewer that allows code modification during runtime
The GitHub project "splatviz" offers an interactive viewer for 3D Gaussian Splatting scenes. It uses Python GUI library (imgui) for real-time editing and visualization, supporting scene saving and video creation. Users can clone the repository for exploration.
GLOMAP – Global Structure-from-Motion Revisited
The paper introduces GLOMAP, a new system for 3D structure recovery and camera motion estimation, outperforming COLMAP in accuracy and speed, and is available as open-source software.
Related
Mip-Splatting: Alias-Free 3D Gaussian Splatting
The paper introduces Mip-Splatting, enhancing 3D Gaussian Splatting by addressing artifacts with a 3D smoothing filter and a 2D Mip filter, achieving alias-free renderings and improved image fidelity in 3D rendering applications.
MASt3R – Matching and Stereo 3D Reconstruction
MASt3R, a model within the DUSt3R framework, excels in 3D reconstruction and feature mapping for image collections. It enhances depth perception, reduces errors, and revolutionizes spatial awareness across industries.
WildGaussians: 3D Gaussian Splatting in the Wild
A novel method, WildGaussians, enhances 3D scene reconstruction for in-the-wild data by combining DINO features and appearance modeling with 3D Gaussian Splatting. It outperforms NeRFs and 3DGS in handling dynamic scenes.
New Gaussian Splatting viewer that allows code modification during runtime
The GitHub project "splatviz" offers an interactive viewer for 3D Gaussian Splatting scenes. It uses Python GUI library (imgui) for real-time editing and visualization, supporting scene saving and video creation. Users can clone the repository for exploration.
GLOMAP – Global Structure-from-Motion Revisited
The paper introduces GLOMAP, a new system for 3D structure recovery and camera motion estimation, outperforming COLMAP in accuracy and speed, and is available as open-source software.