VisDom: Sparse Novel View Synthesis with Visible Domain Constraint

· Source: Computer Vision and Pattern Recognition · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, quick

Summary

VisDom is a novel, learning-free geometric constraint designed to enhance sparse novel view synthesis (NVS) by addressing overfitting issues common in NeRF- and Gaussian Splatting (GS)-based methods when few input views are available. This technique augments classical carving-based visual hull reconstruction by enforcing a minimum multi-view visibility requirement. Specifically, VisDom defines a "visible domain" as the subset of 3D space observed by at least K views, applying this as an additional filtering criterion alongside standard silhouette-based reconstruction. It integrates seamlessly into both implicit (NeRF) and explicit (GS) pipelines, restricting volumetric sampling and guiding Gaussian placement during optimization. Experimental results across three challenging datasets demonstrate consistent improvements in sparse-view NVS, facilitating high-quality object-centric reconstruction from as few as four input images. VisDom is domain-agnostic, requires only silhouettes, and introduces no learned parameters, making it a simple, effective complement to existing approaches, even improving GaussianObject performance on Omni3D and MipNeRF360 at 22x lower training cost.

Key takeaway

For computer vision engineers developing sparse novel view synthesis systems, consider integrating VisDom to significantly improve reconstruction quality from limited input. This learning-free geometric constraint effectively mitigates overfitting and floating artifacts, enabling high-fidelity object-centric models with as few as four images. You can seamlessly apply it to both NeRF and Gaussian Splatting pipelines, potentially reducing training costs by up to 22 times while matching or exceeding current performance benchmarks.

Key insights

VisDom uses a multi-view visibility constraint to improve sparse novel view synthesis by reducing overfitting and artifacts.

Principles

Method

VisDom defines a visible domain as 3D space seen by ≥K views, applying it as a filter with silhouette-based reconstruction to restrict sampling and guide Gaussian placement.

In practice

Topics

Best for: Research Scientist, AI Scientist, Computer Vision Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.