Capacity-Controlled Multi-View Stylization of 3D Gaussian Splatting

· Source: Computer Vision and Pattern Recognition · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Computer Vision · Depth: Expert, quick

Summary

A new capacity-controlled framework addresses the challenge of enforcing stylistic coherence across viewpoints in 3D Gaussian Splatting (3DGS). Existing 3D stylization methods often apply 2D feature-matching losses independently per rendered view, resulting in unstable style allocation, many-to-one feature reuse, and limited cross-view consistency. This novel approach reformulates local style matching as a semi-balanced optimal transport problem, introducing explicit column-capacity constraints with tunable strength. This mechanism mitigates many-to-one matching and enables controllable allocation of style features, balancing feature coverage and stylistic diversity while maintaining stable correspondences across viewpoints. The framework further enhances cross-view coherence through a novel cross-view matching guidance and incorporates geometric regularizations to optimize Gaussian primitives for finer-grained textures. Experiments demonstrate significant improvements in multi-view stylistic consistency, yielding stable, expressive 3D stylizations that preserve scene semantic structure.

Key takeaway

For computer vision engineers developing 3D stylization applications, this optimal transport-based framework offers a robust solution to achieve consistent multi-view stylization with 3D Gaussian Splatting. You should consider integrating capacity-controlled optimal transport and cross-view matching guidance to overcome current limitations of unstable style allocation. This approach ensures stable, expressive stylizations that preserve scene semantics, significantly improving visual coherence across different viewpoints in your projects.

Key insights

Optimal transport with capacity control enables consistent multi-view stylization for 3D Gaussian Splatting by mitigating feature reuse.

Principles

Method

Reformulate local style matching as a semi-balanced optimal transport problem with explicit column-capacity constraints, then integrate cross-view matching guidance and geometric regularizations for 3DGS.

In practice

Topics

Best for: Research Scientist, AI Scientist, Computer Vision Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.