Constructing VAE Latent Spaces with Prescribed Topology

· Source: cs.CV updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Mathematics & Computational Sciences · Depth: Expert, extended

Summary

Variational autoencoders (VAEs) typically use a Gaussian prior, creating a topological mismatch when data lies on non-Euclidean manifolds. This paper introduces a constructive mathematical framework to resolve this for manifolds admitting a product covering space, including cylinders, tori, Möbius strips, Klein bottles, and real projective spaces. The framework enables factorized distributions over elementary factors (circles, intervals, lines) with closed-form, decoupled KL divergences, ensuring tractable training. It catalogues reparametrizable encoder-prior pairs for periodic, bounded, and unbounded supports, and provides coordinate transformations for neural networks to output non-Euclidean parameters. For quotient manifolds, a group-invariant feature map ensures identified points produce identical decoder outputs. Anchor constraints fix the coordinate system or create soft topological holes. Experiments on synthetic manifolds and rotated/shifted MNIST datasets confirm that topology-matched priors align KL regularization with the data manifold, outperforming Gaussian baselines at relevant regularization strengths. Code is available at https://github.com/JvHulst/VAE-Topology.

Key takeaway

For AI Scientists and Machine Learning Engineers developing VAEs for data with known non-Euclidean manifold structures, you should adopt this framework to ensure topological fidelity. By matching your VAE's latent space topology to the data's intrinsic structure, you can achieve superior reconstruction quality, improved prior consistency, and more interpretable latent representations, especially at higher regularization strengths. This approach prevents latent space collapse and aligns learned coordinates with true generative factors, offering a robust alternative to standard Gaussian VAEs.

Key insights

A constructive VAE framework resolves topological mismatches for non-Euclidean data manifolds by using factorized priors and invariant features.

Principles

Method

The framework involves decomposing manifolds into elementary factors, selecting reparametrizable encoder-prior pairs, constructing G-invariant feature maps for quotients, and adding anchor constraints.

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CV updates on arXiv.org.