Batch Normalization for Neural Networks on Complex Domains
Summary
This paper introduces a novel Batch Normalization (BN) layer designed for neural networks operating on complex domains, specifically the Siegel disk and the complex unit ball. The proposed BN layer extends existing Riemannian BN concepts by leveraging automorphisms and the Kobayashi pseudodistance to define essential operations like batch centering, biasing, and updating the running mean, particularly when closed-form geometric quantities (e.g., exponential/logarithmic maps) are unavailable. The method's efficacy is demonstrated through experiments on radar clutter classification, node classification, and action recognition tasks. Results show that the new BN layer, named SiegelNetBN and CBallNetBN for respective domains, consistently improves mean accuracy and training stability compared to baselines, including existing FC layers and other Riemannian BN layers, with improvements ranging from 1.15% to over 9% in various scenarios.
Key takeaway
For AI Scientists and Machine Learning Engineers developing neural networks on non-Euclidean data, particularly complex domains like Siegel spaces or the complex unit ball, integrating this novel Batch Normalization layer is crucial. Your models will likely achieve higher accuracy and improved training stability, especially in applications like radar clutter or node classification, by leveraging the proposed automorphism-based centering and Kobayashi pseudodistance-derived almost geodesics. Consider replacing traditional BN layers with this approach to enhance performance on complex-valued inputs.
Key insights
A novel Batch Normalization layer for complex domains enhances neural network stability and accuracy using automorphisms and Kobayashi pseudodistance.
Principles
- Automorphisms enable BN operations in complex domains.
- Kobayashi pseudodistance constructs effective "almost geodesics."
- BN layers improve accuracy and stability in complex domain NNs.
Method
The method computes a Fréchet mean, then uses domain-specific automorphisms for batch centering and biasing. Running mean updates rely on "almost geodesics" derived from the Kobayashi pseudodistance, particularly for Siegel disk and complex unit ball domains.
In practice
- Apply this BN layer to Siegel neural networks for radar clutter classification.
- Integrate this BN layer into hyperbolic neural networks for node classification.
- Use this BN layer to improve action recognition on complex data embeddings.
Topics
- Batch Normalization
- Complex Domains
- Siegel Disk
- Complex Unit Ball
- Kobayashi Pseudodistance
Code references
Best for: AI Scientist, Machine Learning Engineer, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.