Batch Normalization for Neural Networks on Complex Domains

· Source: stat.ML updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Mathematics & Computational Sciences · Depth: Expert, extended

Summary

This paper introduces a novel Batch Normalization (BN) layer designed for neural networks operating on complex domains, specifically the Siegel disk and the complex unit ball. The proposed BN layer extends existing Riemannian BN concepts by leveraging automorphisms and the Kobayashi pseudodistance to define essential operations like batch centering, biasing, and updating the running mean, particularly when closed-form geometric quantities (e.g., exponential/logarithmic maps) are unavailable. The method's efficacy is demonstrated through experiments on radar clutter classification, node classification, and action recognition tasks. Results show that the new BN layer, named SiegelNetBN and CBallNetBN for respective domains, consistently improves mean accuracy and training stability compared to baselines, including existing FC layers and other Riemannian BN layers, with improvements ranging from 1.15% to over 9% in various scenarios.

Key takeaway

For AI Scientists and Machine Learning Engineers developing neural networks on non-Euclidean data, particularly complex domains like Siegel spaces or the complex unit ball, integrating this novel Batch Normalization layer is crucial. Your models will likely achieve higher accuracy and improved training stability, especially in applications like radar clutter or node classification, by leveraging the proposed automorphism-based centering and Kobayashi pseudodistance-derived almost geodesics. Consider replacing traditional BN layers with this approach to enhance performance on complex-valued inputs.

Key insights

A novel Batch Normalization layer for complex domains enhances neural network stability and accuracy using automorphisms and Kobayashi pseudodistance.

Principles

Method

The method computes a Fréchet mean, then uses domain-specific automorphisms for batch centering and biasing. Running mean updates rely on "almost geodesics" derived from the Kobayashi pseudodistance, particularly for Siegel disk and complex unit ball domains.

In practice

Topics

Code references

Best for: AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.