Distribution Alignment for One-Shot Federated Learning via Optimal Transport

· Source: Machine Learning · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

SLOT-Align (Single-round, Learning-free Optimal Transport Alignment) is a novel geometry-aware feature harmonization framework for One-Shot Federated Learning (OSFL). OSFL addresses extreme communication regimes where clients interact with a server only once, amplifying issues from heterogeneous client data distributions, specifically joint domain and label shift. These shifts cause misaligned feature representations that iterative optimization cannot correct. Existing OSFL methods, relying on distillation, server-side generation, or ensemble-based aggregation, typically assume aligned representations or handle shifts separately. SLOT-Align uses a shared frozen encoder to extract compact feature statistics, constructs a global reference via Bures-Wasserstein barycenters, and aligns local representations using closed-form geodesic optimal transport maps. This computationally efficient method integrates with existing OSFL pipelines using frozen encoders without modifying their training. Experiments show consistent accuracy and robustness improvements under joint domain and label shift across multiple benchmarks, backbones, and OSFL methods.

Key takeaway

For Machine Learning Engineers deploying One-Shot Federated Learning in environments with significant client data heterogeneity, particularly joint domain and label shift, you should consider integrating SLOT-Align. This framework directly addresses misaligned feature representations without iterative optimization, offering a computationally efficient way to improve model accuracy and robustness. Its compatibility with existing frozen encoder OSFL pipelines means you can enhance performance without modifying core training procedures.

Key insights

SLOT-Align harmonizes heterogeneous client feature distributions in One-Shot Federated Learning using optimal transport for improved accuracy.

Principles

Method

SLOT-Align extracts feature statistics via a shared frozen encoder, builds a global reference using Bures-Wasserstein barycenters, then aligns local representations with closed-form geodesic optimal transport maps.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.