FedSIR: Spectral Client Identification and Relabeling for Federated Learning with Noisy Labels
Summary
FedSIR is a multi-stage framework designed to enhance robust federated learning (FL) performance in the presence of noisy labels across distributed clients. Unlike methods that focus on noise-tolerant loss functions or loss dynamics, FedSIR utilizes the spectral structure of client feature representations to identify and mitigate label noise. The framework operates through three main components: identifying clean and noisy clients via spectral consistency analysis of class-wise feature subspaces with minimal communication, enabling noisy clients to relabel corrupted samples using spectral references from clean clients, and employing a noise-aware training strategy. This strategy integrates logit-adjusted loss, knowledge distillation, and distance-aware aggregation to stabilize federated optimization. Experiments on standard FL benchmarks show FedSIR consistently outperforms existing methods for FL with noisy labels.
Key takeaway
For research scientists developing robust federated learning systems, FedSIR offers a novel approach to address label noise by leveraging spectral analysis. You should consider integrating spectral consistency checks and clean client spectral references into your noise mitigation strategies. This method provides a strong alternative to purely loss-function-based or loss-dynamics-based techniques, potentially leading to more stable and accurate model training.
Key insights
FedSIR uses spectral analysis of client features to identify and relabel noisy data in federated learning.
Principles
- Spectral consistency identifies client noise.
- Clean clients provide relabeling references.
- Noise-aware training stabilizes optimization.
Method
FedSIR identifies noisy clients via spectral consistency, relabels samples using clean client spectral references, then trains with logit-adjusted loss, knowledge distillation, and distance-aware aggregation.
In practice
- Analyze feature subspaces for noise.
- Use clean client data as spectral references.
- Integrate logit-adjusted loss for robustness.
Topics
- Federated Learning
- Noisy Labels
- Spectral Analysis
- Client Identification
- Label Relabeling
Code references
Best for: Research Scientist, AI Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.