Symmetrization of Loss Functions for Robust Training of Neural Networks in the Presence of Noisy Labels
Summary
A new study introduces a symmetrization method for robust training of neural networks, addressing the challenge of noisy labels in expensive and error-prone training sets. This method arises from the unique decomposition of any multi-class loss function into a symmetric component and a class-insensitive term. Specifically, symmetrizing the cross-entropy loss results in a linear multi-class extension of the unhinged loss. Under suitable assumptions, this multi-class unhinged loss is demonstrated to be the unique convex multi-class symmetric loss. Furthermore, it plays a fundamental local role, as the linear approximation of any symmetric loss around score vectors with equal components is equivalent to it. The researchers also introduce SGCE and alpha-MAE, two novel loss functions that interpolate between the multi-class unhinged loss and the Mean Absolute Error, offering control over the beta-smoothness of the loss. Experiments on standard noisy-label benchmarks confirm their competitive performance against existing robust loss functions.
Key takeaway
For Machine Learning Engineers training neural networks with datasets prone to noisy labels, you should evaluate the newly introduced SGCE and alpha-MAE loss functions. These functions, derived from a principled symmetrization of cross-entropy, offer a robust alternative to existing methods. Their ability to interpolate between the multi-class unhinged loss and Mean Absolute Error, while controlling beta-smoothness, provides flexibility. Incorporating them could significantly improve model robustness and performance in real-world, imperfect data scenarios.
Key insights
Symmetrizing multi-class loss functions, especially cross-entropy, yields a unique convex unhinged loss robust to noisy labels.
Principles
- Symmetry offers theoretical robustness guarantees.
- Multi-class losses uniquely decompose into symmetric and class-insensitive parts.
- The multi-class unhinged loss is uniquely convex and symmetric.
Method
Decompose multi-class loss functions into symmetric and class-insensitive terms. Symmetrize cross-entropy to derive a multi-class unhinged loss. Introduce SGCE and alpha-MAE to interpolate between this unhinged loss and MAE, controlling beta-smoothness.
In practice
- Apply SGCE or alpha-MAE for robust training with noisy labels.
- Consider multi-class unhinged loss for convex symmetric loss needs.
Topics
- Loss Functions
- Noisy Labels
- Neural Network Training
- Symmetrization
- Cross-Entropy
- SGCE
- alpha-MAE
Best for: Research Scientist, AI Engineer, NLP Engineer, AI Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.