Characterizing and Correcting Effective Target Shift in Online Learning

· Source: stat.ML updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, extended

Summary

This research introduces a framework to analyze and improve online learning in non-stationary environments, particularly focusing on kernel regression. The authors derive a closed-form expression for online kernel regression, demonstrating its equivalence to offline regression with "effective target shifts"—inaccurate target outputs. Conversely, they prove that by compensating for these shifts through "target correction," online kernel-based learning can achieve the same predictive performance as its offline counterpart. They provide both a closed-form and an iterative, causal expression for this correction. Applying this framework to image classification tasks on CIFAR-10 and CORe50, the study shows that online stochastic gradient descent (SGD) with iteratively corrected targets outperforms learning with true targets in continual learning settings, even for non-linear models using an evolving empirical Neural Tangent Kernel (NTK).

Key takeaway

For Machine Learning Engineers developing online learning systems, understanding and correcting for effective target shifts is crucial. Your models can achieve better performance in non-stationary environments, such as continual learning, by training on iteratively corrected targets rather than raw ground-truth labels. This approach, even when heuristically applied to deep neural networks using evolving empirical Neural Tangent Kernels, significantly improves resistance to catastrophic forgetting and enhances overall accuracy.

Key insights

Online learning's sub-optimality stems from effective target shifts, which can be corrected to match offline performance.

Principles

Method

Online kernel regression is reformulated as offline regression with shifted targets. Target correction is derived by inverting this relationship, with an iterative approximation for causal, sequential application, even for non-linear models via empirical NTK.

In practice

Topics

Code references

Best for: Computer Vision Engineer, AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.