Analysis of Nystrom method with sequential ridge leverage scores

· Source: cs.LG updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, extended

Summary

This paper introduces INK-Estimate, an algorithm designed for large-scale kernel ridge regression (KRR) in a sequential setting, addressing the computational and memory limitations of traditional Nyström methods. KRR typically requires storing a large kernel matrix, leading to $\mathcal{O}(n^2)$ space and $\mathcal{O}(n^3)$ time complexity for $n$ samples. INK-Estimate maintains a small sketch of the kernel matrix, incrementally updating estimates of ridge leverage scores (RLSs) and the effective dimension in a single pass over the data. This approach requires a fixed, small space budget, $\overline{q}$, proportional to the effective dimension, and provides strong approximation guarantees for the kernel matrix and the statistical risk of the KRR solution at any intermediate step. Unlike prior methods, INK-Estimate's accuracy does not depend on the smallest eigenvalue of the kernel matrix and offers continuous generalization guarantees.

Key takeaway

Research Scientists working with large, evolving datasets for Kernel Ridge Regression should consider INK-Estimate. This algorithm offers a robust, single-pass solution that maintains strong approximation and risk guarantees at every step, unlike traditional batch or online kernel sparsification methods. You can achieve significant computational savings and fixed memory usage, making it suitable for scenarios where continuous, accurate model updates are essential without re-processing historical data.

Key insights

INK-Estimate provides a single-pass, space-efficient sequential Nyström approximation with strong, continuous KRR solution guarantees.

Principles

Method

INK-Estimate incrementally updates RLS and effective dimension estimates using a Shrink-Expand procedure, maintaining a Nyström approximation with a fixed space budget, and discarding superfluous samples.

In practice

Topics

Best for: Research Scientist, AI Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.LG updates on arXiv.org.