Adaptive multi-fidelity optimization with fast learning rates

· Source: stat.ML updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Mathematics & Computational Sciences · Depth: Expert, extended

Summary

Researchers from École Normale Supérieure, Inria Lille, and Huawei R&D, UK, introduce Kometo, a new algorithm for adaptive multi-fidelity optimization. This method addresses the challenge of optimizing locally smooth functions with a limited budget, where approximations of varying costs and biases are available. The work first establishes novel lower bounds for simple regret under different assumptions on fidelity, characterized by a cost-to-bias function. Kometo is then presented as an algorithm that achieves these optimal rates, up to logarithmic factors, without requiring prior knowledge of the function's smoothness or fidelity assumptions, improving upon previous methods like MFPDOO. Empirical results demonstrate Kometo's superior performance over existing multi-fidelity optimization techniques in several synthetic and real-world hyper-parameter tuning experiments, notably outperforming MFPDOO on three synthetic tasks and a practical text classification hyper-parameter tuning task.

Key takeaway

For machine learning engineers optimizing complex models with limited budgets, Kometo offers a robust solution for hyper-parameter tuning. Its ability to adapt to unknown function smoothness and bias characteristics, coupled with demonstrated empirical superiority over prior methods, means you can achieve better optimization outcomes without extensive problem-dependent parameter tuning. Implement Kometo to potentially reduce computational costs and improve model performance in scenarios where evaluations are expensive and gradient information is unavailable.

Key insights

Kometo optimizes functions using biased, multi-fidelity approximations without prior knowledge of smoothness or bias function.

Principles

Method

Kometo employs Zipf sampling to open a decreasing number of cells at each depth, gradually reducing fidelity. Cross-validation selects the best cell, adapting to unknown smoothness and bias parameters.

In practice

Topics

Best for: NLP Engineer, AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.