RASP-Tuner: Retrieval-Augmented Soft Prompts for Context-Aware Black-Box Optimization in Non-Stationary Environments

· Source: Takara TLDR - Daily AI Papers · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, medium

Summary

RASP-Tuner is a novel black-box optimization system designed for non-stationary environments where optimal configurations shift with external context. It addresses the high adaptation costs of traditional optimizers like Gaussian Processes (GP) by employing a retrieval-augmented soft prompt approach. The system decomposes online tuning into identifying regime proxies, predicting short-horizon loss with a mixture-of-experts surrogate, and adapting primarily in a low-dimensional prompt subspace. RASP-Tuner utilizes a RealErrorComposer to map heterogeneous streaming metrics to a single differentiable training target. Evaluated on nine synthetic non-stationary benchmarks, an adversarial-context check, and three real-world tabular streams, RASP-Tuner matched or improved cumulative regret compared to GP-UCB and CMA-ES on seven of nine synthetic tasks at horizon T=100. It also demonstrated 8-12 times lower wall-clock time per step than sliding-window GP-UCB on identical hardware.

Key takeaway

For research scientists developing online optimization systems in dynamic, non-stationary environments, RASP-Tuner offers a method to significantly reduce computational overhead while maintaining or improving performance. You should investigate integrating retrieval-augmented soft prompts and mixture-of-experts surrogates to manage context-dependent shifts, especially when full model refits are computationally prohibitive. This approach can lead to more efficient and scalable black-box optimization.

Key insights

RASP-Tuner optimizes black-box systems in non-stationary environments using retrieval-augmented soft prompts for efficient context adaptation.

Principles

Method

RASP-Tuner identifies regime proxies via past context retrieval, predicts loss with a mixture-of-experts surrogate using parameters, context, and a soft prompt, and adapts in a low-dimensional prompt subspace, with full surrogate updates only on error spikes.

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Takara TLDR - Daily AI Papers.