Amortizing Maximum Inner Product Search with Learned Support Functions

· Source: stat.ML updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, extended

Summary

Amortized Maximum Inner Product Search (MIPS) is a novel learning-based approach that trains neural networks to predict MIPS solutions, amortizing computational costs for queries from a fixed distribution. This method leverages the MIPS value function's property as a support function, which is convex and 1-homogeneous, with its gradient equaling the optimal key. Two architectures are proposed: SupportNet, an Input Convex Neural Network (ICNN) that models the support function directly, recovering keys via gradient computation; and KeyNet, which directly regresses the optimal key, bypassing inference-time gradient computation. Experiments on BEIR benchmark datasets (FIQA, Quora, Natural Questions, HotpotQA) with $d=384$ dimensional embeddings demonstrate high match rates and significant speedups. The approach also integrates effectively into approximate search indices like FAISS IVF, improving recall with fewer floating-point operations.

Key takeaway

For AI Scientists and Machine Learning Engineers optimizing large-scale MIPS, consider implementing amortized MIPS with SupportNet or KeyNet. This approach significantly reduces inference costs for predictable query distributions, offering speedups and higher match rates on datasets like FIQA and HotpotQA. You can achieve better routing accuracy in clustered databases or enhance approximate search indices by mapping queries to predicted keys, improving recall with fewer operations.

Key insights

Amortized MIPS uses neural networks to predict optimal keys, exploiting the support function's convex and homogeneous properties.

Principles

Method

Train SupportNet (ICNN for support function, gradient for key) or KeyNet (direct key regression) using score regression, gradient matching, or score consistency losses, with optional homogenization wrappers.

In practice

Topics

Best for: Research Scientist, AI Engineer, NLP Engineer, AI Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.