Constrained Auto-Bidding via Generative Response Modeling

2026-05-27 · Source: Takara TLDR - Daily AI Papers · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Marketing, Branding & Advertising · Depth: Expert, medium

Summary

The Generative Response Model (GRM) is a new approach for auto-bidding systems designed to maximize advertiser value under budget and ratio constraints, such as cost-per-acquisition. Traditional methods often struggle with non-stationary auction dynamics, leading to constraint violations or performance degradation under distribution shifts. GRM shifts the learning target from direct bidding actions to predicting future responses. It operates as a history-conditioned sequence model that jointly forecasts future traffic volume and horizon-aggregate cost/value curves based on a single bid multiplier. The model's optimality gap is bounded by the dispersion of per-tick marginal value-per-cost under mild monotonicity. A lightweight analytic controller then uses these predictions, employing a 1D root-finding step to enforce active constraints. This controller is proven exact for single-multiplier problems, with constraint violations bounded by prediction error during receding-horizon replanning. Experiments on the AuctionNet dataset demonstrate that GRM enhances constraint stability and overall performance compared to existing baselines.

Key takeaway

For AI Engineers developing auto-bidding systems, you should consider adopting a generative response modeling approach like GRM. This method directly predicts future auction dynamics and value curves, allowing for more stable constraint adherence and improved overall performance compared to reactive or reward-signal-based systems. Implement a lightweight analytic controller with 1D root-finding to precisely manage budget and ratio targets, mitigating constraint violations in non-stationary environments.

Key insights

GRM improves auto-bidding by predicting future responses, enabling precise constraint enforcement and better performance in dynamic auctions.

Principles

Shift learning from actions to responses for better control.
Optimality gap bounded by marginal value-per-cost dispersion.
Analytic controllers can enforce constraints exactly.

Method

GRM uses a history-conditioned sequence model to jointly predict future traffic and cost/value curves from a bid multiplier. An analytic controller then applies 1D root-finding for constraint enforcement.

In practice

Apply GRM for stable auto-bidding.
Use 1D root-finding for constraint control.
Test on AuctionNet-like environments.

Topics

Auto-Bidding Systems
Generative Response Models
Constraint Optimization
AuctionNet Dataset
Bid Multiplier Control
Sequence Modeling

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Takara TLDR - Daily AI Papers.