PreAct: Computer-Using Agents that Get Faster on Repeated Tasks

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Software Development & Engineering · Depth: Expert, quick

Summary

PreAct is a novel system designed to accelerate computer-using agents on repeated tasks by compiling successful runs into small state-machine programs. Unlike agents that re-reason every step, PreAct replays these compiled programs directly, achieving 8.5-13x faster execution without incurring per-step language-model calls. The system incorporates a crucial screen-state verification during replay, ensuring accuracy and reverting control to the agent if discrepancies arise. Furthermore, a rigorous store-time check validates that newly compiled programs genuinely solve tasks, preventing the accumulation of faulty programs and improving performance by 1.75-2.6 tasks per benchmark across mobile, desktop, and web platforms. PreAct also includes a fallback mechanism for novel situations, maintaining performance against strong record-and-replay baselines, and found that prompt wording, runtime guardrails, or the program selection method did not matter.

Key takeaway

For AI Engineers developing computer-using agents for repetitive workflows, PreAct demonstrates a critical path to significant efficiency gains. You should consider implementing a program compilation and replay mechanism to accelerate task execution by 8.5-13x, drastically reducing language model inference costs. Crucially, integrate robust screen-state validation and independent program evaluation to ensure reliability and prevent the accumulation of faulty automation, thereby maintaining consistent performance across diverse platforms.

Key insights

PreAct enables computer-using agents to learn and accelerate repeated tasks via compiled state-machine programs with robust validation.

Principles

Method

PreAct compiles a successful agent run into a state-machine program. This program is stored only after an independent evaluator confirms task completion. During replay, it checks screen state before acting, reverting to the agent if mismatches occur.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.