AutoRPA: Efficient GUI Automation through LLM-Driven Code Synthesis from Interactions

· Source: Takara TLDR - Daily AI Papers · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Software Development & Engineering · Depth: Expert, quick

Summary

AutoRPA is a novel framework designed to enhance graphical user interface (GUI) automation by distilling the decision logic of Large Language Model (LLM) based agents into efficient Robotic Process Automation (RPA) functions. Addressing the inefficiency of repeated LLM reasoning in repetitive tasks, AutoRPA combines the flexibility of LLMs with the runtime efficiency of traditional RPA. Its core innovations include a translator-builder pipeline, where a translator agent converts hard-coded ReAct actions into soft-coded procedures, and a builder agent synthesizes robust RPA functions using retrieval-augmented generation over multiple interaction trajectories. Additionally, AutoRPA employs a hybrid repair strategy during code verification, integrating RPA execution with ReAct-based fallback for iterative refinement. Experiments across various GUI environments demonstrate that AutoRPA-generated RPA functions successfully solve similar tasks, reducing token usage by 82% to 96% and significantly improving runtime efficiency and reusability.

Key takeaway

For Machine Learning Engineers developing GUI automation solutions, AutoRPA offers a compelling approach to overcome the inefficiencies of repeated LLM invocations for repetitive tasks. You should consider integrating this framework to distill LLM agent logic into highly efficient RPA functions, significantly reducing token usage by 82% to 96%. This enables more cost-effective and reusable automation scripts, improving overall system performance and scalability for your deployments.

Key insights

AutoRPA efficiently automates repetitive GUI tasks by converting LLM agent logic into robust, token-saving RPA functions.

Principles

Method

AutoRPA uses a translator agent to convert ReAct actions, a builder agent to synthesize RPA functions via RAG, and a hybrid repair strategy for iterative refinement.

In practice

Topics

Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, Automation Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Takara TLDR - Daily AI Papers.