我的「破產版」賈維斯:調教 AI Agent 的血淚史

· Source: AI on Medium · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, Robotics & Autonomous Systems · Depth: Intermediate, short

Summary

A developer recounts the challenging process of training an AI agent, dubbed "Budget Jarvis," to automate web-based data retrieval using Claude, Playwright, and Brave API. Initial attempts led to excessive token consumption and system crashes due to bloated web structures and the AI's tendency to "memorize" irrelevant page elements. The agent also exhibited "confirmation OCD," performing redundant steps for simple form filling, and a "lack of judgment," opting for browser-based navigation over more efficient API calls for straightforward queries. Through iterative prompting and system adjustments, including a sliding window approach for context management and explicit instructions on planning and cost awareness, the developer significantly improved the agent's efficiency and accuracy.

Key takeaway

For AI Architects designing autonomous agents, recognize that simply providing powerful tools like large language models and browser automation is insufficient. You should prioritize teaching your agents judgment and cost awareness through careful prompt engineering, rather than solely focusing on increasing context windows. Explicitly instruct agents to plan multi-step actions and to weigh the efficiency of different tools (e.g., direct API calls vs. browser navigation) to prevent excessive resource consumption and improve operational speed.

Key insights

Effective AI agent training requires teaching judgment and cost awareness, not just providing powerful tools.

Principles

Method

Implement a sliding window for context, compressing web page elements by ignoring non-essential sections. Instruct the AI to plan all steps before acting and to weigh tool usage based on "cost awareness" (e.g., API vs. browser).

In practice

Topics

Best for: AI Architect, AI Engineer, Machine Learning Engineer, Prompt Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by AI on Medium.