Beyond Static Endpoints: Tool Programs as an Interface for Flexible Agentic Web Services
Summary
ToolPro introduces a novel agent-facing service interface for LLM-based agents interacting with web services, moving "Beyond Static Endpoints." It addresses the limitations of traditional static API endpoints by representing multi-step tool intent as executable tool programs. ToolPro combines constraint-guided program construction, effect-aware replay for "exactly-once" state-modifying calls, and a profile-driven policy to adaptively decide when program execution outperforms stepwise calling. Implemented over MCP-style services with WebAssembly sandboxing, ToolPro was evaluated on real-world applications like Memos, Directus, and MinIO. It demonstrated significant efficiency gains, reducing end-to-end latency by up to 53.4% and client-side traffic by up to 96.1%, with larger improvements under higher network latency and workflow complexity.
Key takeaway
For AI Architects designing agentic web services, consider adopting tool programs to enhance workflow efficiency and reliability. Your systems can significantly reduce end-to-end latency and client-side traffic by consolidating multi-step interactions into single, executable units. Implement effect-aware replay to ensure retry-safe WRITE operations, preventing state corruption during repairs. This approach optimizes resource use and improves agent performance.
Key insights
Tool programs offer an executable, effect-typed interface for LLM agents to manage complex web service workflows efficiently.
Principles
- Represent tool intent as executable programs
- Enforce "exactly-once" semantics for WRITE operations
- Adaptively consolidate interactions based on cost models
Method
ToolPro synthesizes, projects, compiles, and executes programs in a WebAssembly sandbox, using compiler/runtime feedback for bounded in-place repair and logging WRITE outcomes for retry safety.
In practice
- Utilize WebAssembly for secure sandboxed execution
- Implement explicit READ/WRITE effect typing
- Profile network latency to optimize execution mode
Topics
- Agentic Web
- Tool Programs
- LLM Agents
- WebAssembly
- Service Interfaces
- Retry Safety
Code references
Best for: AI Scientist, Research Scientist, AI Engineer, MLOps Engineer, AI Architect
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.SE updates on arXiv.org.