CAX-Agent: A Lightweight Agent Harness for Reliable APDL Automation

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Engineering & Applied Sciences · Depth: Expert, extended

Summary

CAX-Agent is a lightweight agent harness designed for reliable automation of MAPDL finite-element simulations, addressing challenges like inconsistent outputs and task failures in LLM-driven engineering. Its architecture features three layers: an LLM service, the agent harness, and a solver backend, incorporating a recovery ladder that escalates from deterministic rule-based patching to model-driven regeneration and human intervention. An empirical evaluation compared three recovery strategies (no_recovery, rule_only, model_only) on 50 standard structural benchmarks, each run three times, totaling 450 case-runs. Model_only achieved the highest completion rate (0.9267), task score (3.59/4), total score (9.16/10), and zero-intervention rate (0.84), significantly outperforming rule_only and no_recovery. The study used simple geometries to isolate recovery policy effects, with human raters scoring task completion under blind conditions, showing strong inter-rater agreement (Cohen's kappa = 0.84).

Key takeaway

For Machine Learning Engineers developing LLM-driven simulation tools, integrating a dedicated agent harness with model-driven recovery is crucial. Your systems will achieve significantly higher completion rates and autonomy, reducing manual intervention compared to rule-based or no-recovery approaches. Consider adopting a layered recovery strategy, starting with deterministic rules and escalating to LLM-driven regeneration, to enhance reliability in complex engineering workflows.

Key insights

An agent harness with model-driven recovery significantly improves reliability and autonomy in LLM-driven engineering simulations.

Principles

Method

CAX-Agent employs a three-layer architecture (LLM service, agent harness, solver backend) with a recovery ladder escalating from deterministic rule patching to LLM-driven regeneration, context enrichment, and human intervention.

In practice

Topics

Best for: AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.