Neither Layer Alone: Epistemic Integrity Requires Hierarchical Joint Design for Long-Running AI Agents

· Source: cs.SE updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, extended

Summary

Long-running AI agents fail not only when inference fails or tools are underspecified, but when independently evolving model and harness layers change the semantics of belief, capability, and goal commitments across their boundary — a failure class this paper terms "interface volatility". This paper argues that "agent epistemic integrity" (AEI) must be treated as a first-class architectural constraint, achievable only through joint model–harness design organized around an explicit interface contract. The central claim is that this contract is the precondition for joint design, operationalized by a four-level hierarchy: goal validity, action-archetype sequencing, tool-instance selection, and invocation-level failure discrimination. Evaluation and training should derive from this contract, testing whether belief, tool, and goal commitments hold across session boundaries and independent layer upgrades, rather than just local task completion.

Key takeaway

For AI Architects and Machine Learning Engineers designing long-running AI agents, you must prioritize explicit model-harness interface contracts. This prevents "interface volatility" and ensures epistemic integrity across model upgrades and session boundaries. Implement a four-level operational hierarchy for goal validity, action sequencing, tool selection, and failure discrimination. Your evaluation and training should derive from these contract obligations, not just task completion, to ensure robust, resumable agent behavior.

Key insights

Long-running AI agents need "epistemic integrity" via a hierarchical model-harness interface contract to prevent semantic drift.

Principles

Method

A four-level hierarchy (goal validity, action-archetype sequencing, tool-instance selection, invocation-level failure discrimination) operationalizes the model-harness interface contract, replacing flat action loops with contract-preserving control over persistent state.

In practice

Topics

Best for: Research Scientist, AI Engineer, AI Scientist, AI Architect, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.SE updates on arXiv.org.