Harness, Scaffold, and the AI Agent Terms Worth Getting Right
Summary
A glossary published on May 25, 2026, by Sergio Paniego and Aritra Roy Gosthipaty clarifies key terminology in the rapidly evolving field of AI Agents, addressing confusion observed at ICLR 2026. It distinguishes between "harness" and "scaffolding," which are often conflated. The "model" is the LLM itself (e.g., Claude, Qwen, GPT), while "scaffolding" is the behavior-defining layer, including system prompts and tool descriptions. The "harness" is the execution layer that calls the model, handles tool calls, and manages stopping conditions. An "agent" is defined as the model plus everything enabling it to act in a loop, commonly simplified as "Agent = Model + Harness." The article also defines "context engineering," "policy," "tool use," "skills," and "sub-agents," along with training-specific terms like "RL Environment," "Trainer," "Rollout," and "Reward." It highlights that products like Claude Code and Codex are specific harnesses built on particular models.
Key takeaway
For AI Engineers building or deploying LLM agents, understanding the precise definitions of "harness" and "scaffolding" is crucial. This distinction clarifies how the model's behavior is defined versus how its actions are executed, especially when designing training pipelines or evaluating agent performance. You should differentiate between the model, its behavioral scaffolding, and the execution harness to optimize agent design and debug complex interactions effectively.
Key insights
AI agent terminology distinguishes the LLM "model" from its "scaffolding" (behavior) and "harness" (execution).
Principles
- Agent = Model + Harness.
- Scaffolding defines model behavior.
- Harness manages agent execution.
Method
Harness engineering designs the execution layer, managing agent stopping, error handling, and guardrails at both training and inference.
In practice
- Use "eval harness" for model checkpoint metrics.
- Distinguish tools (actions) from skills (multi-step goals).
- Employ sub-agents for independent subtasks.
Topics
- AI Agents
- LLM Scaffolding
- Agent Harness
- Context Engineering
- Reinforcement Learning
- Tool Use
Code references
Best for: AI Engineer, Machine Learning Engineer, AI Student
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Hugging Face - Blog.