MEDLEY-BENCH: Scale Buys Evaluation but Not Control in AI Metacognition

2026-04-17 · Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

MEDLEY-BENCH is a new benchmark designed to evaluate metacognition in 35 AI models across 12 families, focusing on how models monitor and regulate their own reasoning. It assesses independent reasoning, private self-revision, and socially influenced revision using 130 ambiguous instances across five domains. The benchmark introduces two scores: the Medley Metacognition Score (MMS), which aggregates reflective updating, social robustness, and epistemic articulation, and the Medley Ability Score (MAS), derived from four metacognitive sub-abilities. Initial results indicate that while evaluation ability improves with model size within families, control ability does not. A progressive adversarial analysis of 11 models identified two behavioral profiles: some models revise based on argument quality, while others track consensus statistics. The study also found that evaluation was the weakest relative ability across all 35 models, suggesting a systematic knowing/doing gap, and that smaller models often matched or exceeded larger ones in metacognitive competence.

Key takeaway

For AI Scientists developing advanced models, these findings highlight a critical "knowing/doing gap" in metacognition. You should prioritize training methodologies that explicitly reward calibrated and proportional belief updating, rather than focusing solely on output quality. This approach could lead to more robust and reliable AI systems, especially in scenarios involving social influence or ambiguous information.

Key insights

AI models exhibit a dissociation between metacognitive evaluation and control abilities, with evaluation improving with scale but control not.

Principles

Metacognitive competence is not solely a function of model scale.
Evaluation is the weakest relative metacognitive ability in current models.

Method

MEDLEY-BENCH evaluates metacognition by separating independent reasoning, private self-revision, and socially influenced revision using ambiguous instances and two complementary scoring systems.

In practice

Reward calibrated, proportional updating in AI training.
Use MEDLEY-BENCH to measure belief revision under social pressure.

Topics

AI Metacognition
MEDLEY-BENCH
Model Evaluation
Belief Revision
Evaluation-Control Dissociation

Best for: AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.