Structural Grid Descriptors Predict Within-Task Solver Success on ARC-AGI

· Source: Machine Learning · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

Structural grid descriptors effectively predict the success of symbolic ARC-AGI solvers, as demonstrated across 44,800 runs involving beam search and Stochastic DFS solvers on 400 ARC tasks. Hand-crafted descriptors, measured at 50% trajectory completion, discriminate successful from failed runs within the same task with a mean within-task best-feature AUC of 0.885. This predictive capability generalizes across solver architectures, achieving AUCs of 0.747-0.762 in transfer scenarios. On a held-out set of 41 tasks, the frozen feature n_components_final yielded an AUC of 0.765. This signal is independent of solver capacity and weakly coupled to score trajectories. Applying this, early stopping at 50% completion reduced beam-search compute by 33.6% while retaining 98.9% of solves, and degenerate-trajectory detection cut SDFS compute by 65.3% with no solve loss. Additionally, 229 of 400 evaluation tasks failed universally due to DSL primitive library limitations, indicating a coverage issue.

Key takeaway

For AI scientists developing or optimizing symbolic ARC-AGI solvers, you should integrate structural grid descriptor analysis into your development workflow. This allows you to predict solver success early, enabling efficient early stopping to reduce compute by up to 65.3% without solve loss. Furthermore, analyzing these descriptors can help identify fundamental DSL coverage limitations, guiding improvements to your primitive library rather than solely focusing on search budget.

Key insights

Structural properties of intermediate grid states reliably predict symbolic ARC-AGI solver success, enabling efficiency gains.

Principles

Method

Hand-crafted grid descriptors, measured at 50% trajectory completion, were used to discriminate successful from failed solver runs.

In practice

Topics

Best for: Research Scientist, AI Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.