It's time to be right.

2026-04-30 · Source: Marc Brooker's Blog · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Intermediate, medium

Summary

Speaking at AI Dev 26, the author argues that the future opportunity for agentic AI in software development and knowledge work will be constrained more by defect rate than by inherent capability. The hypothesis is illustrated with a four-quadrant matrix that sorts defects by frequency (high or low) and seriousness (high or low), and identifies the low-frequency, low-seriousness quadrant as the one that enables widespread adoption. The article argues for shifting focus from the "right tail" of positive outcomes to the "left tail" of defects, which currently receives insufficient attention. AWS is addressing this with correct-by-construction tools (Hydro and Cedar), spec-driven development (Kiro), code reasoning (Strata and Lean), autoformalization (Bedrock AR Checks), and deterministic agent steering (Strands Steering). The author also calls for industry-wide changes: benchmarks that capture failure severity, an end-to-end view of agent success that goes beyond code patching, and a research program on agentic AI failure modes.
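The frequency-by-seriousness matrix can be made concrete with a small sketch. Everything below is an illustrative assumption rather than anything taken from the talk: the `DefectProfile` fields, the units, and both thresholds are hypothetical, and a real team would have to choose its own cut-offs.

```python
from dataclasses import dataclass
from enum import Enum


class Quadrant(Enum):
    LOW_FREQ_LOW_SEV = "low frequency, low seriousness"
    LOW_FREQ_HIGH_SEV = "low frequency, high seriousness"
    HIGH_FREQ_LOW_SEV = "high frequency, low seriousness"
    HIGH_FREQ_HIGH_SEV = "high frequency, high seriousness"


@dataclass(frozen=True)
class DefectProfile:
    """Hypothetical summary of how an agent fails on one kind of task."""
    defects_per_100_tasks: float  # assumed frequency measure
    worst_case_cost: float        # assumed seriousness measure, arbitrary units


# Hypothetical cut-offs; the source does not give numeric thresholds.
FREQ_THRESHOLD = 5.0
SEVERITY_THRESHOLD = 1000.0


def classify(profile: DefectProfile,
             freq_threshold: float = FREQ_THRESHOLD,
             severity_threshold: float = SEVERITY_THRESHOLD) -> Quadrant:
    """Place a defect profile into one of the four quadrants."""
    high_freq = profile.defects_per_100_tasks >= freq_threshold
    high_sev = profile.worst_case_cost >= severity_threshold
    if high_freq and high_sev:
        return Quadrant.HIGH_FREQ_HIGH_SEV
    if high_freq:
        return Quadrant.HIGH_FREQ_LOW_SEV
    if high_sev:
        return Quadrant.LOW_FREQ_HIGH_SEV
    return Quadrant.LOW_FREQ_LOW_SEV


if __name__ == "__main__":
    print(classify(DefectProfile(1.0, 50.0)).value)
    print(classify(DefectProfile(20.0, 5000.0)).value)
```

Under these assumed thresholds, only profiles that land in `LOW_FREQ_LOW_SEV` correspond to the quadrant the summary describes as key for widespread adoption.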

Key takeaway

AI engineers deploying agentic systems should recognize that defect rates, not just capabilities, will dictate real-world adoption and business success. Prioritize tools and processes that reduce both the frequency and the seriousness of defects, such as correct-by-construction languages, spec-driven development, and formal code reasoning. The focus should shift from solely maximizing agent performance to rigorously understanding and mitigating failure modes, so that agents earn broad utility and customer trust.

Key insights

Agentic AI adoption hinges on minimizing defect rates and seriousness, not just maximizing capabilities.

Principles

Agentic AI adoption is limited more by defect rate than by capabilities.
Prioritize minimizing defect frequency and seriousness for broad utility.
Agents can effectively mitigate underlying model limitations.
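One way to see why frequency and seriousness both matter is to compare a plain pass rate with a severity-weighted cost. The sketch below is an assumption made for illustration, not a metric proposed in the source: `TaskResult`, its `severity` field, and `expected_defect_cost` are hypothetical names, and the severity values are made up.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class TaskResult:
    """Hypothetical outcome of one benchmark task."""
    passed: bool
    severity: float = 0.0  # assumed cost of the failure; ignored when passed


def pass_rate(results: Sequence[TaskResult]) -> float:
    """Fraction of tasks that passed; blind to how bad the failures were."""
    if not results:
        raise ValueError("results must not be empty")
    return sum(r.passed for r in results) / len(results)


def expected_defect_cost(results: Sequence[TaskResult]) -> float:
    """Average failure cost per task; combines frequency and seriousness."""
    if not results:
        raise ValueError("results must not be empty")
    return sum(r.severity for r in results if not r.passed) / len(results)


if __name__ == "__main__":
    # Two hypothetical agents with identical pass rates but different failures.
    agent_a = [TaskResult(True)] * 9 + [TaskResult(False, severity=1.0)]
    agent_b = [TaskResult(True)] * 9 + [TaskResult(False, severity=100.0)]
    for name, results in (("A", agent_a), ("B", agent_b)):
        print(name, pass_rate(results), expected_defect_cost(results))
```

In this made-up data the two agents are indistinguishable by pass rate, while the severity-weighted figure separates them. That gap is the kind of information a benchmark capturing failure severity would surface.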

Method

Improve agentic AI correctness through correct-by-construction tools (Hydro, Cedar), spec-driven development (Kiro), formal code reasoning (Strata, Lean), autoformalization (Bedrock AR Checks), and deterministic steering (Strands Steering).
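The idea behind deterministic steering can be sketched as a plain rule check that runs before any tool call an agent proposes. This is a generic illustration under stated assumptions and does not use or describe the Strands Steering API: `ToolCall`, the two rules, and `steer` are hypothetical names, and the rules themselves are deliberately simplistic.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence, Tuple


@dataclass
class ToolCall:
    """Hypothetical representation of an action the agent wants to take."""
    tool: str
    args: dict


# A rule returns a denial reason, or None when it has no objection.
Rule = Callable[[ToolCall], Optional[str]]


def no_destructive_shell(call: ToolCall) -> Optional[str]:
    # Naive substring check, for illustration only.
    if call.tool == "shell" and "rm -rf" in call.args.get("command", ""):
        return "destructive shell command"
    return None


def only_workspace_writes(call: ToolCall) -> Optional[str]:
    # Naive prefix check; a real guard would normalize the path first.
    if call.tool == "write_file" and not call.args.get("path", "").startswith("workspace/"):
        return "write outside workspace"
    return None


RULES: Sequence[Rule] = (no_destructive_shell, only_workspace_writes)


def steer(call: ToolCall, rules: Sequence[Rule] = RULES) -> Tuple[bool, str]:
    """Return (allowed, reason). The same input always gives the same answer."""
    for rule in rules:
        reason = rule(call)
        if reason is not None:
            return False, reason
    return True, "allowed"


if __name__ == "__main__":
    print(steer(ToolCall("write_file", {"path": "workspace/notes.txt"})))
    print(steer(ToolCall("shell", {"command": "rm -rf /"})))
```

The design point is that the check is ordinary code outside the model, so its decision does not vary with sampling. Whatever the model proposes, a denied call is never executed.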

In practice

Adopt correct-by-construction coding tools.
Use spec-driven development to guide system evolution.
Apply formal code reasoning to verify properties.

Topics

Agentic AI
Software Defects
AI Correctness
Formal Methods
Spec-driven Development
AI Benchmarking

Code references

strata-org/Strata

Best for: CTO, VP of Engineering/Data, AI Architect, AI Engineer, Machine Learning Engineer, Director of AI/ML

Editorial summary, takeaway, and curation by AIssential. Original article published by Marc Brooker's Blog.