The twilight of the chatbots

· Source: One Useful Thing · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Intermediate, short

Summary

AI model capabilities and release rates are accelerating, with frontier models like Claude Fable and GPT-5.6 facing government access restrictions despite rapid advancements. Assessments from METR, the UK's AI Security Institute, and GDPval indicate "better than exponential" growth in AI's ability to perform human work. For example, Opus 4.7 autonomously built software in 14 hours for \$251, a task estimated to require 2-17 weeks of human engineering. The author's experiments with Fable showed 9 hours of autonomous work on complex software projects. While US companies like Anthropic, OpenAI, and Google lead with frontier models, Chinese open-weights models, though lagging 6-12 months, also show exponential improvement and are cheaper. This evolution is shifting AI usage from co-intelligence with chatbots to managing autonomous agents, which leverage specialized harnesses and apps. An OpenAI study revealed widespread agent adoption across coding, legal, and HR, with a quarter of its workforce running at least four agents weekly. Domain expertise, not profession, correlates with greater agent success. This exponential growth creates "shocks" in policy and markets.

Key takeaway

For Directors of AI/ML evaluating strategic roadmaps, recognize that AI capabilities are evolving exponentially, rendering plans from early 2025 or prior obsolete. Your teams should pivot from chatbot-centric co-intelligence models to deploying and managing autonomous AI agents for complex, long-running tasks. Prioritize developing deep domain expertise within your staff, as this is the primary driver for maximizing agent effectiveness and achieving significant productivity gains. Embrace a management-focused approach to AI integration.

Key insights

AI capabilities are accelerating exponentially, driving a shift from interactive chatbots to autonomous, expert-managed agents.

Principles

In practice

Topics

Best for: Investor, CTO, AI Architect, Director of AI/ML, VP of Engineering/Data, Consultant

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by One Useful Thing.