😺 OpenAI shipped GPT-5.5 today
Summary
OpenAI released GPT-5.5 on April 24, 2026, seven days after Anthropic's Opus 4.7, marking a rapid release cadence in the AI industry. GPT-5.5 is described as "worker-class," designed for task completion rather than just answering questions, and is available in ChatGPT and Codex for Plus, Pro, Business, and Enterprise users, with API access coming soon at $5/$30 per million input/output tokens. The model achieved an 82.7% score on Terminal-Bench 2.0, surpassing Opus 4.7's 69.4%, and matched or exceeded industry professionals on 84.9% of GDPval tasks. It also improved on FrontierMath Tier 4, jumping from 27.1% to 35.4%, and contributed to a new proof in off-diagonal Ramsey numbers. OpenAI rated GPT-5.5 "High" in bio/chem and cyber capabilities, with partner XBOW noting its "Mythos-like hacking" potential, leading to a "Trusted Access for Cyber" initiative.
Key takeaway
For engineering teams evaluating large language models for agentic workflows, you should re-evaluate your current model choices, especially if running agents in production. The new GPT-5.5 demonstrates superior task completion and benchmark performance in several areas, suggesting a potential shift in optimal model selection. Consider implementing a hybrid workflow where Opus 4.7 handles planning and GPT-5.5 executes, as this combination has shown significantly higher performance on senior engineer benchmarks.
Key insights
GPT-5.5's release intensifies AI model competition, showcasing a shift towards task-oriented capabilities and a potential "soul swap" in model characteristics.
Principles
- Rapid release cycles are the new norm.
- Hardware advantage dictates model performance.
- Specialized models excel at specific tasks.
Method
Combine Opus 4.7 for generating a precise, contract-style plan for code rewrites, then use GPT-5.5 to execute that plan faithfully, leveraging its boldness for comprehensive refactoring.
In practice
- Re-test production agents on GPT-5.5 and Opus 4.7.
- Use Opus 4.7 for planning, GPT-5.5 for execution in coding.
- Add explicit targets to GPT-5.5 prompts for bolder actions.
Topics
- GPT-5.5
- Opus 4.7
- AI Agents
- AI Benchmarking
- AI Infrastructure
Best for: CTO, Investor, VP of Engineering/Data, AI Engineer, Machine Learning Engineer, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by The Neuron.