OpenAI launches GPT-5.4 Thinking and Pro combining coding, reasoning, and computer use in one model

· Source: The Decoder · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, Cybersecurity & Data Privacy · Depth: Intermediate, medium

Summary

OpenAI has launched GPT-5.4 Thinking and GPT-5.4 Pro, which for the first time integrate coding, reasoning, agentic workflows, and native computer operation in a single model. The new model scored 83.0 percent on the GDPval benchmark for professional knowledge work, up 12.1 percentage points from GPT-5.2's 70.9 percent, and surpassed human performance on the OSWorld Verified benchmark for desktop environment navigation with a 75.0 percent success rate. A key technical enhancement, "Tool Search" in the API, reduces token consumption by 47 percent by retrieving tool definitions only when needed. Coding gains on SWE-Bench Pro are modest, but a new "/fast" mode in Codex boosts token speed by up to 1.5x. GPT-5.4 also features improved visual perception, reduced hallucinations, and a "High Capability" cybersecurity rating.

Key takeaway

For AI architects evaluating frontier models, GPT-5.4 is a strong candidate for enterprise applications: it combines coding, reasoning, and native computer use in one model and carries a "High Capability" cybersecurity rating. When automating complex workflows or building agentic systems, weigh its gains on professional benchmarks and its token efficiency against its higher per-token pricing.
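The trade-off between token efficiency and per-token price comes down to simple arithmetic. The sketch below assumes the 47 percent reduction applies to an entire workload, which may not hold, since the figure is reported for Tool Search in the API. The function names and the 1.5x price multiplier are hypothetical and chosen for illustration. They are not published pricing.

```python
def break_even_price_multiplier(token_reduction: float) -> float:
    """Price increase at which fewer tokens exactly offset a higher per-token price."""
    if not 0.0 <= token_reduction < 1.0:
        raise ValueError("token_reduction must be in [0, 1)")
    return 1.0 / (1.0 - token_reduction)


def relative_cost(price_multiplier: float, token_reduction: float) -> float:
    """New cost as a fraction of the old cost: price ratio times token ratio."""
    return price_multiplier * (1.0 - token_reduction)


TOKEN_REDUCTION = 0.47          # figure reported for Tool Search
HYPOTHETICAL_PRICE_MULT = 1.5   # assumption for illustration, not real pricing

print(round(break_even_price_multiplier(TOKEN_REDUCTION), 3))
print(round(relative_cost(HYPOTHETICAL_PRICE_MULT, TOKEN_REDUCTION), 3))
```

Under these assumptions, the per-token price could rise by a factor of roughly 1.89 before total cost increased. A workload that sees a smaller token reduction would break even at a correspondingly lower multiplier.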

Key insights

GPT-5.4 unifies coding, reasoning, and computer operation in one model, scoring 83.0 percent on GDPval for professional knowledge work (up from GPT-5.2's 70.9 percent) and 75.0 percent on OSWorld Verified for desktop navigation.

Method

GPT-5.4's API uses "Tool Search" to retrieve tool definitions only when they are needed, which cuts token consumption by 47 percent. Codex adds a "/fast" mode that raises token speed by up to 1.5x, and the model ships with a two-stage monitoring system for cybersecurity.
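The idea behind on-demand tool loading can be sketched in a few lines. This is a minimal illustration, not OpenAI's implementation or API. The `ToolDef` and `ToolRegistry` classes, the keyword-overlap search, and the four-characters-per-token estimate are all hypothetical. They are chosen only to show why sending a short index plus the matching definitions can cost fewer tokens than sending every definition up front.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToolDef:
    """Hypothetical tool record; not an OpenAI API type."""
    name: str
    summary: str  # short description that is always sent
    schema: str   # full definition, sent only when the tool is retrieved


def estimate_tokens(text: str) -> int:
    # Crude assumption: roughly four characters per token.
    return max(1, len(text) // 4)


class ToolRegistry:
    def __init__(self, tools):
        self._tools = list(tools)

    def eager_cost(self) -> int:
        # Baseline: every full definition goes into the prompt.
        return sum(estimate_tokens(t.schema) for t in self._tools)

    def index_cost(self) -> int:
        # On-demand variant: only names and summaries are always present.
        return sum(estimate_tokens(f"{t.name}: {t.summary}") for t in self._tools)

    def search(self, query: str, limit: int = 1):
        # Toy relevance score: number of query words shared with the summary.
        words = set(query.lower().split())
        scored = [(len(words & set(t.summary.lower().split())), t) for t in self._tools]
        hits = [pair for pair in scored if pair[0] > 0]
        hits.sort(key=lambda pair: pair[0], reverse=True)
        return [tool for _, tool in hits[:limit]]

    def lazy_cost(self, query: str, limit: int = 1) -> int:
        # Index plus only the definitions that the search retrieved.
        loaded = self.search(query, limit)
        return self.index_cost() + sum(estimate_tokens(t.schema) for t in loaded)


TOOLS = [
    ToolDef(
        "get_weather",
        "look up current weather for a city",
        '{"name": "get_weather", "parameters": {"city": {"type": "string", '
        '"description": "City name"}, "units": {"type": "string", '
        '"enum": ["metric", "imperial"]}}, "required": ["city"]}',
    ),
    ToolDef(
        "convert_currency",
        "convert an amount between two currencies",
        '{"name": "convert_currency", "parameters": {"amount": {"type": "number"}, '
        '"source": {"type": "string", "description": "ISO 4217 code"}, '
        '"target": {"type": "string", "description": "ISO 4217 code"}}, '
        '"required": ["amount", "source", "target"]}',
    ),
    ToolDef(
        "create_ticket",
        "open a support ticket for a customer",
        '{"name": "create_ticket", "parameters": {"customer_id": {"type": "string"}, '
        '"subject": {"type": "string"}, "priority": {"type": "string", '
        '"enum": ["low", "normal", "high"]}}, "required": ["customer_id", "subject"]}',
    ),
]

registry = ToolRegistry(TOOLS)
query = "convert 20 dollars between currencies"
print([t.name for t in registry.search(query)])
print(registry.eager_cost(), registry.lazy_cost(query))
```

The saving in this toy setup depends entirely on how many tools exist and how long their definitions are, so it should not be read as reproducing the 47 percent figure.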

Best for: Machine Learning Engineer, CTO, AI Architect, AI Engineer, Data Scientist, AI Product Manager

Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.