GPT 5.4 leaks

2026-03-05 · Source: Wes Roth · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Emerging Technologies & Innovation · Depth: Intermediate, long

Summary

OpenAI's GPT 5.4 model has been extensively leaked through GitHub code, error logs, and employee screenshots, with confirmation from The Information. This upcoming model is expected to feature a 1 million token context window, a significant increase from GPT 5.2's 400,000 tokens, aligning it with Google and Anthropic's offerings. Key enhancements include an "extreme thinking mode" for prolonged, hours-long reasoning tasks, improved reliability for long-running tasks by better retaining details across multiple steps, and support for full-resolution image uploads crucial for detailed visual analysis in fields like medical imaging or architectural design. Additionally, a new priority inference system with "standard" and "fast" service tiers is anticipated for real-time AI agent applications. These developments occur amidst a growing "Quit GPT" movement, with Anthropic reportedly surpassing OpenAI in estimated first-time downloads due to backlash against OpenAI's actions and dealings with the Department of War.

Key takeaway

For Machine Learning Engineers developing AI agents or applications requiring extensive context and high reliability, GPT 5.4's 1 million token context window and improved long-running task capabilities are critical. You should evaluate the "extreme thinking mode" for computationally intensive reasoning and consider the new priority inference system for latency-sensitive real-time deployments. This release strategy suggests a continuous integration approach to model updates, requiring your teams to adapt to more frequent, smaller-scale changes.

Key insights

GPT 5.4 leaks reveal a 1M token context window, "extreme thinking mode," and full-resolution image support.

Principles

Incremental releases manage user expectations.
Context window size impacts long-task reliability.

Method

OpenAI is adopting a rapid, monthly release cadence to avoid the "hype and letdown" cycle, focusing on consistent, incremental improvements rather than single, grand releases.

In practice

Utilize 1M token context for complex, multi-step agent workflows.
Employ "extreme thinking mode" for deep research or analytical tasks.
Leverage full-resolution image input for precise visual data analysis.

Topics

GPT 5.4
Large Language Models
AI Ethics
Context Window
OpenAI Release Strategy

Best for: Machine Learning Engineer, NLP Engineer, Computer Vision Engineer, AI Engineer, AI Product Manager, Tech Journalist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Wes Roth.