Introducing GPT-5.4 mini and nano
Summary
OpenAI has introduced GPT-5.4 mini and nano, their most capable small models yet, designed for speed and efficiency in high-volume workloads like coding and subagents. GPT-5.4 mini significantly outperforms GPT-5 mini across coding, reasoning, multimodal understanding, and tool use, running over 2x faster and approaching GPT-5.4 performance on benchmarks such as SWE-Bench Pro and OSWorld-Verified. GPT-5.4 nano is the smallest and cheapest version, recommended for tasks like classification, data extraction, and simpler coding subagents, also showing a significant upgrade over GPT-5 nano. These models are optimized for latency-sensitive applications, enabling responsive coding assistants, parallel subagents, and real-time multimodal computer use. Both GPT-5.4 mini and nano are available in the API, with GPT-5.4 mini also accessible in Codex and ChatGPT, featuring a 400k context window and specific token pricing.
Key takeaway
OpenAI's new GPT-5.4 mini and nano models offer significant speed and cost efficiencies for high-volume AI/ML workloads like coding and subagents. GPT-5.4 mini is over 2x faster than GPT-5 mini, achieving 54.4% on SWE-Bench Pro and 72.1% on OSWorld-Verified, approaching GPT-5.4 performance at a fraction of the cost (\$0.75/1M input). This enables developers to build more responsive, cost-effective systems by delegating tasks to specialized, faster models.
Topics
- GPT-5.4 mini
- GPT-5.4 nano
- Coding Assistants
- Subagents
- Multimodal AI
Best for: AI Architect, CTO, VP of Engineering/Data, AI Engineer, Machine Learning Engineer, Software Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by OpenAI News.