Japan’s Sakana Fugu Beats Opus 4.8 and GPT-5.5 by Conducting Them, Not Replacing Them

· Source: Towards AI - Medium · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Emerging Technologies & Innovation · Depth: Intermediate, medium

Summary

Sakana AI, a Tokyo lab, introduced Fugu on June 22, 2026, a novel system that orchestrates leading frontier models like OpenAI's GPT-5.5, Google's Gemini 3.1 Pro, and Anthropic's Claude Opus 4.8, rather than replacing them. Fugu is a small model designed to route tasks, delegate work to the best-suited specialist model, and synthesize their outputs. Benchmarks show Fugu-Ultra, its high-quality variant, surpasses individual models on most demanding tests. For instance, on SWE-Bench Pro, Fugu-Ultra scored 73.7, outperforming Opus 4.8 (69.2), GPT-5.5 (58.6), and Gemini 3.1 Pro (54.2). Similarly, on Terminal Bench 2.1, it achieved 82.1, exceeding GPT-5.5 (78.2), Opus 4.8 (74.6), and Gemini (70.3). While these are vendor-reported figures and not a complete sweep—Fable 5 still leads SWE-Bench Pro—Fugu demonstrates that coordinating existing models can achieve frontier-level performance.

Key takeaway

For AI Engineers or Directors of AI/ML building advanced systems, this demonstrates that your focus shouldn't solely be on acquiring the largest single models. Instead, you should explore orchestration frameworks like Sakana AI's Fugu, which coordinate existing frontier models to achieve superior performance on complex tasks. Evaluate such multi-model approaches against your specific workloads, as they offer a validated path to frontier-level results, potentially optimizing both performance and cost by leveraging existing powerful APIs.

Key insights

Orchestrating multiple frontier models can yield performance exceeding individual top-tier models.

Principles

Method

A small model plans, routes sub-tasks to specialist frontier models, then synthesizes their responses into a single coherent answer.

In practice

Topics

Best for: CTO, VP of Engineering/Data, AI Architect, AI Scientist, AI Engineer, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Towards AI - Medium.