Previewing GPT-5.6 Sol: a next-generation model

2026-06-18 · Source: OpenAI News · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cybersecurity & Data Privacy, Robotics & Autonomous Systems · Depth: Advanced, medium

Summary

OpenAI has initiated a limited preview of its GPT-5.6 series, featuring Sol as the flagship model, Terra for balanced work, and Luna for fast, affordable capabilities. Terra offers performance competitive with GPT-5.5 at half the cost, while Luna provides strong capabilities at the lowest price point. GPT-5.6 Sol introduces OpenAI's most robust safety stack, with enhanced protections against high-risk activities and misuse, developed through extensive pressure-testing. The models demonstrate improved agentic capabilities across coding (Terminal-Bench 2.1), biology (GeneBench v1), and cybersecurity (ExploitBench, ExploitGym), with Sol setting new benchmarks. New features include "max" reasoning effort and an "ultra" mode leveraging subagents. Pricing for 1M tokens is \$5 input / \$30 output for Sol, \$2.50 input / \$15 output for Terra, and \$1 input / \$6 output for Luna. Sol will also launch on Cerebras at up to 750 tokens per second in July.

Key takeaway

For AI Engineers evaluating next-generation models, the GPT-5.6 series offers significant capability boosts in coding, biology, and cybersecurity, coupled with a robust safety stack. You should consider Sol for demanding agentic tasks requiring deep reasoning or "ultra" mode, or Terra/Luna for cost-efficient, balanced performance. Be aware that initial limited preview access and layered safeguards may occasionally pause or block legitimate dual-use security work, requiring feedback during the preview period.

Key insights

OpenAI's GPT-5.6 series enhances AI capabilities across domains while integrating a multi-layered safety framework and new reasoning modes.

Principles

Layered safeguards are crucial for frontier models.
Automated red-teaming improves safeguard robustness.
Differentiated access balances safety and utility.

Method

Safeguards involve model-trained refusals, real-time misuse classifiers, account-level review, and automated red-teaming with human expert testing.

In practice

Utilize "max" reasoning effort for deep problem-solving.
Employ "ultra" mode with subagents for complex tasks.
Leverage GPT-5.6 for vulnerability research and patch development.

Topics

GPT-5.6 Sol
Large Language Models
AI Safety
Cybersecurity
Automated Red Teaming
Model Pricing

Best for: CTO, VP of Engineering/Data, Machine Learning Engineer, AI Engineer, Director of AI/ML, AI Security Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by OpenAI News.