Google Deepmind upgrades Gemini 3 Deep Think for complex science and engineering tasks

· Source: The Decoder · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics, Emerging Technologies & Innovation · Depth: Intermediate, quick

Summary

Google Deepmind has released an upgraded version of its specialized thinking mode, "Gemini 3 Deep Think," designed for complex science, research, and engineering tasks. This enhanced model is accessible to Google AI Ultra subscribers via the Gemini app and through an early access API program via Vertex AI for developers and researchers. Gemini 3 Deep Think demonstrates superior performance across several key benchmarks, scoring 84.6% on ARC-AGI-2 for logical reasoning, 48.4% on Humanity's Last Exam for academic reasoning, and an Elo rating of 3,455 on Codeforces for coding and algorithms. While it leads in logic and coding, its 81.5% on MMMU-Pro for multimodal reasoning is only slightly higher than Gemini 3 Pro Preview's 81.0%, indicating a strong focus on abstract reasoning over visual processing. The model also achieved gold medal-level results in the 2025 Physics and Chemistry Olympiads.

Key takeaway

For AI/ML Directors evaluating models for scientific and engineering applications, Gemini 3 Deep Think offers a significant advantage in logical reasoning, academic problem-solving, and coding. Its benchmark performance, particularly in abstract reasoning, suggests it could accelerate research and development workflows. You should explore its capabilities through the Gemini app or the Vertex AI early access program to assess its fit for your organization's most demanding technical challenges.

Key insights

Gemini 3 Deep Think excels in abstract reasoning, outperforming competitors on logic, academic, and coding benchmarks.

Principles

In practice

Topics

Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Researcher, AI Engineer, Data Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.