Databricks partners with OpenAI on GPT-5.5

· Source: Databricks · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Data Science & Analytics · Depth: Intermediate, quick

Summary

Databricks has partnered with OpenAI to integrate GPT-5.5, OpenAI's newest frontier model, which is designed for advanced agentic work, complex document reasoning, and long-horizon coding tasks within enterprise environments. GPT-5.5 now powers Codex, OpenAI's coding agent, enhancing its reasoning and execution capabilities. The model excels at understanding user intent, enabling it to manage multi-part knowledge work, including code writing and debugging, online research, data analysis, document creation, and software operation. On Databricks' OfficeQA benchmark, which evaluates document-heavy, multi-step analytical tasks, GPT-5.5 achieved a score of 64.66% with oracle retrieval, a 13% improvement over GPT-5.4's 57.14%. In a full-agent workflow evaluation, GPT-5.5 scored 52.63%, significantly reducing errors by 46% compared to GPT-5.4's 36.10%, demonstrating its practical gains in end-to-end enterprise scenarios.

Key takeaway

For AI Architects evaluating large language models for enterprise deployment, GPT-5.5's demonstrated improvements in agentic reasoning and complex task handling, particularly its 46% error reduction in full-agent workflows, indicate a significant leap in practical applicability. You should consider its integration for automating multi-step analytical tasks and enhancing developer productivity, especially when secure, scalable processing of enterprise data is critical.

Key insights

GPT-5.5 significantly advances AI agent capabilities for complex enterprise tasks and coding workflows.

Principles

Method

OfficeQA benchmark measures model performance on document retrieval, table interpretation, and precise calculations using 89,000 pages of U.S. Treasury Bulletins.

In practice

Topics

Best for: CTO, AI Architect, Investor, AI Engineer, Machine Learning Engineer, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Databricks.