New Azure OpenAI models bring fast, expressive, and real‑time AI experiences in Microsoft Foundry

2026-02-25 · Source: Microsoft Foundry Blog articles · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, Cloud Computing & IT Infrastructure · Depth: Intermediate, short

Summary

Microsoft Foundry is rolling out three new Azure OpenAI models: GPT-5.3-Codex, GPT-Realtime-1.5, and GPT-Audio-1.5. GPT-5.3-Codex, priced at $1.75 per 1M input tokens and $14.00 per 1M output tokens, offers 25% faster execution and unifies advanced coding with broader reasoning for long-running engineering tasks such as multi-step migrations, agentic developer workflows, and automated code reviews. GPT-Realtime-1.5 and GPT-Audio-1.5 improve real-time voice interactions, showing a +5% lift on Big Bench Audio reasoning, +10.23% in alphanumeric transcription, and +7% in instruction following. Both models offer more natural-sounding speech, higher audio quality, improved instruction following, and function calling support, which makes them suitable for conversational voice agents and hands-free workflows. GPT-Realtime-1.5 costs $4.00 per 1M text input tokens and $32.00 per 1M audio input tokens; GPT-Audio-1.5 costs $2.50 per 1M text input tokens and $32.00 per 1M audio input tokens.
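
To make the pricing above concrete, here is a minimal Python sketch that turns token counts into an estimated bill. It is an illustration only: the prices are copied from the summary, the dictionary keys and the `estimate_cost` helper are hypothetical names rather than part of any Azure SDK, and output prices for the two audio models are omitted because the summary does not state them.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the summary above.
# The model keys are illustrative labels, not confirmed deployment names.
# Output prices for the audio models are not given in the summary, so they are absent.
PRICES_PER_MILLION = {
    "gpt-5.3-codex": {"text_input": Decimal("1.75"), "text_output": Decimal("14.00")},
    "gpt-realtime-1.5": {"text_input": Decimal("4.00"), "audio_input": Decimal("32.00")},
    "gpt-audio-1.5": {"text_input": Decimal("2.50"), "audio_input": Decimal("32.00")},
}

TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost(model: str, usage: dict[str, int]) -> Decimal:
    """Return the estimated cost in USD for the given token counts.

    `usage` maps a token kind (e.g. "text_input") to a token count.
    Raises KeyError if the model or a token kind has no known price.
    """
    prices = PRICES_PER_MILLION[model]
    total = Decimal("0")
    for kind, tokens in usage.items():
        if kind not in prices:
            raise KeyError(f"no price listed for {kind!r} on {model!r}")
        total += prices[kind] * tokens / TOKENS_PER_UNIT
    return total


if __name__ == "__main__":
    # Example: a code-review run with 200k input tokens and 50k output tokens.
    cost = estimate_cost("gpt-5.3-codex", {"text_input": 200_000, "text_output": 50_000})
    print(f"Estimated cost: ${cost:.2f}")
```

With these figures, output tokens dominate the GPT-5.3-Codex bill (they cost eight times as much as input tokens), so long generated diffs or reviews matter more for budgeting than large prompts.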

Key takeaway

For NLP engineers and CTOs building complex AI applications, these new Azure OpenAI models improve the handling of long-running tasks and real-time voice interactions. Evaluate GPT-5.3-Codex for multi-step developer workflows and GPT-Realtime-1.5 or GPT-Audio-1.5 for voice-first experiences that require high accuracy and low latency, and use Microsoft Foundry's integrated evaluation and deployment capabilities to move from testing to production.

Key insights

New Azure OpenAI models prioritize continuity and reliability for complex, real-time AI applications and long-running engineering tasks.

Principles

AI systems benefit from sustained context and adaptability.
Reliability and low latency are critical for real-time voice AI.

Method

GPT-5.3-Codex combines advanced coding with broader reasoning, while GPT-Realtime-1.5 and GPT-Audio-1.5 pair improved speech understanding with function calling. Together they support multi-step, context-aware interactions.

In practice

Use GPT-5.3-Codex for large-scale code refactoring.
Deploy GPT-Realtime-1.5 for low-latency voice agents.
Automate code reviews with GPT-5.3-Codex.

Topics

GPT-5.3-Codex
Real-time Voice AI
AI-assisted Coding
Conversational AI
Microsoft Foundry

Best for: NLP Engineer, CTO, VP of Engineering/Data, AI Engineer, Machine Learning Engineer, Software Engineer

Editorial summary, takeaway, and curation by AIssential. Original article published by Microsoft Foundry Blog articles.