New Azure Open AI models bring fast, expressive, and real‑time AI experiences in Microsoft Foundry
Summary
Microsoft Foundry is rolling out three new Azure OpenAI models: GPT-5.3-Codex, GPT-Realtime-1.5, and GPT-Audio-1.5. GPT-5.3-Codex, priced at $1.75/1M input tokens and $14.00/1M output tokens, offers 25% faster execution and unifies advanced coding with broader reasoning for long-running engineering tasks, supporting multi-step migrations, agentic developer workflows, and automated code reviews. GPT-Realtime-1.5 and GPT-Audio-1.5 enhance real-time voice interactions, showing a +5% lift on Big Bench Audio reasoning, +10.23% in alphanumeric transcription, and +7% in instruction following. These models feature more natural-sounding speech, higher audio quality, improved instruction following, and function calling support, suitable for conversational voice agents and hands-free workflows. GPT-Realtime-1.5 text input is $4.00/1M tokens, audio input $32.00/1M tokens; GPT-Audio-1.5 text input is $2.50/1M tokens, audio input $32.00/1M tokens.
Key takeaway
For NLP Engineers and CTOs building complex AI applications, these new Azure OpenAI models offer significant advancements in handling long-running tasks and real-time voice interactions. You should evaluate GPT-5.3-Codex for multi-step developer workflows and GPT-Realtime-1.5/GPT-Audio-1.5 for voice-first experiences requiring high accuracy and low latency, leveraging Microsoft Foundry's integrated evaluation and deployment capabilities to accelerate your projects.
Key insights
New Azure OpenAI models prioritize continuity and reliability for complex, real-time AI applications and long-running engineering tasks.
Principles
- AI systems benefit from sustained context and adaptability.
- Reliability and low latency are critical for real-time voice AI.
Method
The models integrate advanced coding with reasoning (GPT-5.3-Codex) and enhance speech understanding with function calling (GPT-Realtime-1.5, GPT-Audio-1.5) to support multi-step, context-aware interactions.
In practice
- Use GPT-5.3-Codex for large-scale code refactoring.
- Deploy GPT-Realtime-1.5 for low-latency voice agents.
- Automate code reviews with GPT-5.3-Codex.
Topics
- GPT-5.3-Codex
- Real-time Voice AI
- AI-assisted Coding
- Conversational AI
- Microsoft Foundry
Best for: NLP Engineer, CTO, VP of Engineering/Data, AI Engineer, Machine Learning Engineer, Software Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Microsoft Foundry Blog articles.