Introducing OpenAI's GPT-image-2 in Microsoft Foundry

· Source: Microsoft Foundry Blog articles · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Emerging Technologies & Innovation · Depth: Intermediate, medium

Summary

OpenAI's GPT-image-2, now generally available through Microsoft Foundry, represents a significant advancement in image generation capabilities for enterprise workflows. This new model offers enhanced control over image output, featuring real-world intelligence with a knowledge cutoff of December 2025, enabling web searches and self-checking outputs. It also provides increased multilingual understanding across Japanese, Korean, Chinese, Hindi, and Bengali, allowing for localized text rendering within images. Furthermore, GPT-image-2 supports 4K resolution, generating detailed and photorealistic images up to 8,294,400 pixels, with dimensions required to be multiples of 16. An intelligent routing layer with two modes (Legacy size selection and Token size bucket selection) automatically optimizes generation configurations, making it suitable for diverse applications in retail, marketing, media, education, and UI/UX design. Pricing is set per 1M tokens, with image input tokens at $8 and output tokens at $30.

Key takeaway

For Machine Learning Engineers and designers building visual content pipelines, GPT-image-2's 4K resolution and intelligent routing layer enable the creation of production-grade assets with precise control. You can now generate highly localized, contextually relevant images at custom dimensions, significantly streamlining workflows for retail, marketing, and UI/UX design. Explore its capabilities in Microsoft Foundry to integrate advanced image generation into your enterprise applications.

Key insights

GPT-image-2 enhances enterprise visual AI with advanced intelligence, multilingual support, 4K resolution, and intelligent routing.

Principles

Method

GPT-image-2 uses an intelligent routing layer with two modes (Legacy or Token size bucket selection) to automatically configure image generation based on request parameters, optimizing for quality and efficiency.

In practice

Topics

Best for: Machine Learning Engineer, Computer Vision Engineer, CTO, AI Engineer, Product Designer, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Microsoft Foundry Blog articles.