Grok 4.0 Goes GA in Microsoft Foundry and Grok 4.1 Fast Arrives with Major Enhancements

Source: Microsoft Foundry Blog articles · Field: Technology & Digital (Artificial Intelligence & Machine Learning, Cloud Computing & IT Infrastructure, Emerging Technologies & Innovation) · Depth: Intermediate, short

Summary

Grok 4.0 is now generally available (GA) in Microsoft Foundry, giving enterprises a production-ready path for deploying xAI's frontier reasoning models. Grok 4.1 Fast follows in two variants, Reasoning and Non-Reasoning, each either available now or coming soon in Microsoft Foundry. Grok 4.1 Fast brings improved conversational quality, enhanced creativity, greater emotional awareness, and reduced hallucination. The Non-Reasoning variant is optimized for high-throughput tasks such as summarization and classification; it is priced at $0.20 per 1M input tokens and $0.50 per 1M output tokens, with Public Preview availability on February 27, 2026. The Reasoning variant is designed for multi-step reasoning and the interpretation of complex inputs. Microsoft Foundry adds governance, compliance, and operational tooling for these deployments, along with integrated safety features.
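The per-token prices quoted above translate into a workload budget with simple arithmetic. The sketch below is illustrative only: the function name and the sample workload (10,000 documents at roughly 2,000 input and 200 output tokens each) are assumptions, and only the two prices come from the summary.

```python
from decimal import Decimal

# Prices quoted in the summary for Grok 4.1 Fast (Non-Reasoning), USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.20")
OUTPUT_PRICE_PER_M = Decimal("0.50")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate USD cost for a workload at the quoted per-token prices."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * INPUT_PRICE_PER_M
             + Decimal(output_tokens) * OUTPUT_PRICE_PER_M)
    return total / TOKENS_PER_M


# Hypothetical workload (an assumption, not from the article):
# 10,000 documents, about 2,000 input and 200 output tokens each.
docs = 10_000
cost = estimate_cost(docs * 2_000, docs * 200)
print(f"Estimated cost: ${cost:.2f}")  # prints "Estimated cost: $5.00"
```

Decimal is used instead of float so that small per-token amounts do not pick up binary rounding noise. Actual billing may differ from this estimate, so treat it as a rough planning figure.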

Key takeaway

For CTOs and VPs of Engineering evaluating frontier AI models, the Reasoning and Non-Reasoning variants of Grok 4.1 Fast in Microsoft Foundry offer a flexible path to production. Match the variant to the task's requirements to balance performance and cost, and enable Azure AI Content Safety to mitigate the risks that come with more capable models.

Key insights

Grok 4.1 Fast comes in separate Reasoning and Non-Reasoning variants, so each application can be matched to the one that best balances performance and cost.

Principles

Method

Deploy Grok 4.1 Fast (Reasoning) for complex analysis and Grok 4.1 Fast (Non-Reasoning) for high-throughput tasks such as summarization, using Microsoft Foundry's enterprise features and safety controls.
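One way to apply this selection rule in application code is a small router that picks a variant per request. This is a minimal sketch under stated assumptions: the deployment names, the task labels, and the choice to default unknown tasks to the Reasoning variant are all hypothetical and do not come from the article or from Microsoft Foundry.

```python
from enum import Enum


class Variant(Enum):
    # Hypothetical deployment names; substitute the names used in your own deployment.
    REASONING = "grok-4.1-fast-reasoning"
    NON_REASONING = "grok-4.1-fast-non-reasoning"


# Task labels the summary describes as high-throughput work.
HIGH_THROUGHPUT_TASKS = {"summarization", "classification"}


def choose_variant(task: str, multi_step: bool = False) -> Variant:
    """Pick a variant: Non-Reasoning only for known high-throughput, single-step tasks."""
    normalized = task.strip().lower()
    if multi_step or normalized not in HIGH_THROUGHPUT_TASKS:
        # Assumption: unknown or multi-step work defaults to the Reasoning variant.
        return Variant.REASONING
    return Variant.NON_REASONING


print(choose_variant("Summarization").value)      # grok-4.1-fast-non-reasoning
print(choose_variant("contract analysis").value)  # grok-4.1-fast-reasoning
```

Defaulting to the Reasoning variant favors answer quality over cost when a task is unrecognized; a cost-sensitive team might reverse that default.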

Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Engineer, MLOps Engineer, AI Architect

Editorial summary, takeaway, and curation by AIssential. Original article published by Microsoft Foundry Blog articles.