Microsoft's Copilot Cowork moves to usage-based billing and may tap DeepSeek

· Source: The Decoder · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Corporate Strategy & Leadership · Depth: Fundamental Awareness, quick

Summary

Microsoft's Copilot Cowork is transitioning to usage-based pricing and exploring the integration of a self-hosted, fine-tuned DeepSeek V4 model as a more cost-effective alternative. Currently, Copilot Cowork leverages Anthropic's Claude technology, known for its agentic reasoning and high token consumption. The shift to usage-based billing, confirmed by Copilot EVP Charles Lamanna, addresses the unsustainability of flat rates due to users performing "hundreds of tasks a week," a move mirroring GitHub Copilot's recent change. Microsoft is weighing DeepSeek V4, a Chinese AI model, as an optional choice, emphasizing it would be fully hosted on Azure to maintain customer data within Microsoft's cloud and include customized bias safeguards. This strategic direction aligns with CEO Satya Nadella's vision for an AI ecosystem where companies can select and optimize models based on specific use cases and cost considerations, fostering "intense users and intense usage."

Key takeaway

For AI Product Managers evaluating model strategies and cost structures, Microsoft's move to usage-based billing for Copilot Cowork and its consideration of DeepSeek V4 underscore a critical industry shift. You should analyze your team's AI consumption patterns to anticipate future costs under usage-based models. Furthermore, explore integrating a diverse portfolio of fine-tuned, cost-optimized models, ensuring data residency and security, to maintain competitive pricing and performance for your specific use cases.

Key insights

AI service sustainability requires flexible pricing and diverse model options to meet varied user demands.

Principles

In practice

Topics

Best for: CTO, VP of Engineering/Data, AI Architect, Director of AI/ML, AI Product Manager, Executive

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.