Google’s cheapest AI model just got 3x more expensive

· Source: Data Science on Medium · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Emerging Technologies & Innovation · Depth: Intermediate, quick

Summary

Google has raised the price of its Gemini 3.5 Flash model to three times that of its predecessor and six times that of the previous Flash-Lite tier. The change challenges the long-held industry expectation that AI inference costs would keep falling. The Flash tier has been positioned as an economical option for high-volume, smaller tasks such as classification and summarization. Despite the price increase, Google is encouraging use of Gemini 3.5 Flash across a wide range of applications. The move suggests a shift in the economics of AI inference, particularly for models designed for cost-effective, high-throughput work.

Key takeaway

AI architects and VPs of Engineering who manage large-scale inference workloads should re-evaluate operational budgets and cost models now. The 3x price increase for Google's Gemini 3.5 Flash indicates that the assumption of perpetually decreasing AI costs no longer holds for every tier. Adjust financial forecasts and evaluate alternative, cost-optimized models or providers to limit unexpected spending spikes.
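
To make the budgeting point concrete, here is a minimal sketch of how a price multiplier flows through a token-based cost model. Everything in it is an assumption for illustration: the `Workload` class, the function names, the token volumes, and the per-million-token prices are hypothetical placeholders, not Google's published rates. Only the 3x and 6x multipliers come from the summary above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Workload:
    """Monthly token volume for one inference workload (hypothetical figures)."""
    input_tokens: int
    output_tokens: int


def monthly_cost(w: Workload, input_price_per_m: float, output_price_per_m: float) -> float:
    """Monthly cost in dollars, given prices per one million tokens."""
    return (
        (w.input_tokens / 1_000_000) * input_price_per_m
        + (w.output_tokens / 1_000_000) * output_price_per_m
    )


def projected_cost(baseline_cost: float, multiplier: float) -> float:
    """Cost of the same workload after prices rise by `multiplier`."""
    return baseline_cost * multiplier


def volume_share_at_flat_budget(multiplier: float) -> float:
    """Fraction of the original token volume a fixed budget still covers."""
    return 1.0 / multiplier


# Placeholder prices in dollars per million tokens (NOT real Gemini rates).
OLD_INPUT_PRICE = 1.00
OLD_OUTPUT_PRICE = 1.00

# Hypothetical workload: 2B input tokens and 500M output tokens per month.
workload = Workload(input_tokens=2_000_000_000, output_tokens=500_000_000)

baseline = monthly_cost(workload, OLD_INPUT_PRICE, OLD_OUTPUT_PRICE)
after_increase = projected_cost(baseline, 3.0)  # 3x versus the predecessor

print(f"baseline: ${baseline:,.2f}")
print(f"after 3x increase: ${after_increase:,.2f}")
print(f"volume covered at flat budget: {volume_share_at_flat_budget(3.0):.0%}")
```

Swapping in a multiplier of 6.0 models a workload that previously ran on the Flash-Lite tier. With real prices, input and output rates may change by different factors, so a single multiplier is only a first approximation.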

Key insights

AI inference costs are rising for high-volume, economical models, challenging prior industry expectations.

Best for: CTO, VP of Engineering/Data, AI Architect, AI Engineer, MLOps Engineer, Director of AI/ML

Editorial summary, takeaway, and curation by AIssential. Original article published by Data Science on Medium.