Google fixes several bugs in Gemini usage limits that burned through quotas too fast

· Source: The Decoder · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Emerging Technologies & Innovation · Depth: Fundamental Awareness, quick

Summary

Google has addressed several critical bugs impacting Gemini usage limits, as announced by VP Josh Woodward on May 29, 2026. A significant issue causing one or two Omni video generations to consume an entire quota has been resolved, and Ultra members now receive double the Omni video generations. Additionally, complex requests to the 3.1 Pro model involving large files will no longer excessively deplete quotas, with maximum consumption per prompt now capped. Further improvements include not charging for failed requests, making Flash Lite requests free, and providing more detailed consumption displays for complex features like Deep Research. The platform also now retains model selections across sessions.

Key takeaway

For Gemini subscribers evaluating their plan's value, these bug fixes significantly enhance quota predictability and fairness. You can now expect more consistent usage from your subscription, especially for Omni video generations as an Ultra member or when using the 3.1 Pro model with large files. Review your usage patterns and consider using Flash Lite for free requests to optimize your plan.

Key insights

Google fixed Gemini quota bugs, improving usage fairness and transparency for subscribers.

Principles

In practice

Topics

Best for: CTO, VP of Engineering/Data, AI Engineer, AI Product Manager, Director of AI/ML, Tech Journalist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.