Google fixes several bugs in Gemini usage limits that burned through quotas too fast
Summary
Google has addressed several critical bugs impacting Gemini usage limits, as announced by VP Josh Woodward on May 29, 2026. A significant issue causing one or two Omni video generations to consume an entire quota has been resolved, and Ultra members now receive double the Omni video generations. Additionally, complex requests to the 3.1 Pro model involving large files will no longer excessively deplete quotas, with maximum consumption per prompt now capped. Further improvements include not charging for failed requests, making Flash Lite requests free, and providing more detailed consumption displays for complex features like Deep Research. The platform also now retains model selections across sessions.
Key takeaway
For Gemini subscribers evaluating their plan's value, these bug fixes significantly enhance quota predictability and fairness. You can now expect more consistent usage from your subscription, especially for Omni video generations as an Ultra member or when using the 3.1 Pro model with large files. Review your usage patterns and consider using Flash Lite for free requests to optimize your plan.
Key insights
Google fixed Gemini quota bugs, improving usage fairness and transparency for subscribers.
Principles
- Quota consumption should be predictable.
- Failed operations should not incur cost.
- Complex features need clear billing.
In practice
- Ultra members get double Omni videos.
- 3.1 Pro complex prompts capped.
- Flash Lite requests are now free.
Topics
- Google Gemini
- Usage Limits
- Quota Management
- AI Subscriptions
- Bug Fixes
- Omni Video
Best for: CTO, VP of Engineering/Data, AI Engineer, AI Product Manager, Director of AI/ML, Tech Journalist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.