The Pulse: ‘Tokenmaxxing’ as a weird new trend
Summary
A new trend called "Tokenmaxxing" is emerging in large tech companies like Meta, Microsoft, and Salesforce, where developers intentionally inflate AI token usage to meet internal metrics. This occurs concurrently with challenges in AI agent subsidies, as Anthropic ceased enterprise plan subsidies and Uber depleted its 2026 AI token budget in three months, suggesting a shift towards per-engineer AI budgeting. Additionally, the industry is seeing developments such as the "Claude Mythos" and reports of Claude's degradation, Cal.com's partial move to closed source citing AI and security, Vercel open-sourcing its "agent factories" tool, and the implementation of AI usage guidelines in the Linux kernel.
Key takeaway
For engineering leaders managing AI initiatives, the "Tokenmaxxing" trend and rapid budget depletion at companies like Uber highlight the critical need for robust AI cost management. You should establish clear, auditable AI usage policies and consider implementing per-engineer AI token budgets to prevent wasteful spending and ensure responsible resource allocation.
Key insights
Developers are inflating AI token usage to meet internal metrics, while companies face challenges managing AI budgets.
Principles
- AI usage metrics can drive unintended behavior.
- Uncapped AI budgets are unsustainable.
In practice
- Monitor AI token usage for anomalies.
- Implement per-engineer AI budgeting.
Topics
- Tokenmaxxing
- AI Usage Metrics
- AI Agent Subsidies
- Per-Engineer AI Budgets
- Open-Source Strategy
Best for: CTO, VP of Engineering/Data, Executive, Director of AI/ML, Consultant, Investor
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by The Pragmatic Engineer.