Windsurf 1.9600.40
Summary
Windsurf has introduced "Adaptive," a new model option that intelligently selects the best underlying AI model for each task to optimize quota usage for self-serve users on Pro, Max, and Teams plans. This feature dynamically chooses models while maintaining a fixed per-token rate for quota drawdown. To support this, the model picker now displays per-model input, output, and cache read token pricing for extra usage, which is currently promoted at $0.50 per 1M input tokens, $2.00 per 1M output tokens, and $0.10 per 1M cache read tokens for two weeks. Additionally, a prompt cache timer and token counts in response cards have been integrated to enhance cost transparency. A bug preventing model switching after the first request was fixed, and affected users had their quota reset and overage restored.
Key takeaway
For AI Architects managing cloud AI costs, the Adaptive model offers a strategic advantage by optimizing model selection and quota consumption. Your teams can benefit from consistent per-token pricing while the system intelligently allocates resources, potentially extending your budget. Evaluate its performance on diverse tasks and leverage the transparent token pricing to forecast and control extra usage expenses effectively.
Key insights
Adaptive model routing optimizes AI quota usage by dynamically selecting the best model for each task at a fixed token rate.
Principles
- Dynamic model selection optimizes resource allocation.
- Cost transparency enhances user control.
Method
The Adaptive model automatically selects an appropriate underlying AI model per task, drawing down quota at a consistent per-token rate, while displaying detailed pricing.
In practice
- Utilize Adaptive model for cost-efficient AI task execution.
- Monitor token pricing in the model picker for extra usage.
Topics
- Adaptive Model Router
- Large Language Models
- Arena Mode
- Plan Mode
- Cascade Agents
Best for: AI Architect, NLP Engineer, CTO, Software Engineer, Machine Learning Engineer, AI Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Windsurf Changelog.