Introducing Adaptive: a smarter way to use Windsurf
Summary
Windsurf has released several updates, including an "Adaptive" model router, a redesigned model picker with integrated pricing context, and the removal of daily usage limits for Max plan users. The "Adaptive" router selects a suitable model for each task, aiming to extend user quota by avoiding overuse of premium models, and is available to all self-serve users (Pro, Max, Teams). The updated model picker now displays token pricing directly, including input, output, and cache read rates for extra usage (e.g., USD 0.50 per 1M input tokens, USD 2.00 per 1M output tokens, and USD 0.10 per 1M cache read tokens). Additionally, a new timer highlights prompt caching, and response cards show token counts. Max users now have weekly rather than daily quota limits, which offers more flexibility for bursty workloads.
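To make the example rates concrete, the sketch below estimates the extra-usage cost of a single request. It assumes the three quoted rates apply as flat per-token prices and that cache read tokens are billed separately from fresh input tokens. The helper name and the token counts are hypothetical, and actual Windsurf billing may differ.

```python
# Example extra-usage rates quoted in the summary, in USD per 1M tokens.
RATES_PER_MILLION = {"input": 0.50, "output": 2.00, "cache_read": 0.10}


def estimate_cost_usd(input_tokens, output_tokens, cache_read_tokens=0):
    """Estimate request cost under the assumed flat per-token rates (hypothetical helper)."""
    counts = {
        "input": input_tokens,
        "output": output_tokens,
        "cache_read": cache_read_tokens,
    }
    return sum(counts[k] * RATES_PER_MILLION[k] / 1_000_000 for k in counts)


# Hypothetical request: 20k fresh input tokens, 80k cached tokens, 4k output tokens.
with_cache = estimate_cost_usd(20_000, 4_000, cache_read_tokens=80_000)

# The same request if none of the prompt had been served from cache.
without_cache = estimate_cost_usd(100_000, 4_000)

print(f"with cache: ${with_cache:.4f}, without cache: ${without_cache:.4f}")
```

Under these assumptions, cached prompt tokens cost a fifth of fresh input tokens, which is why the cache timer in the picker is worth watching.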
Key takeaway
For AI Architects and NLP Engineers managing Windsurf deployments, the "Adaptive" model router and transparent pricing in the model picker offer better control over operational costs and quota utilization. Teams can now use the "Adaptive" option to select models automatically, potentially reducing quota consumption by avoiding premium models where they are not needed. Max plan users gain flexibility from the removal of daily limits, which allows more intensive, bursty workloads without interruption, though weekly quotas still need careful monitoring.
Key insights
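Windsurf's new features improve model selection through the "Adaptive" router, cost transparency through token pricing shown in the model picker, and quota flexibility through weekly rather than daily limits for Max users.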
Principles
- Transparency improves user trust.
- Dynamic model routing optimizes resource use.
Method
The "Adaptive" model router dynamically selects the best underlying model for a task, drawing down quota at a fixed per-token rate, while the model picker provides real-time token pricing.
In practice
- Use "Adaptive" in Windsurf for quota optimization.
- Monitor the prompt cache timer for cost awareness.
- Max users can now burst usage within weekly limits.
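The difference between daily and weekly limits for bursty work can be illustrated with a small sketch. The usage figures, the cap values, and the helper below are all hypothetical and in arbitrary quota units; they are not Windsurf's actual limits or enforcement logic.

```python
def first_blocked_day(daily_usage, daily_cap=None, weekly_cap=None):
    """Return the index of the first day whose usage exceeds a cap, or None."""
    total = 0
    for day, used in enumerate(daily_usage):
        total += used
        if daily_cap is not None and used > daily_cap:
            return day
        if weekly_cap is not None and total > weekly_cap:
            return day
    return None


# Hypothetical week: two heavy days and five light ones, 100 units in total.
usage = [5, 40, 45, 5, 0, 0, 5]

# With an assumed daily cap of 20 units, the first heavy day is blocked.
daily = first_blocked_day(usage, daily_cap=20)

# With only an assumed weekly cap of 140 units (7 x 20), the same week fits.
weekly = first_blocked_day(usage, weekly_cap=140)

print(daily, weekly)
```

The same total usage is blocked under a per-day cap but passes under a weekly cap, which is the flexibility the change is meant to provide. The weekly total still has to be tracked, since a long enough burst can exhaust it early in the week.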
Topics
- Windsurf
- Adaptive Model Router
- Model Picker
- Token Pricing
- Quota Management
Best for: AI Architect, NLP Engineer, CTO, AI Engineer, Machine Learning Engineer, MLOps Engineer
Editorial summary, takeaway, and curation by AIssential. Original article published by Windsurf Blog.