llm-anthropic 0.25.1

2026-05-28 · Source: Simon Willison's Weblog · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Intermediate, quick

Summary

The "llm-anthropic" plugin, version 0.25.1, was released on May 28th, 2026, significantly enhancing access to Anthropic's Claude series large language models. This update introduces direct support for the newly available Claude Opus 4.8 model, which can be accessed using the "claude-opus-4.8" identifier. A key new feature is the "-o fast 1" option, designed to enable "fast mode" for organizations that have this capability activated on their Anthropic accounts, potentially speeding up certain operations. Additionally, the plugin now intelligently sets the default max_tokens for each model to match its maximum possible output, rather than a previously fixed 8,192. This change, addressing issue #72, optimizes token handling and improves overall model utilization for users interacting with Anthropic's latest offerings.

Key takeaway

For AI Engineers integrating Anthropic models, this "llm-anthropic" 0.25.1 update is crucial. You should upgrade your plugin to immediately access Claude Opus 4.8, enabling use of Anthropic's latest flagship model. If your organization has fast mode enabled, utilize the new "-o fast 1" option to potentially accelerate inference. Additionally, the improved default max_tokens handling means you no longer need to manually configure this for optimal model output, streamlining your development workflow.

Key insights

llm-anthropic 0.25.1 integrates Claude Opus 4.8, adds fast mode, and optimizes max_tokens defaults for better model interaction.

In practice

Access Claude Opus 4.8 via `llm-anthropic`.
Enable fast mode with `-o fast 1` option.
Utilize automatic max_tokens for models.

Topics

llm-anthropic
Claude Opus 4.8
Large Language Models
Anthropic API
Fast Mode
Token Management

Code references

simonw/llm-anthropic

Best for: AI Engineer, Machine Learning Engineer, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Simon Willison's Weblog.