Introducing Mistral Small 4

· Source: Simon Willison's Weblog · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Intermediate, quick

Summary

Mistral has released Mistral Small 4, an Apache 2 licensed 119B parameter Mixture-of-Experts model with 6B active parameters. This new model unifies the capabilities of previous flagship models: Magistral for reasoning, Pixtral for multimodal tasks, and Devstral for agentic coding into a single, versatile offering. It supports a `reasoning_effort` parameter with "none" or "high" settings, where "high" provides verbosity equivalent to prior Magistral models. The model is available on Hugging Face as a 242GB download. Additionally, Mistral introduced Leanstral, an open-weight model specifically fine-tuned for generating code in the Lean 4 formally verifiable programming language.

Key takeaway

For AI Architects evaluating new foundation models, Mistral Small 4 offers a compelling option by consolidating diverse capabilities into one model. This unification could simplify deployment and management compared to using separate specialized models. Monitor the Mistral API for the `reasoning_effort` parameter to fine-tune performance for specific reasoning-intensive applications.

Key insights

Mistral Small 4 unifies multiple specialized AI capabilities into a single, versatile model.

Principles

In practice

Topics

Code references

Best for: AI Architect, NLP Engineer, Computer Vision Engineer, AI Engineer, Machine Learning Engineer, Prompt Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Simon Willison's Weblog.