Meta's new model is Muse Spark, and meta.ai chat has some interesting tools

· Source: Simon Willison's Weblog · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Emerging Technologies & Innovation · Depth: Intermediate, long

Summary

Meta has released Muse Spark, its first model since Llama 4, available through a private API preview and on meta.ai (requiring Facebook or Instagram login). Benchmarks indicate Muse Spark is competitive with Opus 4.6, Gemini 3.1 Pro, and GPT 5.4, though it lags on Terminal-Bench 2.0. The meta.ai interface offers "Instant" and "Thinking" modes, with a "Contemplating" mode planned for longer reasoning. The platform reveals 16 integrated tools, including `browser.search`, `meta_1p.content_search` for Meta content, `meta_1p.meta_catalog_search` for products, `media.image_gen` for image creation, and `container.python_execution` for sandboxed Python 3.9 code. Notably, `container.visual_grounding` performs image analysis, identifying, locating, and counting objects in images, demonstrated by counting raccoon whiskers and pelicans. The model also supports `subagents.spawn_agent` and `third_party.link_third_party_account` for services like Google Calendar. Future open-sourcing of models is anticipated.

Key takeaway

For Machine Learning Engineers evaluating new LLM platforms, Meta's Muse Spark offers a robust tool-integrated environment, including a Python sandbox and advanced visual grounding capabilities. You should explore its `container.python_execution` and `container.visual_grounding` tools on meta.ai to understand its potential for complex multimodal workflows and agentic system development, especially given the promise of future open-source releases.

Key insights

Meta's Muse Spark model integrates advanced tooling for multimodal interaction and sandboxed code execution.

Principles

Method

Meta AI's chat harness exposes tools via direct queries, allowing users to inspect tool names, parameters, and descriptions for enhanced interaction and understanding of agent capabilities.

In practice

Topics

Code references

Best for: Machine Learning Engineer, Computer Vision Engineer, CTO, AI Engineer, AI Scientist, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Simon Willison's Weblog.