Mutation testing for the agentic era

· Source: The Trail of Bits Blog · Field: Technology & Digital — Software Development & Engineering, Artificial Intelligence & Machine Learning, Blockchain & Distributed Ledger Technology · Depth: Advanced, medium

Summary

Trail of Bits has released MuTON and mewt, two new open-source mutation testing tools optimized for agentic use, alongside a configuration optimization skill for AI agents. MuTON specifically supports TON blockchain languages (FunC, Tolk, Tact), while mewt is a language-agnostic core also supporting Solidity, Rust, and Go. These tools aim to overcome the limitations of earlier regex-based tools like Universalmutator and even Slither-mutate, which suffered from slow runtimes, language-specific coupling, and poor result handling. MuTON and mewt leverage Tree-sitter for robust language comprehension and store results in a SQLite database, enabling persistent sessions, flexible filtering, and SARIF export. The article highlights how AI agents, with specialized skills, can significantly reduce configuration complexity and improve result triage, transforming mutation testing into a more efficient process.

Key takeaway

For Security Engineers developing smart contracts or critical applications, traditional code coverage metrics are insufficient. You should integrate MuTON or mewt into your testing pipeline to uncover hidden vulnerabilities that coverage misses. Utilize the new configuration optimization skill with AI agents to streamline campaign setup and efficiently triage results. This approach ensures more thorough verification, reducing the risk of high-severity exploits in your codebase.

Key insights

Mutation testing tools MuTON and mewt, powered by Tree-sitter and AI agents, enhance software quality by efficiently identifying untested code paths.

Principles

Method

MuTON and mewt utilize Tree-sitter for multi-language parsing, generate syntactically valid mutations, and store results in a SQLite database. This enables persistent sessions, flexible filtering, and AI-assisted configuration and triage.

In practice

Topics

Code references

Best for: AI Engineer, AI Security Engineer, Machine Learning Engineer, Software Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The Trail of Bits Blog.