MaxSAT Fuzzing and Delta Debugging
Summary
A systematic study evaluated automated fuzzing techniques for Maximum Satisfiability (MaxSAT) solvers, combining large-scale stress testing with a novel MaxSAT-specific delta debugging method. This approach generated millions of structured MaxSAT instances and efficiently isolated failure-inducing inputs, distilling them into minimal counterexamples. Over 100 hours of fuzzing, previously unknown failures were discovered in almost all 43 solvers from recent MaxSAT competitions and in three certificate-producing solvers. These failures included crashes, incorrect optimality bounds, and severe performance slowdowns, with one certified solver exhibiting a critical soundness error. The resulting corpus of minimal counterexamples was published as a public regression suite, which was adopted for the 2024 MaxSAT Evaluation, reducing average solver failure rates by over 50%.
Key takeaway
For AI scientists developing or deploying MaxSAT solvers, this study highlights the critical need for robust validation. You should integrate systematic fuzz testing and delta debugging into your development pipeline to preemptively identify and fix soundness errors, crashes, and performance issues. Adopting the published regression suite (Paxian 2025a) can immediately improve solver reliability and ensure compliance with community standards like the 2024 MaxSAT Evaluation.
Key insights
Systematic fuzzing and delta debugging significantly enhance MaxSAT solver reliability and development.
Principles
- Combine stress testing with targeted debugging.
- Isolate and minimize failure-inducing inputs.
- Publish counterexamples for community adoption.
Method
Orchestrate parallel generation of millions of structured MaxSAT instances, then apply MaxSAT-specific delta debugging to isolate and distill failing cases into minimal counterexamples for precise failure localization.
In practice
- Integrate fuzzers into solver development workflows.
- Use minimal counterexamples for regression testing.
Topics
- MaxSAT Solvers
- Fuzz Testing
- Delta Debugging
- Solver Robustness
- Regression Testing
Best for: AI Scientist, AI Researcher, Research Scientist, Software Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Journal of Artificial Intelligence Research.