Metaprogramming, Synthesis & Verification • Nada Amin • YOW! 2025

· Source: GOTO Conferences · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Expert, extended

Summary

Nada Amin's presentation explores metaprogramming, synthesis, and verification, emphasizing the critical role of formal verification in ensuring the correctness of LLM-generated code. The talk introduces Daphne, a verification language used to define design-by-contract specifications and formally prove program properties, as demonstrated with arithmetic expression evaluation and optimization. Daphne's semi-automation, based on decades of research, has been applied in safety-critical systems like AWS's authorization engine, which handles a billion invocations per second. The discussion extends to orchestrating LLMs with verifiers using Monte Carlo Tree Search for verified synthesis, addressing challenges like the "scaffolding paradox" and the need for high-quality edits. It also delves into constraint solving with Holi for synthesizing values and expressions, and relational programming for generating code from high-level specifications, advocating for strong guarantees in future code generation.

Key takeaway

For AI Engineers developing code with large language models, integrating formal verification tools like Daphne into your workflow is crucial. This ensures strong correctness guarantees, reduces manual review burdens, and guides development by identifying bugs early with counter-examples. Consider adopting verification-aware languages and exploring neuro-symbolic orchestration methods to enhance the reliability and trustworthiness of your generated code, particularly for safety-critical applications.

Key insights

Integrating formal verification with LLM-driven code synthesis is essential for ensuring strong guarantees and correctness.

Principles

Method

Monte Carlo Tree Search orchestrates LLMs and verifiers by using verifier feedback in the evaluation phase to guide program synthesis, ensuring correctness at each step.

In practice

Topics

Best for: Research Scientist, CTO, VP of Engineering/Data, AI Scientist, AI Engineer, AI Architect

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by GOTO Conferences.