COEVO: Co-Evolutionary Framework for Joint Functional Correctness and PPA Optimization in LLM-Based RTL Generation

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, RTL Design & Synthesis · Depth: Expert, extended

Summary

COEVO is a co-evolutionary framework for Large Language Model (LLM)-based Register-Transfer Level (RTL) code generation that jointly optimizes functional correctness and Power, Performance, and Area (PPA) quality. Unlike existing methods that decouple these objectives, COEVO unifies them within a single evolutionary loop. It treats correctness as a continuous co-optimization dimension, enabled by an enhanced testbench providing fine-grained scoring and diagnostic feedback. An adaptive correctness gate with annealing allows partially correct but architecturally promising candidates to guide the search. To preserve PPA trade-offs, COEVO uses four-dimensional Pareto-based non-dominated sorting with configurable intra-level sorting, replacing scalar fitness. Evaluated on VerilogEval 2.0 and RTLLM 2.0, COEVO achieved 97.5% and 94.5% Pass@1 with GPT-5.4-mini, outperforming all agentic baselines and securing the best PPA on 43 out of 49 synthesizable RTLLM designs.

Key takeaway

For research scientists developing LLM-based hardware design tools, you should consider integrating co-evolutionary frameworks that unify correctness and PPA optimization. Decoupling these objectives or reducing PPA to a scalar fitness risks discarding architecturally superior, partially correct designs. Your approach should leverage continuous correctness scoring and multi-dimensional Pareto selection to explore the design space more effectively and achieve superior functional and PPA outcomes.

Key insights

Jointly optimizing functional correctness and PPA in RTL generation requires continuous co-evolution and multi-objective Pareto sorting.

Principles

Method

COEVO uses LLM-driven evolutionary operators, an enhanced testbench for continuous correctness scoring, and 4D Pareto-based non-dominated sorting with an adaptive correctness gate for survivor selection.

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Hardware Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.