Solvita: Enhancing Large Language Models for Competitive Programming via Agentic Evolution

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Expert, quick

Summary

Solvita is an agentic evolution framework designed to enhance Large Language Models (LLMs) for competitive programming by enabling continuous learning without requiring weight updates to the base LLM. Existing multi-agent frameworks for competitive programming are often stateless, failing to retain problem-solving and debugging experience. Solvita addresses this by organizing problem-solving into a closed-loop system involving four specialized agents: Planner, Solver, Oracle, and Hacker. Each agent is coupled with a trainable, graph-structured knowledge network. Outcome signals, such as pass/fail verdicts, test certification quality, and adversarial vulnerabilities, are used as reinforcement learning updates to these network weights, allowing agents to dynamically route future queries based on accumulated experience. Solvita sets a new state-of-the-art among code-generation agents, outperforming existing multi-agent pipelines and nearly doubling the accuracy of single-pass baselines across benchmarks like CodeContests, APPS, AetherCode, and live Codeforces rounds.

Key takeaway

For research scientists developing LLM-based code generation systems, Solvita demonstrates that incorporating agentic evolution with trainable knowledge networks can significantly improve performance on complex reasoning tasks like competitive programming. You should consider designing multi-agent architectures where agents learn and adapt from past experiences, rather than relying on stateless retrieval, to achieve continuous skill improvement and higher accuracy in challenging domains.

Key insights

Solvita enables LLMs to continuously learn and accumulate problem-solving experience in competitive programming via agentic evolution.

Principles

Method

Solvita employs a closed-loop system of strategy selection, program synthesis, certified supervision, and targeted hacking, with outcome signals updating graph-structured knowledge networks for dynamic query routing.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.