EvoDrive: Pareto Evolution for Safety-Critical Autonomous Driving via Self-Improving LLM Agents
Summary
EvoDrive is introduced as the first automated, LLM-based agentic evolution framework designed for multi-objective scenario generation in safety-critical autonomous driving systems. This framework addresses the challenge of maximizing adversariality to expose system failures while simultaneously preserving realism, a trade-off often managed by handcrafted heuristics in existing methods. EvoDrive utilizes a simulator-grounded actor-critic architecture, where a memory-driven actor iteratively proposes generator improvements, critics filter implausible candidates, and a self-evolving world evaluator optimizes simulation budgets by routing promising proposals. It also maintains a Pareto archive of evaluated candidates to ensure diverse attack-realism trade-offs and guide future evolution through simulation feedback. Benchmark results on MetaDrive and CARLA demonstrate that EvoDrive significantly expands the Pareto frontier across various generators and generates valuable scenarios for policy training.
Key takeaway
For autonomous driving engineers focused on system validation and policy training, EvoDrive offers a robust approach to scenario generation. You should consider integrating LLM-based agentic evolution to move beyond handcrafted heuristics, enabling the creation of more adversarial yet realistic test cases. This framework helps expand the Pareto frontier of attack-realism trade-offs, directly improving the robustness and safety of your autonomous systems by exposing underexplored failure patterns.
Key insights
EvoDrive uses LLM-based agentic evolution to generate diverse, safety-critical autonomous driving scenarios by balancing adversariality and realism.
Principles
- Maximize adversariality while preserving realism.
- Employ multi-objective optimization for scenario generation.
- Use simulation feedback to guide evolution.
Method
EvoDrive employs a simulator-grounded actor-critic architecture with a memory-driven actor, critics, and a self-evolving world evaluator. It maintains a Pareto archive for diverse trade-offs.
In practice
- Generate diverse attack-realism scenarios.
- Improve autonomous driving policy training.
Topics
- Autonomous Driving
- Scenario Generation
- LLM Agents
- Pareto Optimization
- Actor-Critic Architecture
- Safety-Critical Systems
Best for: Computer Vision Engineer, Research Scientist, AI Scientist, Robotics Engineer, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.