A Generative Model for Closed-Loop Microsimulation of Signalized Intersections

· Source: Takara TLDR - Daily AI Papers · Field: Transportation & Mobility — Autonomous Vehicles & Smart Transportation, Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, medium

Summary

Enactor is an actor-centric generative model designed for closed-loop microsimulation of signalized intersections, addressing limitations of traditional traffic models and short-horizon trajectory predictors. It focuses on vehicle behavior, incorporating pedestrians as contextual influences, and encodes dynamic actors and lane polylines using polar coordinates. The model employs a transformer architecture with distinct spatial and temporal attention blocks to predict distributions for each actor's next-step motion ($s$, $α$). Trained with a closed-loop curriculum, Enactor was evaluated in a 4000-second simulation-in-the-loop test across two intersection geometries. It accurately recovers SUMO data generator's speed and travel-time distributions, achieving KL divergence an order of magnitude lower than a transformer baseline for travel time, and approximately $5\times$ lower for speed at Site 1. Furthermore, Enactor significantly reduces red-light violations by over an order of magnitude and outperforms a constant-velocity baseline on multi-horizon predictive tasks using real-world field data.

Key takeaway

For Machine Learning Engineers developing autonomous driving simulation environments, Enactor demonstrates a robust approach to closed-loop traffic microsimulation. You should consider adopting actor-centric generative models with closed-loop curriculum training to achieve higher fidelity in vehicle interaction and reduce simulation instability. This method significantly improves realism in speed and travel-time distributions while drastically cutting red-light violations, offering a more reliable platform for policy evaluation.

Key insights

Actor-centric generative model Enactor enables stable, realistic closed-loop microsimulation of signalized intersections by learning from its own predictions.

Principles

Method

Encode dynamic actors and lane polylines in polar coordinates. Employ a transformer with separate spatial and temporal attention blocks to predict actor motion. Train using a closed-loop curriculum, exposing the model to its own predictions.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Robotics Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Takara TLDR - Daily AI Papers.