From XXLTraffic to EvoXXLTraffic: Scaling Traffic Forecasting to Sensor-Evolving Networks

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, quick

Summary

The XXLTraffic dataset family, extended to EvoXXLTraffic, addresses a critical gap in traffic forecasting benchmarks by accounting for continuously evolving road-sensor networks. Unlike existing benchmarks assuming fixed sensor sets, XXLTraffic spans up to 27 years of California PeMS and Transport for NSW data, offering fixed-sensor subsets for multi-year and standard long-horizon forecasting. EvoXXLTraffic reorganizes this data to expose per-year active sensors, yearly traffic-flow matrices, and yearly graph snapshots across nine PeMS districts, showing growth ratios from +305% to over +10,000%. A yearly streaming forecasting protocol is defined on EvoXXLTraffic, treating each calendar year as a continual task. Benchmarking various baselines, including static spatio-temporal GNNs and evolving-graph methods, reveals that many state-of-the-art (SOTA) results no longer work on this ultra-large evolutionary dataset, highlighting its more realistic reflection of real-world conditions.

Key takeaway

For Machine Learning Engineers developing traffic forecasting models, your current benchmarks likely overstate model performance due to their fixed-sensor assumptions. You should evaluate your spatio-temporal GNNs and other methods against the EvoXXLTraffic dataset's yearly streaming protocol to accurately assess their robustness in continuously evolving sensor networks. This will reveal true model limitations and guide development towards more adaptive, real-world solutions.

Key insights

Existing traffic forecasting benchmarks fail to account for real-world sensor network evolution, leading to unrealistic SOTA performance.

Principles

Method

The EvoXXLTraffic dataset defines a yearly streaming forecasting protocol where each calendar year is a continual task, benchmarking diverse baselines against sensor-evolving networks.

In practice

Topics

Best for: AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.