From XXLTraffic to EvoXXLTraffic: Scaling Traffic Forecasting to Sensor-Evolving Networks
Summary
The XXLTraffic dataset family, extended to EvoXXLTraffic, addresses a critical gap in traffic forecasting benchmarks by accounting for continuously evolving road-sensor networks. Unlike existing benchmarks assuming fixed sensor sets, XXLTraffic spans up to 27 years of California PeMS and Transport for NSW data, offering fixed-sensor subsets for multi-year and standard long-horizon forecasting. EvoXXLTraffic reorganizes this data to expose per-year active sensors, yearly traffic-flow matrices, and yearly graph snapshots across nine PeMS districts, showing growth ratios from +305% to over +10,000%. A yearly streaming forecasting protocol is defined on EvoXXLTraffic, treating each calendar year as a continual task. Benchmarking various baselines, including static spatio-temporal GNNs and evolving-graph methods, reveals that many state-of-the-art (SOTA) results no longer work on this ultra-large evolutionary dataset, highlighting its more realistic reflection of real-world conditions.
Key takeaway
For Machine Learning Engineers developing traffic forecasting models, your current benchmarks likely overstate model performance due to their fixed-sensor assumptions. You should evaluate your spatio-temporal GNNs and other methods against the EvoXXLTraffic dataset's yearly streaming protocol to accurately assess their robustness in continuously evolving sensor networks. This will reveal true model limitations and guide development towards more adaptive, real-world solutions.
Key insights
Existing traffic forecasting benchmarks fail to account for real-world sensor network evolution, leading to unrealistic SOTA performance.
Principles
- Real-world sensor networks continuously grow.
- Fixed-sensor benchmarks misrepresent performance.
- Ultra-long evolutionary data reveals SOTA limitations.
Method
The EvoXXLTraffic dataset defines a yearly streaming forecasting protocol where each calendar year is a continual task, benchmarking diverse baselines against sensor-evolving networks.
In practice
- Benchmark models on EvoXXLTraffic.
- Develop methods for evolving graph data.
- Evaluate SOTA under sensor growth.
Topics
- Traffic Forecasting
- Sensor Networks
- Evolving Graphs
- Spatio-temporal GNNs
- XXLTraffic Dataset
- PeMS Data
Best for: AI Scientist, Machine Learning Engineer, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.