HawkesNest: A Multi-Axis Synthetic Benchmark for Spatiotemporal Pattern Complexity

· Source: Machine Learning · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

HawkesNest is a new generator-aligned synthetic benchmark designed to evaluate spatiotemporal point process (STPP) models by controlling pattern complexity. Built on a multivariate Hawkes backbone, HawkesNest addresses the challenge of opaque real-world datasets where latent generative structures are unknown, hindering model failure attribution. The benchmark defines four distinct complexity axes: space--time entanglement, background heterogeneity, cross-type interaction, and domain topology. Each axis is associated with a deterministic index derived from the latent data-generating mechanism. By varying these axes while maintaining fixed global rate, stability, and simulation budget, HawkesNest enables diagnostic stress tests. Verification shows the indices are monotone and nearly orthogonal. For instance, Hawkes-family baselines degrade under joint heterogeneity--entanglement complexity, and AutoSTPP exhibits vulnerability to isolated increases in space--time entanglement. The code is available at https://github.com/YahyaAalaila/HawkesNest.

Key takeaway

For Machine Learning Engineers evaluating spatiotemporal point process (STPP) models, HawkesNest offers a critical tool to move beyond opaque real-world data. You should integrate this benchmark into your model development pipeline to precisely diagnose performance degradations under known structural difficulties, such as space--time entanglement or background heterogeneity. This allows you to identify specific vulnerabilities in both traditional Hawkes-family models and neural STPP architectures like AutoSTPP, guiding targeted improvements and more robust model selection.

Key insights

HawkesNest provides a multi-axis synthetic benchmark for diagnostic stress testing of spatiotemporal point process models.

Principles

Method

HawkesNest generates synthetic spatiotemporal point process data using a multivariate Hawkes backbone, varying four complexity axes (space--time entanglement, background heterogeneity, cross-type interaction, domain topology) via deterministic indices, while fixing global rate, stability, and simulation budget.

In practice

Topics

Code references

Best for: AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.