Interventional Time Series Priors for Causal Foundation Models

· Source: cs.LG updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Advanced, long

Summary

CausalTimePrior is a novel framework designed to generate synthetic temporal structural causal models (TSCMs) with paired observational and interventional time series, addressing a critical gap in training causal foundation models for time series data. Existing benchmarks provide observational data but lack the interventional targets necessary for learning causal effects. CausalTimePrior supports configurable causal graph structures, nonlinear autoregressive mechanisms, regime-switching dynamics, and multiple intervention types, including hard, soft, and time-varying interventions. A proof-of-concept PFN trained on CausalTimePrior demonstrated in-context causal effect estimation on held-out TSCMs, achieving comparable RMSE to per-dataset VAR baselines without requiring per-sample fitting. The framework generates diverse TSCMs, including 15% with regime-switching dynamics, and maintains a 0% divergence rate across 100K samples, ensuring data quality and stability.

Key takeaway

For research scientists developing foundation models for time series causal inference, CausalTimePrior offers a crucial synthetic data generation framework. You should leverage this tool to create diverse, interventional time series datasets, enabling the training of models that can perform in-context causal effect estimation without task-specific fine-tuning. This approach helps overcome the current limitation of observational-only benchmarks, accelerating the development of robust causal AI systems capable of reasoning about interventions in dynamic environments.

Key insights

CausalTimePrior generates synthetic interventional time series data to train foundation models for time series causal inference.

Principles

Method

The CausalTimePrior method samples TSCMs with time-lagged DAGs, nonlinear structural equations, and diverse noise, then generates paired observational and interventional time series by modifying structural equations for various intervention types.

In practice

Topics

Code references

Best for: Research Scientist, AI Researcher, AI Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.LG updates on arXiv.org.