Robust State-Conditional Feature-Weighted Jump Models for Temporal Clustering

· Source: stat.ML updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, extended

Summary

A Robust State-Conditional Feature-Weighted Jump Model (FWJM) is introduced for time-dependent clustering, incorporating a Tukey's biweight loss function for robustness against outliers and a penalty to encourage smooth temporal transitions. This model also features an additional parameter that controls the variability of feature weights across different states, allowing for state-specific feature relevance. Simulation studies, including scenarios with T=1000 observations and P=5 or P=50 features, demonstrate that FWJM accurately recovers true cluster sequences and reliably identifies relevant features, consistently outperforming competing methods, especially in the presence of 5% data contamination. Empirical applications include analyzing daily conflict-related homicides in Kosovo from 1998-2000 (T=1081, P=3) and macroeconomic performance indicators for twelve European countries from 1949-2024 (T=49, P=36).

Key takeaway

For Machine Learning Engineers analyzing multivariate time series with dynamic regimes and potential outliers, you should consider implementing Robust Feature-Weighted Jump Models. This approach provides superior clustering accuracy and identifies state-specific feature relevance, which is crucial for understanding underlying dynamics. Tune hyperparameters like temporal persistence (λ) and feature weight entropy (ζ) based on your data's time series length and feature sparsity to optimize performance.

Key insights

Robust Feature-Weighted Jump Models accurately cluster time series and identify state-specific feature importance, even with outliers.

Principles

Method

FWJM minimizes an objective function combining dissimilarity to medoids, an entropy penalty for weights, and a temporal persistence penalty. Estimation uses an alternating optimization procedure with three steps: updating medoids, state sequence via dynamic programming, and weights in closed form.

In practice

Topics

Code references

Best for: AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.