Robust State-Conditional Feature-Weighted Jump Models for Temporal Clustering

2026-06-12 · Source: stat.ML updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, extended

Summary

A Robust State-Conditional Feature-Weighted Jump Model (FWJM) is introduced for time-dependent clustering, incorporating a Tukey's biweight loss function for robustness against outliers and a penalty to encourage smooth temporal transitions. This model also features an additional parameter that controls the variability of feature weights across different states, allowing for state-specific feature relevance. Simulation studies, including scenarios with T=1000 observations and P=5 or P=50 features, demonstrate that FWJM accurately recovers true cluster sequences and reliably identifies relevant features, consistently outperforming competing methods, especially in the presence of 5% data contamination. Empirical applications include analyzing daily conflict-related homicides in Kosovo from 1998-2000 (T=1081, P=3) and macroeconomic performance indicators for twelve European countries from 1949-2024 (T=49, P=36).

Key takeaway

For Machine Learning Engineers analyzing multivariate time series with dynamic regimes and potential outliers, you should consider implementing Robust Feature-Weighted Jump Models. This approach provides superior clustering accuracy and identifies state-specific feature relevance, which is crucial for understanding underlying dynamics. Tune hyperparameters like temporal persistence (λ) and feature weight entropy (ζ) based on your data's time series length and feature sparsity to optimize performance.

Key insights

Robust Feature-Weighted Jump Models accurately cluster time series and identify state-specific feature importance, even with outliers.

Principles

State-specific feature weights enhance temporal clustering.
Robust loss functions handle time series outliers effectively.
Temporal regularization improves latent state sequence persistence.

Method

FWJM minimizes an objective function combining dissimilarity to medoids, an entropy penalty for weights, and a temporal persistence penalty. Estimation uses an alternating optimization procedure with three steps: updating medoids, state sequence via dynamic programming, and weights in closed form.

In practice

Use Tukey's biweight loss for outlier robustness.
Select hyperparameters via Silhouette index.
Employ multi-start optimization for global optimum.

Topics

Temporal Clustering
Feature Weighting
Robust Statistics
Time Series Analysis
Jump Models
Multivariate Data

Code references

FedericoCortese/FWJM

Best for: AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

See Counsel's argued verdicts on the open AI decisions leaders are weighing →

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.