Estimating Mutual Information between Time Series and Temporal Event Sequences Across Diverse Analysis Tasks
Summary
A new nonparametric mutual information estimator has been proposed to robustly quantify dependence between continuous time series and discrete temporal event sequences. This method addresses limitations of existing approaches, which often yield biased or unstable results due to sensitivity to quantization, repeated values, and event redundancy. The estimator directly measures dependence without requiring data transformation, learning, or ad hoc discretization. It incorporates a model for the continuous-discrete duality of real-world time series to manage quantization and repeated-value artifacts, and introduces a latent event clustering strategy to mitigate bias from event co-occurrence and redundancy. Evaluated on four tasks—causality analysis, temporal repetition discovery, discrete covariate selection for forecasting, and continuous feature selection for classification—experiments on synthetic and real-world datasets demonstrated consistent improvements in accuracy, robustness, and interpretability. The approach is presented as a general-purpose dependence operator for heterogeneous temporal data, with code available at https://github.com/HaojiHu/Multimodal-Temporal-Data-Quantification, published on 2026-06-01.
Key takeaway
For data scientists analyzing multimodal temporal data, this new nonparametric mutual information estimator offers a robust alternative to traditional methods. You can now quantify dependence between continuous time series and discrete event sequences more accurately, avoiding issues like quantization sensitivity or event redundancy bias. Consider integrating this approach for tasks such as causality analysis, feature selection, or temporal repetition discovery to achieve more reliable and interpretable results in your models.
Key insights
A nonparametric mutual information estimator quantifies dependence between heterogeneous time series and event sequences without transformation or discretization.
Principles
- Model continuous-discrete duality for robust time series analysis.
- Mitigate event redundancy bias via latent clustering.
- Directly measure dependence without ad hoc transformations.
Method
The method models continuous-discrete duality to handle quantization and repeated values, then applies a latent event clustering strategy to mitigate bias from event co-occurrence and redundancy.
In practice
- Apply for causality analysis in heterogeneous data.
- Use for discrete covariate selection in forecasting.
- Perform continuous feature selection for classification.
Topics
- Mutual Information Estimation
- Time Series Analysis
- Temporal Event Sequences
- Heterogeneous Data
- Causality Analysis
- Feature Selection
Code references
Best for: Research Scientist, AI Scientist, Data Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.