Understanding Long-Term Dynamics of Individual Metro Usage: A Hidden Semi-Markov State Framework with Survival Analysis

· Source: stat.ML updates on arXiv.org · Field: Transportation & Mobility — Public Transportation & Urban Mobility, Mobility Services & Technology · Depth: Expert, extended

Summary

A new state-based lifecycle modeling framework integrates Hidden Semi-Markov Models (HSMM) with discrete-time survival analysis to characterize individual metro usage evolution over multi-year horizons. This approach addresses gaps in existing methods that treat mobility patterns as static or focus on short-term variability. Applied to four years of Shanghai metro smart card data (2021–2024) from 120,000 users, the framework identifies five robust mobility states and a directional transition hierarchy centered on an occasional-usage gateway state. The analysis reveals that exit hazard is state-dependent but duration-independent, while re-entry hazard decays sharply with inactivity length, offering a comprehensive view of transit participation lifecycles.

Key takeaway

For transit operators and planners aiming to improve ridership forecasting and passenger retention, you should adopt dynamic lifecycle monitoring over static user segmentation. This framework allows you to identify users vulnerable to disengagement based on their inferred behavioral states and understand how participation evolves, enabling the design of more targeted retention and reactivation strategies.

Key insights

The framework models individual metro usage lifecycles by integrating latent mobility states with exit and re-entry hazards.

Principles

Method

Sequentially integrate HSMM for latent state inference and trajectory decoding (Viterbi algorithm) with discrete-time survival models (logistic regression) for state-dependent exit/re-entry hazard estimation.

In practice

Topics

Best for: AI Scientist, Research Scientist, Data Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.