Understanding Long-Term Dynamics of Individual Metro Usage: A Hidden Semi-Markov State Framework with Survival Analysis
Pith reviewed 2026-06-26 21:58 UTC · model grok-4.3
The pith
A hidden semi-Markov model with survival analysis on four years of Shanghai metro data identifies five mobility states centered on an occasional-usage gateway, where exit risk depends on state but not duration while re-entry risk falls with inactivity length.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The framework reveals five robust mobility states with a directional transition hierarchy centered on an occasional-usage gateway state, and fundamentally different temporal mechanisms governing disengagement and return: exit hazard is state-dependent but duration-independent, whereas re-entry hazard decays sharply with inactivity length.
What carries the argument
Hidden Semi-Markov Model integrated with discrete-time survival analysis, which jointly infers latent states, explicit duration distributions, a transition matrix, and state-dependent hazard functions for exit and re-entry.
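As a rough illustration of what "explicit duration distributions" means in an HSMM, the sketch below samples a state path and per-period trip counts from a toy model. Everything in it is an assumption made for illustration: the three states, every numeric parameter, the binomial emission, and the function name `sample_hsmm`. None of it is taken from the paper, which reports five states.

```python
import random

def sample_hsmm(n_periods, init, trans, dwell_pmf, trip_prob, max_trips=14, seed=0):
    """Sample (states, trips) from a toy hidden semi-Markov model."""
    rng = random.Random(seed)
    n_states = len(init)
    states, trips = [], []
    state = rng.choices(range(n_states), weights=init)[0]
    while len(states) < n_periods:
        # Explicit duration: draw how many periods the rider stays in this state.
        pmf = dwell_pmf[state]
        dwell = rng.choices(range(1, len(pmf) + 1), weights=pmf)[0]
        for _ in range(dwell):
            if len(states) == n_periods:
                break
            states.append(state)
            # Emission: trips this period ~ Binomial(max_trips, trip_prob[state]).
            trips.append(sum(rng.random() < trip_prob[state] for _ in range(max_trips)))
        # Regime change: the diagonal of `trans` is zero, so the state always changes.
        state = rng.choices(range(n_states), weights=trans[state])[0]
    return states, trips

# Hypothetical states: 0 = frequent, 1 = occasional (gateway), 2 = inactive.
init = [0.5, 0.4, 0.1]
trans = [[0.0, 0.9, 0.1],
         [0.3, 0.0, 0.7],
         [0.1, 0.9, 0.0]]
dwell_pmf = [[0.1, 0.2, 0.3, 0.4], [0.5, 0.3, 0.2], [0.2, 0.2, 0.6]]
trip_prob = [0.7, 0.2, 0.0]
states, trips = sample_hsmm(52, init, trans, dwell_pmf, trip_prob)
```

Unlike a plain hidden Markov model, where dwell times are implicitly geometric, the dwell time here comes from its own distribution per state, and the transition matrix only governs which state follows once a dwell ends.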
If this is right
- Operators can flag riders in high-exit states for targeted retention before they disengage.
- Re-entry campaigns can be timed to the early part of inactivity windows when hazard remains high.
- Planning models can replace static user clusters with state trajectories that evolve over years.
- Retention interventions become state-specific rather than uniform across all riders.
Where Pith is reading between the lines
- The same state-and-hazard structure could be tested on bus or bike-share data to check whether the gateway-state pattern and duration-independent exit hold across modes.
- If exit is truly duration-independent, short-term promotions may have limited carry-over once a rider enters a low-usage state.
- System-wide forecasts of ridership loss could incorporate the measured re-entry decay curve to estimate net retention after campaigns.
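The last point can be made concrete with a small calculation. In discrete time, the probability that an inactive rider has returned within a given horizon is one minus the product of the per-period probabilities of staying away. The sketch below assumes an exponentially decaying re-entry hazard; the functional form and the values of `h0` and `decay` are hypothetical and are not the curve measured in the paper.

```python
import math

def reentry_hazard(t, h0=0.30, decay=0.25):
    """Hypothetical re-entry hazard in inactive period t (t = 1, 2, ...)."""
    return h0 * math.exp(-decay * (t - 1))

def cumulative_return(horizon, hazard=reentry_hazard):
    """P(rider has returned within each period up to `horizon`) = 1 - prod(1 - h_t)."""
    still_away = 1.0
    curve = []
    for t in range(1, horizon + 1):
        still_away *= 1.0 - hazard(t)
        curve.append(1.0 - still_away)
    return curve

curve = cumulative_return(12)
```

Under a decaying hazard the curve rises quickly at first and then flattens, so most of the return probability accrues in the early periods of inactivity.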
Load-bearing premise
The four-year Shanghai smart card records, after preprocessing, contain enough consistent signal to recover stable latent states and their hazard patterns without major distortion from unobserved rider differences or recording artifacts.
What would settle it
The claim would be undermined if re-running the identical HSMM-plus-survival pipeline on an independent multi-year smart-card dataset from another city yielded a different number of states, an exit hazard that depends on duration, or a re-entry hazard that does not decay with inactivity.
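One simple descriptive way to look for duration dependence, before fitting any model, is a life-table estimate of the discrete-time hazard: events in period t divided by spells still at risk in period t. The sketch below is a generic estimator, not the paper's procedure; the function name `empirical_hazard` and the example spells are made up for illustration.

```python
from collections import Counter

def empirical_hazard(spells):
    """Life-table hazard from (duration, ended) pairs.

    ended=False marks a censored spell: observed for `duration` periods
    without the event. Returns {t: events at t / spells at risk at t}.
    """
    events = Counter(d for d, ended in spells if ended)
    longest = max(d for d, _ in spells)
    hazard = {}
    for t in range(1, longest + 1):
        at_risk = sum(1 for d, _ in spells if d >= t)
        hazard[t] = events[t] / at_risk
    return hazard

# Hypothetical spells: (periods spent in a state, whether the spell ended in an event).
spells = [(1, True), (1, True), (2, True), (3, False), (3, True), (2, False)]
hazard = empirical_hazard(spells)
```

A roughly flat profile across t within a state would be consistent with a duration-independent hazard, while a falling profile would indicate decay with duration.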
Original abstract
Understanding how individual metro usage evolves over multi-year horizons is essential for transit planning and passenger retention. However, existing approaches typically characterize mobility patterns as static clusters or short-term variability, leaving the lifecycle dynamics of transit participation underexplored. This study proposes a state-based lifecycle modeling framework that integrates Hidden Semi-Markov Models (HSMM) with discrete-time survival analysis to characterize the evolution of individual metro mobility. The HSMM infers latent mobility states with explicit duration distributions and a transition matrix governing regime changes, while the survival component models exit and re-entry events via state-dependent hazard functions conditioned on mobility-state trajectories and behavioral history. Applied to four years of smart card data from the Shanghai metro system (2021-2024), the framework enables the identification of interpretable mobility states, the characterization of transition dynamics, and the quantification of state-dependent exit and re-entry processes. The analysis reveals five robust mobility states with a directional transition hierarchy centered on an occasional-usage gateway state, and fundamentally different temporal mechanisms governing disengagement and return: exit hazard is state-dependent but duration-independent, whereas re-entry hazard decays sharply with inactivity length. These findings provide a methodological foundation for lifecycle-oriented mobility analysis and practical guidance for transit operators to identify at-risk users and time retention interventions.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a framework integrating Hidden Semi-Markov Models (HSMM) with discrete-time survival analysis to characterize long-term individual metro usage dynamics. Using four years of Shanghai smart card data (2021-2024), the HSMM infers five latent mobility states with explicit duration distributions and a transition matrix, while the survival component models state-dependent exit and re-entry hazards conditioned on trajectories and history. The analysis identifies a directional transition hierarchy centered on an occasional-usage gateway state, with exit hazards state-dependent but duration-independent and re-entry hazards decaying sharply with inactivity length.
Significance. If the model specification, estimation, and robustness checks hold, the work supplies a methodological foundation for lifecycle-oriented mobility analysis and practical guidance for identifying at-risk users. The empirical distinction in hazard mechanisms (state-dependent exit vs. duration-dependent re-entry) is a substantive contribution to transit behavior modeling.
minor comments (2)
- [Abstract] The claim of 'five robust mobility states' would be strengthened by explicit reference to the validation procedure (e.g., cross-validation likelihood or state stability metrics) in the main text.
- The manuscript should include the explicit form of the joint likelihood (HSMM emission + duration + survival hazard) to allow readers to assess identifiability of the five states.
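For orientation, a generic likelihood of this kind might be written as below. This is a textbook-style sketch under assumed notation, not the authors' specification: all symbols (\pi, a, p, b, h, f, \alpha, \beta, x_t) are hypothetical, the HSMM part is shown for one fixed segmentation of a rider's sequence, and right-censoring of the final segment is ignored.

```latex
% Hypothetical notation: rider i has K segments; segment k has state s_k,
% duration d_k, and covers the periods in T_k. y_t is observed usage in period t.
\mathcal{L}_i^{\text{HSMM}}
  = \pi_{s_1}
    \prod_{k=2}^{K} a_{s_{k-1} s_k}
    \prod_{k=1}^{K} \Big[ p_{s_k}(d_k) \prod_{t \in T_k} b_{s_k}(y_t) \Big]

% Discrete-time survival part: e_t = 1 if the event (exit or re-entry)
% occurs in period t; h_t is the hazard given state and history x_t.
\mathcal{L}_i^{\text{surv}}
  = \prod_{t} h_t^{\,e_t} (1 - h_t)^{1 - e_t},
\qquad
\operatorname{logit} h_t = \alpha_{s_t} + f(\text{duration}_t) + \beta^{\top} x_t
```

The observed-data likelihood would sum the HSMM term over all admissible segmentations. In this notation, a state-dependent but duration-independent exit hazard corresponds to a flat f with state-specific intercepts \alpha_{s_t}, which is the kind of restriction an explicit likelihood would let readers assess.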
Simulated Author's Rebuttal
We thank the referee for their positive evaluation of the manuscript, accurate summary of the framework and findings, and recommendation for minor revision. We appreciate the recognition of the methodological integration of HSMM with survival analysis and the substantive distinction between exit and re-entry mechanisms.
Circularity Check
No significant circularity detected
The abstract describes a standard application of HSMM integrated with discrete-time survival analysis to infer latent mobility states and state-dependent hazards from smart-card trajectory data. No equations, parameter-fitting procedures, or derived quantities are presented that reduce the reported states, transition hierarchy, or hazard functions to definitions or direct renamings of the model inputs themselves. The central claims concern empirical patterns (five states, directional transitions, duration-independent exit vs. decaying re-entry) obtained after model fitting, with no evidence of self-definitional loops, fitted-input predictions, or load-bearing self-citations in the supplied text. The derivation chain therefore remains self-contained against external data.