Predicting metro incident duration using structured data and unstructured text logs

Authors
Zhao, Yangyang [1]
Ma, Zhenliang [2]
Peng, Hui [1]
Cheng, Zhanhong [3]
Affiliations
[1] Changan Univ, Coll Transportat Engn, Xian, Peoples R China
[2] KTH Royal Inst Technol, Dept Civil & Architectural Engn, Stockholm, Sweden
[3] McGill Univ, Dept Civil Engn, Montreal, PQ, Canada
Funding
National Natural Science Foundation of China;
Keywords
Incident duration prediction; metro; transportation resilience; text analysis; topic model; BAYESIAN NETWORK; MIXTURE MODEL; TRANSPORT; TRENDS; LIKELIHOOD; REGRESSION; SELECTION; FORECAST; TOPICS; TREE;
DOI
10.1080/23249935.2024.2396951
Chinese Library Classification
U [Transportation];
Discipline classification code
08 ; 0823 ;
Abstract
Predicting metro incident duration is crucial for passengers and transit operators to choose appropriate response strategies. Most existing research focuses on structured data, while the rich information embedded within unstructured incident logs is often neglected. This paper incorporates a probabilistic topic model tailored for short texts, the biterm topic model, into generic incident duration prediction models. By capturing text co-occurrence patterns through Bayesian inference, the biterm topic model extracts hidden topics from incident narratives, and each topic serves as a condensed summary of detailed incident causes and countermeasures. These extracted topics are then combined with structured information to serve as predictors. We validated our model using five years of incident data from the Hong Kong Mass Transit Railway across two scenarios: with all incident information available, and with information revealed over time. The results demonstrate that our method significantly improves the prediction accuracy, particularly for incidents lasting longer than 30 minutes.
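The co-occurrence idea behind the biterm topic model can be illustrated with a minimal sketch. It only shows how biterms (unordered word pairs appearing together in one short text) are collected and counted; the Bayesian topic inference and the duration predictor are not shown. The log snippets and the function name `extract_biterms` are hypothetical and are not taken from the paper or its data.

```python
from collections import Counter
from itertools import combinations


def extract_biterms(tokens):
    """Return all unordered word pairs (biterms) from one short text."""
    return [tuple(sorted(pair)) for pair in combinations(tokens, 2)]


# Hypothetical incident log snippets, invented for illustration only.
logs = [
    "signal fault train delayed",
    "signal fault repaired service resumed",
    "passenger ill train delayed",
]

# Corpus-level biterm counts: the co-occurrence statistics that a
# biterm topic model would be fitted to.
biterm_counts = Counter()
for log in logs:
    biterm_counts.update(extract_biterms(log.split()))

# Show the most frequent biterms in this toy corpus.
for biterm, count in biterm_counts.most_common(3):
    print(biterm, count)
```

In a full pipeline, topic proportions inferred from these biterms would be appended to the structured features of each incident before fitting a duration model.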
Pages: 29
Related papers
50 in total
  • [1] Predicting Project's Uncertainty Risk in the Bidding Process by Integrating Unstructured Text Data and Structured Numerical Data Using Text Mining
    Lee, JeeHee
    Yi, June-Seong
    [J]. APPLIED SCIENCES-BASEL, 2017, 7 (11):
  • [2] Proposed Architecture for Automatic Conversion of Unstructured Text Data into Structured Text Data on the Web
    Madhusudhan, Ch.
    Rao, K. Mrithyunjaya
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2013, 13 (12): : 110 - 116
  • [3] Predicting Customer Behavior with Combination of Structured and Unstructured Data
    Afolabi, Ibukun T.
    Worlu, Rowland E.
    Adebayo, O. P.
    Jonathan, Oluranti
    [J]. 3RD INTERNATIONAL CONFERENCE ON SCIENCE AND SUSTAINABLE DEVELOPMENT (ICSSD 2019): SCIENCE, TECHNOLOGY AND RESEARCH: KEYS TO SUSTAINABLE DEVELOPMENT, 2019, 1299
  • [4] Predicting Mortality in Critical Care Patients with Fungemia Using Structured and Unstructured Data
    Baxter, Sally L.
    Klie, Adam R.
    Saseendrakumar, Bharanidharan Radha
    Ye, Gordon Y.
    Hogarth, Michael
    Nemati, Shamim
    [J]. 42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 5459 - 5463
  • [5] INTEGRATION OF STRUCTURED AND UNSTRUCTURED TEXT DATA IN A CLINICAL INFORMATION SYSTEM
    Wei, Ching-Song
    Sung, Sam
    Doong, Simon
    Ng, Peter
    [J]. JOURNAL OF INTEGRATED DESIGN & PROCESS SCIENCE, 2006, 10 (03) : 61 - 77
  • [6] An example of the ESTEST approach to combining unstructured text and structured data
    Williams, D
    Poulovassilis, A
    [J]. 15TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2004, : 191 - 195
  • [7] Predicting adolescent suicidal behavior following inpatient discharge using structured and unstructured data
    Carson, Nicholas J.
    Yang, Xinyu
    Mullin, Brian
    Stettenbauer, Elizabeth
    Waddington, Marin
    Zhang, Alice
    Williams, Peyton
    Perez, Gabriel E. Rios
    Le Cook, Benjamin
    [J]. JOURNAL OF AFFECTIVE DISORDERS, 2024, 350 : 382 - 387
  • [8] Predicting incident duration using random forests
    Hamad, Khaled
    Al-Ruzouq, Rami
    Zeiada, Waleed
    Abu Dabous, Saleh
    Khalil, Mohamad Ali
    [J]. TRANSPORTMETRICA A-TRANSPORT SCIENCE, 2020, 16 (03) : 1269 - 1293
  • [9] Predicting Publication of Clinical Trials Using Structured and Unstructured Data: Model Development and Validation Study
    Wang, Siyang
    Suster, Simon
    Baldwin, Timothy
    Verspoor, Karin
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (12)
  • [10] Predicting Freeway Incident Duration Using Machine Learning
    Hamad, Khaled
    Khalil, Mohamad Ali
    Alozi, Abdul Razak
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS RESEARCH, 2020, 18 (02) : 367 - 380