A Two-Stage Data-Driven Spatiotemporal Analysis to Predict Failure Risk of Urban Sewer Systems Leveraging Machine Learning Algorithms

被引:19
|
作者
Fontecha, John E. [1 ]
Agarwal, Puneet [1 ]
Torres, Maria N. [2 ]
Mukherjee, Sayanti [1 ]
Walteros, Jose L. [1 ]
Rodriguez, Juan P. [3 ]
机构
[1] SUNY Buffalo, Dept Ind & Syst Engn, 411 Bell Hall, Buffalo, NY 14260 USA
[2] SUNY Buffalo, Dept Struct Civil & Environm Engn, Buffalo, NY USA
[3] Univ Andes, Dept Civil & Environm Engn, Bogota, Colombia
关键词
Infrastructure failure risk prediction; machine learning models; maintenance planning; predictive and prescriptive modeling; spatiotemporal analysis; urban sewer system; OF-THE-ART; STATISTICAL-ANALYSIS; STRUCTURAL CONDITION; CLIMATE SENSITIVITY; ASSESSMENT MODEL; PIPES; WATER; NETWORK; STATE; MANAGEMENT;
D O I
10.1111/risa.13742
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Risk-informed asset management is key to maintaining optimal performance and efficiency of urban sewer systems. Although sewer system failures are spatiotemporal in nature, previous studies analyzed failure risk from a unidimensional aspect (either spatial or temporal), not accounting for bidimensional spatiotemporal complexities. This is owing to the insufficiency of good-quality data, which ultimately leads to under-/overestimation of failure risk. Here, we propose a generalized methodology/framework to facilitate a robust spatiotemporal analysis of urban sewer system failure risk, overcoming the intrinsic challenges of data imperfections-e.g., missing data, outliers, and imbalanced information. The framework includes a two-stage data-driven modeling technique that efficiently models the highly right-skewed sewer system failure data to predict the failure risk, leveraging a bidimensional space-time approach. We implemented our analysis for Bogota, the capital city of Colombia. We train, test, and validate a battery of machine learning algorithms-logistic regression, decision trees, random forests, and XGBoost-and select the best model in terms of goodness-of-fit and predictive accuracy. Finally, we illustrate the applicability of the framework in planning/scheduling sewer system maintenance operations using state-of-the-art optimization techniques. Our proposed framework can help stakeholders to analyze the failure-risk models' performance under different discrimination thresholds, and provide managerial insights on the model's adequate spatial resolution and appropriateness of decentralized management for sewer system maintenance.
引用
收藏
页码:2356 / 2391
页数:36
相关论文
共 50 条
  • [31] Data-Driven Distributionally Robust Risk-Averse Two-Stage Stochastic Linear Programming over Wasserstein Ball
    Gu, Yining
    Huang, Yicheng
    Wang, Yanjun
    [J]. JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2024, 200 (01) : 242 - 279
  • [32] Data-Driven Risk-Averse Two-Stage Optimal Stochastic Scheduling of Energy and Reserve With Correlated Wind Power
    Xu, Xiaoyuan
    Yan, Zheng
    Shahidehpour, Mohammad
    Li, Zhiyi
    Yan, Mingyu
    Kong, Xiangrui
    [J]. IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2020, 11 (01) : 436 - 447
  • [33] Two-stage data-driven dispatch for integrated power and natural gas systems by using stochastic model predictive control
    Zhao, Yuehao
    Li, Zhiyi
    Ju, Ping
    Zhou, Yue
    [J]. APPLIED ENERGY, 2023, 343
  • [34] Discrete Train Speed Profile Optimization for Urban Rail Transit: A Data-Driven Model and Integrated Algorithms Based on Machine Learning
    Huang, Kang
    Wu, Jianjun
    Yang, Xin
    Gao, Ziyou
    Liu, Feng
    Zhu, Yuting
    [J]. JOURNAL OF ADVANCED TRANSPORTATION, 2019,
  • [35] Discrete Train Speed Profile Optimization for Urban Rail Transit: A Data-Driven Model and Integrated Algorithms Based on Machine Learning
    Huang, Kang
    Wu, Jianjun
    Yang, Xin
    Gao, Ziyou
    Liu, Feng
    Zhu, Yuting
    [J]. Journal of Advanced Transportation, 2019, 2019
  • [36] Large-scale data-driven financial risk management & analysis using machine learning strategies
    Murugan M.S.
    T S.K.
    [J]. Measurement: Sensors, 2023, 27
  • [37] Data-driven dynamic causality analysis of industrial systems using interpretable machine learning and process mining
    Nadim, Karim
    Ragab, Ahmed
    Ouali, Mohamed-Salah
    [J]. JOURNAL OF INTELLIGENT MANUFACTURING, 2023, 34 (01) : 57 - 83
  • [38] Data-driven dynamic causality analysis of industrial systems using interpretable machine learning and process mining
    Karim Nadim
    Ahmed Ragab
    Mohamed-Salah Ouali
    [J]. Journal of Intelligent Manufacturing, 2023, 34 : 57 - 83
  • [39] Data-driven forward and inverse problems for chaotic and hyperchaotic dynamic systems based on two machine learning architectures
    Zhong, Ming
    Yan, Zhenya
    [J]. PHYSICA D-NONLINEAR PHENOMENA, 2023, 446
  • [40] Data-driven two-stage sparse distributionally robust risk optimization model for location allocation problems under uncertain environment
    Liu, Zhimin
    [J]. AIMS MATHEMATICS, 2022, 8 (02): : 2910 - 2939