A Two-Stage Data-Driven Spatiotemporal Analysis to Predict Failure Risk of Urban Sewer Systems Leveraging Machine Learning Algorithms

被引:19
|
作者
Fontecha, John E. [1 ]
Agarwal, Puneet [1 ]
Torres, Maria N. [2 ]
Mukherjee, Sayanti [1 ]
Walteros, Jose L. [1 ]
Rodriguez, Juan P. [3 ]
机构
[1] SUNY Buffalo, Dept Ind & Syst Engn, 411 Bell Hall, Buffalo, NY 14260 USA
[2] SUNY Buffalo, Dept Struct Civil & Environm Engn, Buffalo, NY USA
[3] Univ Andes, Dept Civil & Environm Engn, Bogota, Colombia
关键词
Infrastructure failure risk prediction; machine learning models; maintenance planning; predictive and prescriptive modeling; spatiotemporal analysis; urban sewer system; OF-THE-ART; STATISTICAL-ANALYSIS; STRUCTURAL CONDITION; CLIMATE SENSITIVITY; ASSESSMENT MODEL; PIPES; WATER; NETWORK; STATE; MANAGEMENT;
D O I
10.1111/risa.13742
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Risk-informed asset management is key to maintaining optimal performance and efficiency of urban sewer systems. Although sewer system failures are spatiotemporal in nature, previous studies analyzed failure risk from a unidimensional aspect (either spatial or temporal), not accounting for bidimensional spatiotemporal complexities. This is owing to the insufficiency of good-quality data, which ultimately leads to under-/overestimation of failure risk. Here, we propose a generalized methodology/framework to facilitate a robust spatiotemporal analysis of urban sewer system failure risk, overcoming the intrinsic challenges of data imperfections-e.g., missing data, outliers, and imbalanced information. The framework includes a two-stage data-driven modeling technique that efficiently models the highly right-skewed sewer system failure data to predict the failure risk, leveraging a bidimensional space-time approach. We implemented our analysis for Bogota, the capital city of Colombia. We train, test, and validate a battery of machine learning algorithms-logistic regression, decision trees, random forests, and XGBoost-and select the best model in terms of goodness-of-fit and predictive accuracy. Finally, we illustrate the applicability of the framework in planning/scheduling sewer system maintenance operations using state-of-the-art optimization techniques. Our proposed framework can help stakeholders to analyze the failure-risk models' performance under different discrimination thresholds, and provide managerial insights on the model's adequate spatial resolution and appropriateness of decentralized management for sewer system maintenance.
引用
收藏
页码:2356 / 2391
页数:36
相关论文
共 50 条
  • [1] Failure risk analysis of pipelines using data-driven machine learning algorithms
    Mazumder, Ram K.
    Salman, Abdullahi M.
    Li, Yue
    [J]. STRUCTURAL SAFETY, 2021, 89
  • [2] Data-driven dimensional analysis of critical heat flux in subcooled vertical flow: A two-stage machine learning approach
    Yang, Kuang
    Liang, Zhicheng
    Xu, Bo
    Hou, Zhenghui
    Wang, Haijun
    [J]. APPLIED THERMAL ENGINEERING, 2024, 248
  • [3] Data-driven two-stage distributionally robust optimization with risk aversion
    Huang, Ripeng
    Qu, Shaojian
    Gong, Zaiwu
    Goh, Mark
    Ji, Ying
    [J]. APPLIED SOFT COMPUTING, 2020, 87
  • [4] Data-driven atmospheric corrosion prediction model for alloys based on a two-stage machine learning approach
    Chen, Qian
    Wang, Han
    Ji, Haodi
    Ma, Xiaobing
    Cai, Yikun
    [J]. PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2024, 188 : 1093 - 1105
  • [5] A two-stage data-driven metaheuristic to predict last-mile delivery route sequences
    Mesa, Juan Pablo
    Montoya, Alejandro
    Ramos-Pollan, Raul
    Toro, Mauricio
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125
  • [6] Data-driven decision model based on local two-stage weighted ensemble learning
    Xu, Che
    Chang, Wenjun
    Liu, Weiyong
    [J]. ANNALS OF OPERATIONS RESEARCH, 2023, 325 (02) : 995 - 1028
  • [7] Data-driven decision model based on local two-stage weighted ensemble learning
    Che Xu
    Wenjun Chang
    Weiyong Liu
    [J]. Annals of Operations Research, 2023, 325 : 995 - 1028
  • [8] Automated data-driven modeling of building energy systems via machine learning algorithms
    Raetz, Martin
    Javadi, Amir Pasha
    Baranski, Marc
    Finkbeiner, Konstantin
    Mueller, Dirk
    [J]. ENERGY AND BUILDINGS, 2019, 202
  • [9] The drivers of systemic risk in financial networks: a data-driven machine learning analysis
    Alexandre, Michel
    Silva, Thiago Christiano
    Connaughton, Colm
    Rodrigues, Francisco A.
    [J]. CHAOS SOLITONS & FRACTALS, 2021, 153 (153)
  • [10] Data-driven two-stage scheduling of multi-energy systems for operational flexibility enhancement
    Li, Hengyi
    Qin, Boyu
    Wang, Shihan
    Ding, Tao
    Wang, Hongzhen
    [J]. INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2024, 162