Modeling Pipe Break Data Using Survival Analysis with Machine Learning Imputation Methods

被引:9
|
作者
Xu, Hao [1 ]
Sinha, Sunil K. [1 ]
机构
[1] Virginia Tech, Dept Civil & Environm Engn, Blacksburg, VA 24060 USA
关键词
FAILURE MODELS; WATER; PREDICTION; REGRESSION; NETWORKS;
D O I
10.1061/(ASCE)CF.1943-5509.0001649
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The development of asset life estimation tools based on historical data is essential to the effective management of pipeline assets. One tool that may assist with asset management is survival analysis. However, left-truncated break records pose a challenge in the practice of survival analysis to obtain sound inferences and predictions. In this study, we propose a data-driven approach that integrates machine learning imputation methods with survival analysis. To demonstrate the proposed methodology, we perform a case study using ductile iron (DI) water distribution pipes from an anonymized utility in the midwestern United States. Two artificial neural network (ANN) models are developed as imputation methods to calibrate the survival curves and mean time to first failure (MTTF) estimates from the Weibull proportional hazards model (WPHM). Results show that the MTTF estimation bias is reduced from 14.3% to 2.1% by using imputation as a preceding procedure. Empirical findings show that despite the limited accuracy of imputation models, the use of imputation methods can still improve the survival analysis results and mitigate the impact of left-truncated break records. (C) 2021 American Society of Civil Engineers.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Energetic Map Data Imputation: A Machine Learning Approach
    Straub, Tobias
    Nagy, Madalina Mandy
    Sidorov, Maxim
    Tonetto, Leonardo
    Frey, Michael
    Gauterin, Frank
    [J]. ENERGIES, 2020, 13 (04)
  • [32] Approximate Imputation Method for Missing Data in Machine Learning
    Cao W.
    Chu Y.
    Li X.
    [J]. 1600, Xi'an Jiaotong University (51): : 142 - 148
  • [33] Performance Evaluation of Pipe Break Machine Learning Models Using Datasets from Multiple Utilities
    Chen, Thomas Ying-Jeh
    Vladeanu, Greta
    Yazdekhasti, Sepideh
    Daly, Craig Michael
    [J]. JOURNAL OF INFRASTRUCTURE SYSTEMS, 2022, 28 (02)
  • [34] Research Trend analysis for Seismic Data Interpolation Methods using Machine Learning
    Bae, Wooram
    Kwon, Yeji
    Ha, Wansoo
    [J]. GEOPHYSICS AND GEOPHYSICAL EXPLORATION, 2020, 23 (03): : 192 - 207
  • [35] Big Data Analysis Using Modern Statistical and Machine Learning Methods in Medicine
    Yoo, Changwon
    Ramirez, Luis
    Liuzzi, Juan
    [J]. INTERNATIONAL NEUROUROLOGY JOURNAL, 2014, 18 (02) : 50 - 57
  • [36] Data Analysis to Predictive Modeling of Marine Engine Performance Using Machine Learning
    Chan, T. K.
    Chin, C. S.
    [J]. PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 2076 - 2080
  • [37] CondiS web app: imputation of censored lifetimes for machine learning-based survival analysis
    Wang, Yizhuo
    Flowers, Christopher R.
    Li, Ziyi
    Huang, Xuelin
    [J]. BIOINFORMATICS, 2022, 38 (17) : 4252 - 4254
  • [38] A Review of Methods Used in Machine Learning and Data Analysis
    Wu, Qingyang
    [J]. ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 37 - 45
  • [39] Analysis and Modeling of Geodetic Data Based on Machine Learning
    Wu, Tong
    [J]. Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [40] Machine learning algorithms for analysis and modeling of GeoSpatial data
    Pozdnoukhov, Alexei
    Kanevski, Mikhail
    [J]. PROCEEDINGS OF THE IAMG '07: GEOMATHEMATICS AND GIS ANALYSIS OF RESOURCES, ENVIRONMENT AND HAZARDS, 2007, : 216 - +