Estimation of a logistic regression model by a genetic algorithm to predict pipe failures in sewer networks

被引:18
|
作者
Robles-Velasco, Alicia [1 ,2 ]
Cortes, Pablo [1 ,2 ]
Munuzuri, Jesus [1 ]
Onieva, Luis [1 ]
机构
[1] Univ Seville, ETSI, Dept Org Ind & Gest Empresas, C Camino Descubrimientos S-N, Seville 41092, Spain
[2] Univ Seville, EMASESA, Catedra Agua, Seville, Spain
关键词
Logistic regression; Binary classifier; Pipe failures; Genetic algorithm; Sewer networks;
D O I
10.1007/s00291-020-00614-9
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Sewer networks are mainly composed of pipelines which are in charge of transporting sewage and rainwater to wastewater treatment plants. A failure in a sewer pipe has many negative consequences, such as accidents, flooding, pollution or extra costs. Machine learning arises as a very powerful tool to predict these incidents when the amount of available data is large enough. In this study, a real-coded genetic algorithm is implemented to estimate the optimal weights of a logistic regression model whose objective is to forecast pipe failures in wastewater networks. The goal is to create an autonomous and independent predictive system able to support the decisions about pipe replacement plans of companies. From the data processing to the validation of the model, all stages for the implementation of the machine-learning system are explored and carefully explained. Moreover, the methodology is applied to a real sewer network of a Spanish city to check its performance. Results demonstrate that by annually replacing 4% of pipe segments, those whose estimated failure probability is higher than 0.75, almost 30% of unexpected pipe failures are prevented. Furthermore, the analysis of the estimated weights of the logistic regression model reveals some weaknesses of the network as well as the influence of the features in the pipe failures. For instance, the predisposition of vitrified clay pipes to fail and of that pipes with smaller diameters.
引用
收藏
页码:759 / 776
页数:18
相关论文
共 50 条
  • [31] A Model to Predict Breast Cancer Survivability Using Logistic Regression
    Nourelahi, Mehdi
    Zamani, Ali
    Talei, Abdolrasoul
    Tahmasebi, Sedigheh
    MIDDLE EAST JOURNAL OF CANCER, 2019, 10 (02) : 132 - 138
  • [32] Logistic regression model to predict acute uncomplicated and complicated appendicitis
    Eddama, M. M. R.
    Fragkos, K. C.
    Renshaw, S.
    Aldridge, M.
    Bough, G.
    Bonthala, L.
    Wang, A.
    Cohen, R.
    ANNALS OF THE ROYAL COLLEGE OF SURGEONS OF ENGLAND, 2019, 101 (02) : 107 - 118
  • [33] Convolutional Neural Networks Optimized by Logistic Regression Model
    Yang, Bo
    Zhao, Zuopeng
    Xu, Xinzheng
    INTELLIGENT INFORMATION PROCESSING VIII, 2016, 486 : 91 - 96
  • [34] Bayesian networks with a logistic regression model for the conditional probabilities
    Rijmen, Frank
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2008, 48 (02) : 659 - 666
  • [35] Genetic algorithm solution for Logistic model parameters
    Key Laboratory for Silviculture and Conservation, Beijing Forestry University, Beijing 100083, China
    不详
    不详
    不详
    Beijing Linye Daxue Xuebao, 2008, SUPPL. 1 (192-195): : 192 - 195
  • [36] Logistic model parameter genetic algorithm solution
    Ma You-ping
    Feng Zhong-ke
    ICMS2010: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON MODELLING AND SIMULATION, VOL 4: MODELLING AND SIMULATION IN BIOLOGY, ECOLOGY & ENVIRONMENT, 2010, : 103 - 106
  • [37] Practical investigation of the performance of robust logistic regression to predict the genetic risk of hypertension
    Miriam Kesselmeier
    Carine Legrand
    Barbara Peil
    Maria Kabisch
    Christine Fischer
    Ute Hamann
    Justo Lorenzo Bermejo
    BMC Proceedings, 8 (Suppl 1)
  • [38] Genetic algorithm search for large logistic regression models with significant variables
    Stacey, A
    Kildea, D
    ITI 2000: PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2000, : 275 - 279
  • [39] Genetic algorithm with logistic regression for prediction of progression to Alzheimer's disease
    Johnson, Piers
    Vandewater, Luke
    Wilson, William
    Maruff, Paul
    Savage, Greg
    Graham, Petra
    Macaulay, Lance S.
    Ellis, Kathryn A.
    Szoeke, Cassandra
    Martins, Ralph N.
    Rowe, Christopher C.
    Masters, Colin L.
    Ames, David
    Zhang, Ping
    BMC BIOINFORMATICS, 2014, 15
  • [40] Genetic algorithm with logistic regression for prediction of progression to Alzheimer's disease
    Piers Johnson
    Luke Vandewater
    William Wilson
    Paul Maruff
    Greg Savage
    Petra Graham
    Lance S Macaulay
    Kathryn A Ellis
    Cassandra Szoeke
    Ralph N Martins
    Christopher C Rowe
    Colin L Masters
    David Ames
    Ping Zhang
    BMC Bioinformatics, 15