Infilling of missing data in groundwater pollution prediction models using statistical methods

Cited by: 2
Authors
Pal, Jayashree [1]
Chakrabarty, Dibakar [1]
Affiliations
[1] Natl Inst Technol Silchar, Dept Civil Engn, Silchar, India
Keywords
infilling; statistical methods; artificial neural networks; pollutant transport; groundwater; MONITORING NETWORK; NEURAL-NETWORKS; IDENTIFICATION; INTERPOLATION
DOI
10.1080/02626667.2023.2258867
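Chinese Library Classification number
TV21 [Water resources survey and water conservancy planning]
Discipline classification code
081501
Abstract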
Missing data are ubiquitous in hydrology and complicate the development of data-driven models. In groundwater pollution monitoring networks, data gaps may arise from failure of recording devices, malfunctioning sensors, and similar causes. Handling such gaps means filling in the missing portions of the data record. Although several studies deal with missing data in hydrology, literature addressing the problem in groundwater pollution prediction is scarce. This paper assesses four imputation techniques (linear, cubic spline, piecewise cubic Hermite, and modified Akima cubic Hermite interpolation) for developing groundwater pollution prediction models using artificial neural networks (ANNs). Cascade-forward back-propagation ANN models are developed with 5% to 75% of the data missing, and their performance is evaluated. The results show that imputation techniques can be effective in such circumstances.
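
The four interpolation schemes named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function name `infill`, the synthetic concentration curve, and the 25% gap fraction are assumptions made only for illustration, and the ANN modelling step is not shown. The modified Akima option relies on the `method="makima"` argument of SciPy's `Akima1DInterpolator` (available from SciPy 1.13); on older versions the sketch falls back to the plain Akima scheme.

```python
import numpy as np
from scipy.interpolate import Akima1DInterpolator, CubicSpline, PchipInterpolator


def infill(t, c, method="linear"):
    """Return a copy of c with NaN gaps filled by interpolating over t.

    Assumes t is strictly increasing and that the first and last samples
    are observed, so no extrapolation is needed.
    """
    t = np.asarray(t, dtype=float)
    c = np.asarray(c, dtype=float)
    known = ~np.isnan(c)
    tk, ck = t[known], c[known]

    if method == "linear":
        est = np.interp(t, tk, ck)
    elif method == "spline":
        est = CubicSpline(tk, ck)(t)
    elif method == "pchip":
        est = PchipInterpolator(tk, ck)(t)
    elif method == "makima":
        try:
            interp = Akima1DInterpolator(tk, ck, method="makima")
        except TypeError:
            # Older SciPy without the `method` argument: plain Akima instead.
            interp = Akima1DInterpolator(tk, ck)
        est = interp(t)
    else:
        raise ValueError(f"unknown method: {method}")

    filled = c.copy()
    filled[~known] = est[~known]  # observed values are left untouched
    return filled


# Synthetic example (made-up data, not from the paper): a smooth
# breakthrough-like concentration curve with 25% of interior samples removed.
rng = np.random.default_rng(0)
t = np.arange(40.0)
truth = 50.0 / (1.0 + np.exp(-(t - 20.0) / 4.0))
obs = truth.copy()
missing = rng.choice(np.arange(1, 39), size=10, replace=False)
obs[missing] = np.nan

for name in ("linear", "spline", "pchip", "makima"):
    filled = infill(t, obs, name)
    rmse = np.sqrt(np.mean((filled[missing] - truth[missing]) ** 2))
    print(f"{name:7s} RMSE at infilled points = {rmse:.4f}")
```

In a workflow like the one the abstract describes, the infilled series would then serve as training data for the prediction model, with the comparison repeated across gap fractions.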
Pages: 2208-2222
Number of pages: 15