Infilling of missing data in groundwater pollution prediction models using statistical methods

被引:2
|
作者
Pal, Jayashree [1 ]
Chakrabarty, Dibakar [1 ]
机构
[1] Natl Inst Technol Silchar, Dept Civil Engn, Silchar, India
关键词
infilling; statistical methods; artificial neural networks; pollutant transport; groundwater; MONITORING NETWORK; NEURAL-NETWORKS; IDENTIFICATION; INTERPOLATION;
D O I
10.1080/02626667.2023.2258867
中图分类号
TV21 [水资源调查与水利规划];
学科分类号
081501 ;
摘要
Missing data is ubiquitous in hydrology. This phenomenon poses difficulty in the development of data-driven models. Events of missing data in groundwater pollution monitoring networks may occur due to failure of recording devices, malfunctioning of sensors, etc. Handling such missing data implies filling the missing portions of the data structure. Though several studies are available for dealing with missing data in the field of hydrology, literature dealing with such scenarios in groundwater pollution prediction is scarce. This paper assesses four imputation techniques - viz. linear, cubic spline, piece-wise cubic Hermite and modified Akima with cubic Hermite interpolation methods - for developing groundwater pollution prediction models using artificial neural network (ANN). The study employs the development of cascade-forward back-propagation ANN models using missing data ranging from 5% to 75% and evaluating their performance. Results show that imputation techniques can be effective in such circumstances.
引用
收藏
页码:2208 / 2222
页数:15
相关论文
共 50 条
  • [21] Imputation methods for missing data for polygenic models
    Brooke Fridley
    Kari Rabe
    Mariza de Andrade
    BMC Genetics, 4
  • [22] Imputation methods for missing data for polygenic models
    Fridley, B
    Rabe, K
    de Andrade, M
    BMC GENETICS, 2003, 4 (Suppl 1)
  • [23] Prediction of Air Pollution Concentration Using Weather Data and Regression Models
    Trenchevski, Aleksandar
    Kalendar, Marija
    Gjoreski, Hristijan
    Efnusheva, Danijela
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON APPLIED INNOVATIONS IN IT, 2020, 8 (01): : 55 - 61
  • [24] Methods for Handling Missing Variables in Risk Prediction Models
    Held, Ulrike
    Kessels, Alfons
    Aymerich, Judith Garcia
    Basagana, Xavier
    ter Riet, Gerben
    Moons, Karel G. M.
    Puhan, Milo A.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2016, 184 (07) : 545 - 551
  • [25] Statistical Inference Methods for Clinical Medical Data with Missing and Truncated Data
    Cai K.
    Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [26] IMPUTING MISSING DATA IN A SWAT WATER QUALITY MODELLING STUDY USING STATISTICAL METHODS
    Boyacioglu, Hulya
    Uyar, Meltem Kaya
    Boyacioglu, Hayal
    ENVIRONMENTAL ENGINEERING AND MANAGEMENT JOURNAL, 2024, 23 (03): : 579 - 586
  • [27] STATISTICAL PREDICTION OF AIR-POLLUTION LEVELS USING NON-PHYSICAL MODELS
    SAWARAGI, Y
    SOEDA, T
    TAMURA, H
    YOSHIMURA, T
    OHE, S
    CHUJO, Y
    ISHIHARA, H
    AUTOMATICA, 1979, 15 (04) : 441 - 451
  • [28] DATA MINING METHODS FOR PREDICTION OF AIR POLLUTION
    Siwek, Krzysztof
    Osowski, Stanislaw
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2016, 26 (02) : 467 - 478
  • [29] Identification of groundwater pollution from livestock farming using fluorescence spectroscopy coupled with multivariate statistical methods
    Zhang, Yuanzheng
    Liu, Yunde
    Zhou, Aiguo
    Zhang, Li
    WATER RESEARCH, 2021, 206
  • [30] Filter-based feature selection methods in the presence of missing data for medical prediction models
    Aydin, Zeliha Ergul
    Ozturk, Zehra Kamisli
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (8) : 24187 - 24216