Dealing with Missing Values for Effective Prediction of NPC Recurrence

被引:0
|
作者
Kumdee, Orrawan [1 ]
Ritthipravat, Panrasee [2 ]
Bhongmakapat, Thongchai [3 ]
Cheewaruangroj, Wichit [3 ]
机构
[1] Mahidol Univ, Fac Engn, Technol Informat Syst Management, 25-25 Puttamolthon 4, Salaya, Nakornpathom, Thailand
[2] Mahidol Univ, Fac Engn, Biomed Engn Programme, Salaya, Nakornpathom, Thailand
[3] Ramathibodi Hosp, Fac Med, Dept Otolaryngol, Bangkok, Thailand
关键词
Missing Data Techniques; EM imputation; KNN imputation; nasopharyngeal carcinoma recurrence;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper aims to investigate missing data techniques for effective prediction of nasopharyngeal carcinoma (NPC) recurrence. The techniques include listwise deletion, imputations by mean, k-nearest neighbor, and expectation maximization. The completed data are used to predict the presence or absence of NPC recurrence in each year by means of logistic regression, multilayer perceptron with backpropagation training, and naive bayes. Five year predictions are carried out. Validity of each predictive model is assured by 10-fold cross validation. Their results are compared in order to determine proper missing data treatment and the most efficient prediction technique. The results showed that EM imputation was superior to the other missing data techniques because it can be efficiently applied to all predictive models. The multilayer perceptron with backpropagation training gave the highest prediction performance and it was the most robust to the data completed by different missing data techniques.
引用
收藏
页码:1231 / +
页数:3
相关论文
共 50 条
  • [41] XGBoost in handling missing values for life insurance risk prediction
    Deandra Aulia Rusdah
    Hendri Murfi
    [J]. SN Applied Sciences, 2020, 2
  • [42] Toward the Imputation and Prediction of Condition Monitoring Data with Missing Values
    Zhang, Di
    Li, Canbing
    Zhu, Jizhong
    [J]. 2023 IEEE/IAS INDUSTRIAL AND COMMERCIAL POWER SYSTEM ASIA, I&CPS ASIA, 2023, : 996 - 1002
  • [43] Prediction of genetic correlations and international breeding values for missing traits
    Mark, T.
    Fikse, W. F.
    Sullivan, P. G.
    VanRaden, P. M.
    [J]. JOURNAL OF DAIRY SCIENCE, 2007, 90 (10) : 4805 - 4813
  • [44] Traffic Time Prediction Based on Imputation Algorithms for Missing Values
    Guo, Cong
    Gu, Xinyu
    Li, Qiangian
    Qu, Jiabin
    Zhang, Lin
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 223 - 228
  • [45] XGBoost in handling missing values for life insurance risk prediction
    Rusdah, Deandra Aulia
    Murfi, Hendri
    [J]. SN APPLIED SCIENCES, 2020, 2 (08):
  • [46] EXORCISE - AN ALGORITHM FOR DETECTION OF SPURIOUS VALUES AND PREDICTION OF MISSING DATA
    ZHANG, TS
    SCHULTZ, A
    [J]. COMPUTERS & GEOSCIENCES, 1990, 16 (08) : 1027 - 1065
  • [47] Effective connectivity of fMRI data using ancestral graph theory: Dealing with missing regions
    Waldorp, Lourens
    Christoffels, Ingrid
    van de Ven, Vincent
    [J]. NEUROIMAGE, 2011, 54 (04) : 2695 - 2705
  • [48] THE RELATIVE EFFECTIVENESS OF PROCEDURES COMMONLY USED IN MULTIPLE-REGRESSION ANALYSIS FOR DEALING WITH MISSING VALUES
    DONNER, A
    [J]. AMERICAN STATISTICIAN, 1982, 36 (04): : 378 - 381
  • [49] Intention-to-treat: methods for dealing with missing values in clinical trials of progressively deteriorating diseases
    Unnebrink, K
    Windeler, J
    [J]. STATISTICS IN MEDICINE, 2001, 20 (24) : 3931 - 3946
  • [50] Causal Discovery from Medical Data: Dealing with Missing Values and a Mixture of Discrete and Continuous Data
    Sokolova, Elena
    Groot, Perry
    Claassen, Tom
    von Rhein, Daniel
    Buitelaar, Jan
    Heskes, Tom
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE (AIME 2015), 2015, 9105 : 177 - 181