Dealing with Missing Values for Effective Prediction of NPC Recurrence

Cited by: 0
Authors
Kumdee, Orrawan [1]
Ritthipravat, Panrasee [2]
Bhongmakapat, Thongchai [3]
Cheewaruangroj, Wichit [3]
Affiliations
[1] Mahidol Univ, Fac Engn, Technol Informat Syst Management, 25-25 Puttamolthon 4, Salaya, Nakornpathom, Thailand
[2] Mahidol Univ, Fac Engn, Biomed Engn Programme, Salaya, Nakornpathom, Thailand
[3] Ramathibodi Hosp, Fac Med, Dept Otolaryngol, Bangkok, Thailand
Keywords
Missing Data Techniques; EM imputation; KNN imputation; nasopharyngeal carcinoma recurrence;
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
This paper investigates missing data techniques for effective prediction of nasopharyngeal carcinoma (NPC) recurrence. The techniques are listwise deletion and imputation by mean, k-nearest neighbor (KNN), and expectation maximization (EM). The completed data are used to predict the presence or absence of NPC recurrence in each year by means of logistic regression, a multilayer perceptron with backpropagation training, and naive Bayes. Five-year predictions are carried out. Each predictive model is validated by 10-fold cross-validation. The results are compared to determine the proper missing data treatment and the most efficient prediction technique. EM imputation was superior to the other missing data techniques because it could be applied efficiently to all predictive models. The multilayer perceptron with backpropagation training gave the highest prediction performance and was the most robust to data completed by different missing data techniques.
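As a rough illustration of three of the missing data techniques named in the abstract (listwise deletion, mean imputation, and KNN imputation), the following minimal sketch applies them to a tiny toy table. The data, the function names, the choice of k = 2, and the distance measure (root mean squared difference over the features both records have observed) are all assumptions made for illustration; they are not taken from the paper. EM imputation, the classifiers, and the 10-fold cross-validation are not shown.

```python
import math

# Toy records; None marks a missing value (hypothetical data, not from the paper).
rows = [
    [1.0, 2.0],
    [2.0, None],
    [3.0, 6.0],
    [None, 8.0],
]

def listwise_deletion(data):
    """Keep only the records that have no missing values."""
    return [r[:] for r in data if all(v is not None for v in r)]

def mean_impute(data):
    """Replace each missing value with the mean of the observed values in its column."""
    means = []
    for j in range(len(data[0])):
        observed = [r[j] for r in data if r[j] is not None]
        means.append(sum(observed) / len(observed))
    return [[means[j] if v is None else v for j, v in enumerate(r)] for r in data]

def knn_impute(data, k=2):
    """Fill each missing value with the mean of that column over the k nearest
    records that have the column observed. Assumes at least one such record exists."""
    def distance(a, b):
        # Compare only the features that both records have observed.
        shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
        if not shared:
            return math.inf
        return math.sqrt(sum((x - y) ** 2 for x, y in shared) / len(shared))

    result = []
    for i, r in enumerate(data):
        filled = r[:]
        for j, v in enumerate(r):
            if v is None:
                donors = [(distance(r, o), o[j]) for m, o in enumerate(data)
                          if m != i and o[j] is not None]
                donors.sort(key=lambda d: d[0])
                nearest = [val for _, val in donors[:k]]
                filled[j] = sum(nearest) / len(nearest)
        result.append(filled)
    return result

deleted = listwise_deletion(rows)
mean_filled = mean_impute(rows)
knn_filled = knn_impute(rows, k=2)
```

Listwise deletion shrinks the data set, while the two imputation functions keep every record; each completed table could then be passed to a classifier and compared under cross-validation, as the abstract describes.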
Pages: 1231 / +
Number of pages: 3