Dealing with Missing Values for Effective Prediction of NPC Recurrence

被引:0
|
作者
Kumdee, Orrawan [1 ]
Ritthipravat, Panrasee [2 ]
Bhongmakapat, Thongchai [3 ]
Cheewaruangroj, Wichit [3 ]
机构
[1] Mahidol Univ, Fac Engn, Technol Informat Syst Management, 25-25 Puttamolthon 4, Salaya, Nakornpathom, Thailand
[2] Mahidol Univ, Fac Engn, Biomed Engn Programme, Salaya, Nakornpathom, Thailand
[3] Ramathibodi Hosp, Fac Med, Dept Otolaryngol, Bangkok, Thailand
关键词
Missing Data Techniques; EM imputation; KNN imputation; nasopharyngeal carcinoma recurrence;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper aims to investigate missing data techniques for effective prediction of nasopharyngeal carcinoma (NPC) recurrence. The techniques include listwise deletion, imputations by mean, k-nearest neighbor, and expectation maximization. The completed data are used to predict the presence or absence of NPC recurrence in each year by means of logistic regression, multilayer perceptron with backpropagation training, and naive bayes. Five year predictions are carried out. Validity of each predictive model is assured by 10-fold cross validation. Their results are compared in order to determine proper missing data treatment and the most efficient prediction technique. The results showed that EM imputation was superior to the other missing data techniques because it can be efficiently applied to all predictive models. The multilayer perceptron with backpropagation training gave the highest prediction performance and it was the most robust to the data completed by different missing data techniques.
引用
收藏
页码:1231 / +
页数:3
相关论文
共 50 条
  • [1] Dealing with missing values in PLS
    不详
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1999, 49 (01) : 117 - 119
  • [2] Dealing with Missing Predictor Values When Applying Clinical Prediction Models
    Janssen, Kristel J. M.
    Vergouwe, Yvonne
    Donders, A. Rogier T.
    Harrell, Frank E., Jr.
    Chen, Qingxia
    Grobbee, Diederick E.
    Moons, Karel G. M.
    [J]. CLINICAL CHEMISTRY, 2009, 55 (05) : 994 - 1001
  • [3] Techniques for dealing with missing values in classification
    Liu, WZ
    White, AP
    Thompson, SG
    Bramer, MA
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS: REASONING ABOUT DATA, 1997, 1280 : 527 - 536
  • [4] Dealing with missing values in proteomics data
    Kong, Weijia
    Hui, Harvard Wai Hann
    Peng, Hui
    Bin Goh, Wilson Wen
    [J]. PROTEOMICS, 2022, 22 (23-24)
  • [5] Dealing with Missing Values in Microarray Data
    Mohammadi, Azadeh
    Saraee, Mohammad Hossein
    [J]. 2008 INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES, PROCEEDINGS, 2008, : 258 - 263
  • [6] Combined association rules for dealing with missing values
    Shen, Jau-Ji
    Chang, Chin-Chen
    Li, Yu-Chiang
    [J]. JOURNAL OF INFORMATION SCIENCE, 2007, 33 (04) : 468 - 480
  • [7] Gene expression clustering: Dealing with the missing values
    Gruzdz, A
    Ihnatowicz, A
    Slezak, D
    [J]. INTELLIGENT INFORMATION PROCESSING AND WEB MINING, PROCEEDINGS, 2005, : 521 - 530
  • [8] DEALING WITH MISSING VALUES IN NUCLEAR SAFEGUARDS EVALUATION
    Kabak, Ozgur
    Ruan, Da
    [J]. INTELLIGENT DECISION MAKING SYSTEMS, VOL. 2, 2010, : 145 - +
  • [9] Dealing with missing values and outliers in principal component analysis
    Stanimirova, I.
    Daszykowski, M.
    Walczak, B.
    [J]. TALANTA, 2007, 72 (01) : 172 - 178
  • [10] A Novel Approach for Dealing with Missing Values in Machine Learning Datasets with Discrete Values
    Abu-Soud, Saleh M.
    [J]. 2019 INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCIS), 2019, : 118 - 122