A kernel PLS based classification method with missing data handling

被引:0
|
作者
Thuy Tuong Nguyen
Yury Tsoy
机构
[1] University of California,
[2] Institut Pasteur Korea,undefined
来源
Statistical Papers | 2017年 / 58卷
关键词
Kernel partial least squares; Discriminant analysis ; PLS; Missing data; Classification;
D O I
暂无
中图分类号
学科分类号
摘要
We provide a data classification mechanism with missing data handling based on kernel partial least squares (kernel PLS) and discriminant analysis (kernel PLSDA). The novelty of the method is that class variables are used for validation of the missing values imputation. Likewise, this paper is first in utilizing the kernel PLS in handling and classifying missing data. By experimentally comparing the results of different classification methods including missing data handling on three opened biomedical datasets (Arrhythmia, Mammographic Mass, and Pima Indians Diabetes at UCI Machine Learning Repository, http://archive.ics.uci.edu/ml/datasets.html), we found that the proposed kernel PLS plus kernel PLSDA yielded better accuracies than the existing methods.
引用
收藏
页码:211 / 225
页数:14
相关论文
共 50 条
  • [32] A method of handling missing data in the context of learning Bayesian network structure
    Chen, Chong
    Yu, Hua
    Wang, Juyun
    [J]. APPLIED SCIENCE AND PRECISION ENGINEERING INNOVATION, PTS 1 AND 2, 2014, 479-480 : 906 - +
  • [33] A novel method for handling Missing Not at Random Data in the electronic health records
    Shen, Xinpeng
    Ma, Sisi
    Caraballo, Pedro J.
    Vemuri, Prashanthi
    Simon, Gyorgy J.
    [J]. 2022 IEEE 10TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2022), 2022, : 21 - 26
  • [34] A method for image classification based on kernel PCA
    Yan, Su
    Zhao, Jiu-Fen
    Zhao, Jiu-Ling
    Li, Qing-Zhen
    [J]. PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 718 - 722
  • [35] Kernel Density Estimation with Missing Data: Misspecifying the Missing Data Mechanism
    Dubnicka, Suzanne R.
    [J]. NONPARAMETRIC STATISTICS AND MIXTURE MODELS: A FESTSCHRIFT IN HONOR OF THOMAS P HETTMANSPERGER, 2011, : 114 - 135
  • [36] Kernel PLS-based GLRT method for fault detection of chemical processes
    Botre, Chiranjivi
    Mansouri, Majdi
    Nounou, Mohamed
    Nounou, Hazem
    Karim, M. Nazmul
    [J]. JOURNAL OF LOSS PREVENTION IN THE PROCESS INDUSTRIES, 2016, 43 : 212 - 224
  • [37] Estimating missing weather data for agricultural simulations using group method of data handling
    Acock, MC
    Pachepsky, YA
    [J]. JOURNAL OF APPLIED METEOROLOGY, 2000, 39 (07): : 1176 - 1184
  • [38] The classification method based on evolutionary algorithm for high-dimensional imbalanced missing data
    Liu, Yi
    Li, Gengsong
    Li, Xiang
    Qin, Wei
    Zheng, Qibin
    Ren, Xiaoguang
    [J]. ELECTRONICS LETTERS, 2023, 59 (12)
  • [39] BASED METHOD FOR HANDLING UNLABELED DATA
    Alvarez Gomez, Sharon Diznarda
    Machuca Vivar, Silvio Amable
    Salas Medina, Paulina Elizabeth
    [J]. REVISTA UNIVERSIDAD Y SOCIEDAD, 2021, 13 : 452 - 458
  • [40] A Pathway-Based Kernel Boosting Method for Sample Classification Using Genomic Data
    Zeng, Li
    Yu, Zhaolong
    Zhao, Hongyu
    [J]. GENES, 2019, 10 (09)