A kernel PLS based classification method with missing data handling

被引:0
|
作者
Thuy Tuong Nguyen
Yury Tsoy
机构
[1] University of California,
[2] Institut Pasteur Korea,undefined
来源
Statistical Papers | 2017年 / 58卷
关键词
Kernel partial least squares; Discriminant analysis ; PLS; Missing data; Classification;
D O I
暂无
中图分类号
学科分类号
摘要
We provide a data classification mechanism with missing data handling based on kernel partial least squares (kernel PLS) and discriminant analysis (kernel PLSDA). The novelty of the method is that class variables are used for validation of the missing values imputation. Likewise, this paper is first in utilizing the kernel PLS in handling and classifying missing data. By experimentally comparing the results of different classification methods including missing data handling on three opened biomedical datasets (Arrhythmia, Mammographic Mass, and Pima Indians Diabetes at UCI Machine Learning Repository, http://archive.ics.uci.edu/ml/datasets.html), we found that the proposed kernel PLS plus kernel PLSDA yielded better accuracies than the existing methods.
引用
收藏
页码:211 / 225
页数:14
相关论文
共 50 条
  • [41] Bhattacharyya Distance based Kernel Method for Hyperspectral Data Multi-Class Classification
    Zhang, Miao
    Wang, Qiang
    He, Zhi
    Shen, Yi
    Lin, Yurong
    [J]. 2010 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE I2MTC 2010, PROCEEDINGS, 2010,
  • [42] A PLS KERNEL ALGORITHM FOR DATA SETS WITH MANY VARIABLES AND FEW OBJECTS .2. CROSS-VALIDATION, MISSING DATA AND EXAMPLES
    RANNAR, S
    GELADI, P
    LINDGREN, F
    WOLD, S
    [J]. JOURNAL OF CHEMOMETRICS, 1995, 9 (06) : 459 - 470
  • [43] The randomized marker method for single-case randomization tests: Handling data missing at random and data missing not at random
    Tamal Kumar De
    Patrick Onghena
    [J]. Behavior Research Methods, 2022, 54 : 2905 - 2938
  • [44] The randomized marker method for single-case randomization tests: Handling data missing at random and data missing not at random
    De, Tamal Kumar
    Onghena, Patrick
    [J]. BEHAVIOR RESEARCH METHODS, 2022, 54 (06) : 2905 - 2938
  • [45] Handling Missing Data with Markov Boundary
    Mohammed, Azhar
    Nguyen, Dang
    Duong, Bao
    Nichols, Melanie
    Nguyen, Thin
    [J]. ADVANCED DATA MINING AND APPLICATIONS (ADMA 2022), PT I, 2022, 13725 : 319 - 333
  • [46] Best Practices for Handling Missing Data
    Srijan, Shukla
    Rajagopalan, Iyer R.
    [J]. ANNALS OF SURGICAL ONCOLOGY, 2024, 31 (01) : 12 - 13
  • [47] Handling Missing Data in CGM Records
    Zulj, Sara
    Carvalho, Paulo
    Ribeiro, Rogerio
    Magjarevic, Ratko
    [J]. FUTURE TRENDS IN BIOMEDICAL AND HEALTH INFORMATICS AND CYBERSECURITY IN MEDICAL DEVICES, ICBHI 2019, 2020, 74 : 420 - 427
  • [48] Handling missing values in trait data
    Johnson, Thomas F.
    Isaac, Nick J. B.
    Paviolo, Agustin
    Gonzalez-Suarez, Manuela
    [J]. GLOBAL ECOLOGY AND BIOGEOGRAPHY, 2021, 30 (01): : 51 - 62
  • [49] Active Learning for Handling Missing Data
    Tharwat, Alaa
    Schenck, Wolfram
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15
  • [50] Handling missing data in clinical research
    Heymans, Martijn W.
    Twisk, Jos W. R.
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2022, 151 : 185 - 188