Asymptotic properties of distance-weighted discrimination and its bias correction for high-dimension, low-sample-size data

被引:0
|
作者
Kento Egashira
Kazuyoshi Yata
Makoto Aoshima
机构
[1] University of Tsukuba,Degree Programs in Pure and Applied Sciences, Graduate School of Science and Technology
[2] University of Tsukuba,Institute of Mathematics
关键词
Bias-corrected DWD; Discriminant analysis; HDLSS; Large ; small ; Weighted DWD;
D O I
暂无
中图分类号
学科分类号
摘要
While distance-weighted discrimination (DWD) was proposed to improve the support vector machine in high-dimensional settings, it is known that the DWD is quite sensitive to the imbalanced ratio of sample sizes. In this paper, we study asymptotic properties of the DWD in high-dimension, low-sample-size (HDLSS) settings. We show that the DWD includes a huge bias caused by a heterogeneity of covariance matrices as well as sample imbalance. We propose a bias-corrected DWD (BC-DWD) and show that the BC-DWD can enjoy consistency properties about misclassification rates. We also consider the weighted DWD (WDWD) and propose an optimal choice of weights in the WDWD. Finally, we discuss performances of the BC-DWD and the WDWD with the optimal weights in numerical simulations and actual data analyses.
引用
收藏
页码:821 / 840
页数:19
相关论文
共 50 条
  • [41] Unsupervised classification of high-dimension and low-sample data with variational autoencoder based dimensionality reduction
    Mahmud, Mohammad Sultan
    Fu, Xianghua
    [J]. 2019 IEEE 4TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2019), 2019, : 498 - 503
  • [42] Deep Neural Networks for High Dimension, Low Sample Size Data
    Liu, Bo
    Wei, Ying
    Zhang, Yu
    Yang, Qiang
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2287 - 2293
  • [43] High-dimension, low-sample size perspectives in constrained statistical inference: The SARSCoV RNA genome in illustration
    Sen, Pranab K.
    Tsai, Ming-Tien
    Jou, Yuh-Shan
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2007, 102 (478) : 686 - 694
  • [44] Experimental Analysis of Feature Selection Stability for High-Dimension and Low-Sample Size Gene Expression Classification Task
    Dernoncourt, David
    Hanczar, Blaise
    Zucker, Jean-Daniel
    [J]. IEEE 12TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS & BIOENGINEERING, 2012, : 350 - 355
  • [45] Some considerations of classification for high dimension low-sample size data
    Zhang, Lingsong
    Lin, Xihong
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2013, 22 (05) : 537 - 550
  • [46] On Some Fast And Robust Classifiers For High Dimension, Low Sample Size Data
    Roy, Sarbojit
    Choudhury, Jyotishka Ray
    Dutta, Subhajit
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [47] A dimension reduction technique applied to regression on high dimension, low sample size neurophysiological data sets
    Adrielle C. Santana
    Adriano V. Barbosa
    Hani C. Yehia
    Rafael Laboissière
    [J]. BMC Neuroscience, 22
  • [48] A dimension reduction technique applied to regression on high dimension, low sample size neurophysiological data sets
    Santana, Adrielle C.
    Barbosa, Adriano V.
    Yehia, Hani C.
    Laboissiere, Rafael
    [J]. BMC NEUROSCIENCE, 2021, 22 (01)
  • [49] Multiple-instance ensemble for construction of deep heterogeneous committees for high-dimensional low-sample-size data
    Zhou, Qinghua
    Wang, Shuihua
    Zhu, Hengde
    Zhang, Xin
    Zhang, Yudong
    [J]. NEURAL NETWORKS, 2023, 167 : 380 - 399
  • [50] Biobjective gradient descent for feature selection on high dimension, low sample size data
    Issa, Tina
    Angel, Eric
    Zehraoui, Farida
    [J]. PLOS ONE, 2024, 19 (07):