Deep Neural Networks for High Dimension, Low Sample Size Data

被引:0
|
作者
Liu, Bo [1 ]
Wei, Ying [1 ]
Zhang, Yu [1 ]
Yang, Qiang [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
FEATURE-SELECTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks (DNN) have achieved breakthroughs in applications with large sample size. However, when facing high dimension, low sample size (HDLSS) data, such as the phenotype prediction problem using genetic data in bioinformatics, DNN suffers from overfitting and high-variance gradients. In this paper, we propose a DNN model tailored for the HDLSS data, named Deep Neural Pursuit (DNP). DNP selects a subset of high dimensional features for the alleviation of overfitting and takes the average over multiple dropouts to calculate gradients with low variance. As the first DNN method applied on the HDLSS data, DNP enjoys the advantages of the high nonlinearity, the robustness to high dimensionality, the capability of learning from a small number of samples, the stability in feature selection, and the end-to-end training. We demonstrate these advantages of DNP via empirical results on both synthetic and real-world biological datasets.
引用
收藏
页码:2287 / 2293
页数:7
相关论文
共 50 条
  • [1] On Perfect Clustering of High Dimension, Low Sample Size Data
    Sarkar, Soham
    Ghosh, Anil K.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (09) : 2257 - 2272
  • [2] Geometric representation of high dimension, low sample size data
    Hall, P
    Marron, JS
    Neeman, A
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2005, 67 : 427 - 444
  • [3] Classification for high-dimension low-sample size data
    Shen, Liran
    Er, Meng Joo
    Yin, Qingbo
    PATTERN RECOGNITION, 2022, 130
  • [4] Classification for high-dimension low-sample size data
    Shen, Liran
    Er, Meng Joo
    Yin, Qingbo
    PATTERN RECOGNITION, 2022, 130
  • [5] Tree enhanced deep adaptive network for cancer prediction with high dimension low sample size microarray data
    Wu, Yao
    Zhu, Donghua
    Wang, Xuefeng
    APPLIED SOFT COMPUTING, 2023, 136
  • [6] Some considerations of classification for high dimension low-sample size data
    Zhang, Lingsong
    Lin, Xihong
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2013, 22 (05) : 537 - 550
  • [7] Comparison of binary discrimination methods for high dimension low sample size data
    Bolivar-Cime, A.
    Marron, J. S.
    JOURNAL OF MULTIVARIATE ANALYSIS, 2013, 115 : 108 - 121
  • [8] On Some Fast And Robust Classifiers For High Dimension, Low Sample Size Data
    Roy, Sarbojit
    Choudhury, Jyotishka Ray
    Dutta, Subhajit
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [9] CLUSTERING HIGH DIMENSION, LOW SAMPLE SIZE DATA USING THE MAXIMAL DATA PILING DISTANCE
    Ahn, Jeongyoun
    Lee, Myung Hee
    Yoon, Young Joo
    STATISTICA SINICA, 2012, 22 (02) : 443 - 464
  • [10] A dimension reduction technique applied to regression on high dimension, low sample size neurophysiological data sets
    Santana, Adrielle C.
    Barbosa, Adriano V.
    Yehia, Hani C.
    Laboissiere, Rafael
    BMC NEUROSCIENCE, 2021, 22 (01)