Geometric representation of high dimension, low sample size data

被引:296
|
作者
Hall, P
Marron, JS [1 ]
Neeman, A
机构
[1] Univ N Carolina, Dept Stat, Chapel Hill, NC 27599 USA
[2] Australian Natl Univ, Canberra, ACT, Australia
关键词
chemometrics; large dimensional data; medical images; microarrays; multivariate analysis; non-standard asymptotics;
D O I
10.1111/j.1467-9868.2005.00510.x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
High dimension, low sample size data are emerging in various areas of science. We find a common structure underlying many such data sets by using a non-standard type of asymptotics: the dimension tends to infinity while the sample size is fixed. Our analysis shows a tendency for the data to lie deterministically at the vertices of a regular simplex. Essentially all the randomness in the data appears only as a random rotation of this simplex. This geometric representation is used to obtain several new statistical insights.
引用
收藏
页码:427 / 444
页数:18
相关论文
共 50 条
  • [1] The high-dimension, low-sample-size geometric representation holds under mild conditions
    Ahn, Jeongyoun
    Marron, J. S.
    Muller, Keith M.
    Chi, Yueh-Yun
    [J]. BIOMETRIKA, 2007, 94 (03) : 760 - 766
  • [2] On Perfect Clustering of High Dimension, Low Sample Size Data
    Sarkar, Soham
    Ghosh, Anil K.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (09) : 2257 - 2272
  • [3] Classification for high-dimension low-sample size data
    Shen, Liran
    Er, Meng Joo
    Yin, Qingbo
    [J]. PATTERN RECOGNITION, 2022, 130
  • [4] Classification for high-dimension low-sample size data
    Shen, Liran
    Er, Meng Joo
    Yin, Qingbo
    [J]. PATTERN RECOGNITION, 2022, 130
  • [5] Deep Neural Networks for High Dimension, Low Sample Size Data
    Liu, Bo
    Wei, Ying
    Zhang, Yu
    Yang, Qiang
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2287 - 2293
  • [6] Effective PCA for high-dimension, low-sample-size data with noise reduction via geometric representations
    Yata, Kazuyoshi
    Aoshima, Makoto
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2012, 105 (01) : 193 - 215
  • [7] Some considerations of classification for high dimension low-sample size data
    Zhang, Lingsong
    Lin, Xihong
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2013, 22 (05) : 537 - 550
  • [8] Comparison of binary discrimination methods for high dimension low sample size data
    Bolivar-Cime, A.
    Marron, J. S.
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2013, 115 : 108 - 121
  • [9] On Some Fast And Robust Classifiers For High Dimension, Low Sample Size Data
    Roy, Sarbojit
    Choudhury, Jyotishka Ray
    Dutta, Subhajit
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [10] Geometric representation of graphs in low dimension
    Chandran, L. Sunil
    Sivadasan, Naveen
    [J]. COMPUTING AND COMBINATORICS, PROCEEDINGS, 2006, 4112 : 398 - 407