The high-dimension, low-sample-size geometric representation holds under mild conditions

Cited by: 97
Authors
Ahn, Jeongyoun [1]
Marron, J. S.
Muller, Keith M.
Chi, Yueh-Yun
Affiliations
[1] Univ Georgia, Dept Stat, Athens, GA 30602 USA
[2] Univ N Carolina, Dept Stat & Operat Res, Chapel Hill, NC 27599 USA
[3] Univ Florida, Dept Epidemiol & Hlth Policy Res, Gainesville, FL 32610 USA
[4] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
Keywords
high-dimension; low-sample-size; large p small n; linear discrimination; sample covariance matrix
DOI
10.1093/biomet/asm050
Chinese Library Classification number
Q [Biological sciences]
Discipline classification codes
07; 0710; 09
Abstract
High-dimension, low-sample-size datasets have different geometrical properties from those of traditional low-dimensional data. In their asymptotic study of increasing dimensionality with a fixed sample size, Hall et al. (2005) showed that the data vectors are approximately located at the vertices of a regular simplex in a high-dimensional space. A perhaps unappealing aspect of their result is the underlying assumption, which requires the variables, viewed as a time series, to be almost independent. We establish an equivalent geometric representation under much milder conditions using asymptotic properties of sample covariance matrices. We discuss implications of the results, such as the use of principal component analysis in a high-dimensional space, extension to the case of nonindependent samples, and the binary classification problem.
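The simplex geometry described in the abstract can be illustrated with a small simulation. The sketch below is not taken from the paper: it assumes the simplest possible setting, independent standard Gaussian entries, which is stronger than the mild conditions the authors work under. The function name `pairwise_scaled_distances` and the chosen values of `n` and `d` are hypothetical. With the sample size fixed, every pairwise distance divided by the square root of the dimension should approach the same value, sqrt(2), as the dimension grows, which is what placing the points at the vertices of a regular simplex means.

```python
import math
import random


def pairwise_scaled_distances(n, d, seed=0):
    """Draw n vectors with d i.i.d. N(0, 1) entries (an illustrative
    assumption) and return all pairwise Euclidean distances divided
    by sqrt(d)."""
    rng = random.Random(seed)
    data = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(n)]
    dists = []
    for i in range(n):
        for j in range(i + 1, n):
            sq = sum((a - b) ** 2 for a, b in zip(data[i], data[j]))
            dists.append(math.sqrt(sq / d))
    return dists


if __name__ == "__main__":
    # Fixed sample size, growing dimension: the spread of the scaled
    # distances should shrink as d increases.
    for d in (10, 1000, 20000):
        dists = pairwise_scaled_distances(n=5, d=d)
        spread = max(dists) - min(dists)
        print(f"d={d:6d}  min={min(dists):.3f}  max={max(dists):.3f}  spread={spread:.3f}")
```

In this independent Gaussian case the squared distance between two vectors, divided by d, has mean 2 and variance 8/d, so the scaled distances concentrate around sqrt(2) at rate 1/sqrt(d). The paper's contribution concerns when the same picture survives dependence among the variables, which this sketch does not attempt to reproduce.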
Pages: 760-766
Number of pages: 7
Related papers
50 items in total
  • [21] Asymptotic properties of distance-weighted discrimination and its bias correction for high-dimension, low-sample-size data
    Kento Egashira
    Kazuyoshi Yata
    Makoto Aoshima
    [J]. Japanese Journal of Statistics and Data Science, 2021, 4 : 821 - 840
  • [22] Correction to: Asymptotic properties of distance-weighted discrimination and its bias correction for high-dimension, low-sample-size data
    Kento Egashira
    Kazuyoshi Yata
    Makoto Aoshima
    [J]. Japanese Journal of Statistics and Data Science, 2022, 5 : 717 - 718
  • [23] Re-Stabilizing Large-Scale Network Systems Using High-Dimension Low-Sample-Size Data Analysis
    Shen, Xun
    Sasahara, Hampei
    Imura, Jun-ichi
    Oku, Makito
    Aihara, Kazuyuki
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [24] Prediction of Microcystis Occurrences and Analysis Using Machine Learning in High-Dimension, Low-Sample-Size and Imbalanced Water Quality Data
    Mori, Masaya
    Flores, Roberto Gonzalez
    Suzuki, Yoshihiro
    Nukazawa, Kei
    Hiraoka, Toru
    Nonaka, Hirofumi
    [J]. HARMFUL ALGAE, 2022, 117
  • [25] Design of input assignment and feedback gain for re-stabilizing undirected networks with High-Dimension Low-Sample-Size data
    Yasukata, Hitoshi
    Shen, Xun
    Sasahara, Hampei
    Imura, Jun-ichi
    Oku, Makito
    Aihara, Kazuyuki
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (12) : 6734 - 6753
  • [26] Ultra-early medical treatment-oriented system identification using High-Dimension Low-Sample-Size data
    Shen, Xun
    Shimada, Naruto
    Sasahara, Hampei
    Imura, Jun-ichi
    [J]. IFAC JOURNAL OF SYSTEMS AND CONTROL, 2024, 27
  • [27] Random forest kernel for high-dimension low sample size classification
    Lucca Portes Cavalheiro
    Simon Bernard
    Jean Paul Barddal
    Laurent Heutte
    [J]. Statistics and Computing, 2024, 34
  • [28] Random forest kernel for high-dimension low sample size classification
    Cavalheiro, Lucca Portes
    Bernard, Simon
    Barddal, Jean Paul
    Heutte, Laurent
    [J]. STATISTICS AND COMPUTING, 2024, 34 (01)
  • [29] Statistical Significance of Clustering for High-Dimension, Low-Sample Size Data
    Liu, Yufeng
    Hayes, David Neil
    Nobel, Andrew
    Marron, J. S.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (483) : 1281 - 1293
  • [30] Asymptotic properties of distance-weighted discrimination and its bias correction for high-dimension, low-sample-size data (vol 4, pg 821, 2021)
    Egashira, Kento
    Yata, Kazuyoshi
    Aoshima, Makoto
    [J]. JAPANESE JOURNAL OF STATISTICS AND DATA SCIENCE, 2022, 5 (02) : 717 - 718