Intrinsic Dimensionality Estimation of High-Dimension, Low Sample Size Data with D-Asymptotics

被引:15
|
作者
Yata, Kazuyoshi [2 ]
Aoshima, Makoto [1 ]
机构
[1] Univ Tsukuba, Inst Math, Ibaraki 3058571, Japan
[2] Univ Tsukuba, Grad Sch Pure & Appl Sci, Ibaraki 3058571, Japan
基金
日本学术振兴会;
关键词
Dual covariance matrix; Effective dimension; HDLSS; Large p small n; Maximum eigenvalue; GEOMETRIC REPRESENTATION; LARGEST EIGENVALUE;
D O I
10.1080/03610920903121999
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
High-dimension, low sample size (HDLSS) data are becoming common in various fields such as genetic microarrays, medical imaging, text recognition, finance, chemometrics, and so on. Such data have surprising and often counterintuitive geometric structures because of the high-dimensional noise that dominates and corrupts the local neighborhoods. In this article, we estimate the intrinsic dimension (ID) that allows one to distinguish between deterministic chaos and random noise of HDLSS data. A new ID estimating methodology is given and its properties are studied by using a d-asymptotic approach.
引用
收藏
页码:1511 / 1521
页数:11
相关论文
共 50 条
  • [31] Population-guided large margin classifier for high-dimension low-sample-size problems
    Yin, Qingbo
    Adeli, Ehsan
    Shen, Liran
    Shen, Dinggang
    [J]. PATTERN RECOGNITION, 2020, 97
  • [32] Correction to: Asymptotic properties of distance-weighted discrimination and its bias correction for high-dimension, low-sample-size data
    Kento Egashira
    Kazuyoshi Yata
    Makoto Aoshima
    [J]. Japanese Journal of Statistics and Data Science, 2022, 5 : 717 - 718
  • [33] Prediction of Microcystis Occurrences and Analysis Using Machine Learning in High-Dimension, Low-Sample-Size and Imbalanced Water Quality Data
    Mori, Masaya
    Flores, Roberto Gonzalez
    Suzuki, Yoshihiro
    Nukazawa, Kei
    Hiraoka, Toru
    Nonaka, Hirofumi
    [J]. HARMFUL ALGAE, 2022, 117
  • [34] Re-Stabilizing Large-Scale Network Systems Using High-Dimension Low-Sample-Size Data Analysis
    Shen, Xun
    Sasahara, Hampei
    Imura, Jun-ichi
    Oku, Makito
    Aihara, Kazuyuki
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [35] Ultra-early medical treatment-oriented system identification using High-Dimension Low-Sample-Size data
    Shen, Xun
    Shimada, Naruto
    Sasahara, Hampei
    Imura, Jun-ichi
    [J]. IFAC JOURNAL OF SYSTEMS AND CONTROL, 2024, 27
  • [36] Design of input assignment and feedback gain for re-stabilizing undirected networks with High-Dimension Low-Sample-Size data
    Yasukata, Hitoshi
    Shen, Xun
    Sasahara, Hampei
    Imura, Jun-ichi
    Oku, Makito
    Aihara, Kazuyuki
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (12) : 6734 - 6753
  • [37] SMURC: High-Dimension Small-Sample Multivariate Regression With Covariance Estimation
    Bayar, Belhassen
    Bouaynaya, Nidhal
    Shterenberg, Roman
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2017, 21 (02) : 573 - 581
  • [38] Deep Neural Networks for High Dimension, Low Sample Size Data
    Liu, Bo
    Wei, Ying
    Zhang, Yu
    Yang, Qiang
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2287 - 2293
  • [39] High-dimension, low-sample size perspectives in constrained statistical inference: The SARSCoV RNA genome in illustration
    Sen, Pranab K.
    Tsai, Ming-Tien
    Jou, Yuh-Shan
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2007, 102 (478) : 686 - 694
  • [40] OPIT: A Simple but Effective Method for Sparse Subspace Tracking in High-Dimension and Low-Sample-Size Context
    Le, Thanh Trung
    Abed-Meraim, Karim
    Trung, Nguyen Linh
    Hafiane, Adel
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 521 - 534