High-dimension, low-sample size perspectives in constrained statistical inference: The SARSCoV RNA genome in illustration

被引:7
|
作者
Sen, Pranab K. [1 ]
Tsai, Ming-Tien
Jou, Yuh-Shan
机构
[1] Univ N Carolina, Dept Biostat, Chapel Hill, NC 27599 USA
[2] Univ N Carolina, Dept Stat & Operat Res, Chapel Hill, NC 27599 USA
关键词
Hamming distance; marginal approach; nuisance functional contours; resampling plan; union-intersection principle;
D O I
10.1198/016214507000000077
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
High-dimensional categorical data models, often with inadequately large sample sizes, crop up in many fields of application. The SARS epidemic, originating in southern China in 2002, had an identified single-stranded and positive-sense RNA virus with large genome size and moderate mutation rate. The present genomic study is used as a prime illustration for motivating appropriate statistical methodology for comprehending the genomic variation in such high-dimensional categorical data models. Because of underlying restraints, a pseudomarginal approach based on Hamming distance is considered in a constrained statistical inference setup. The union-intersection principle and jackknifing methods are incorporated in exploring appropriate statistical procedures.
引用
收藏
页码:686 / 694
页数:9
相关论文
共 38 条
  • [1] Statistical Significance of Clustering for High-Dimension, Low-Sample Size Data
    Liu, Yufeng
    Hayes, David Neil
    Nobel, Andrew
    Marron, J. S.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (483) : 1281 - 1293
  • [2] Classification for high-dimension low-sample size data
    Shen, Liran
    Er, Meng Joo
    Yin, Qingbo
    [J]. PATTERN RECOGNITION, 2022, 130
  • [3] Classification for high-dimension low-sample size data
    Shen, Liran
    Er, Meng Joo
    Yin, Qingbo
    [J]. PATTERN RECOGNITION, 2022, 130
  • [4] Experimental Analysis of Feature Selection Stability for High-Dimension and Low-Sample Size Gene Expression Classification Task
    Dernoncourt, David
    Hanczar, Blaise
    Zucker, Jean-Daniel
    [J]. IEEE 12TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS & BIOENGINEERING, 2012, : 350 - 355
  • [5] Structural Classification based Correlation and its Application to Principal Component Analysis for High-Dimension Low-Sample Size Data
    Sato-Ilic, Mika
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2012,
  • [6] Some considerations of classification for high dimension low-sample size data
    Zhang, Lingsong
    Lin, Xihong
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2013, 22 (05) : 537 - 550
  • [7] Unsupervised classification of high-dimension and low-sample data with variational autoencoder based dimensionality reduction
    Mahmud, Mohammad Sultan
    Fu, Xianghua
    [J]. 2019 IEEE 4TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2019), 2019, : 498 - 503
  • [8] Random forest kernel for high-dimension low sample size classification
    Lucca Portes Cavalheiro
    Simon Bernard
    Jean Paul Barddal
    Laurent Heutte
    [J]. Statistics and Computing, 2024, 34
  • [9] Random forest kernel for high-dimension low sample size classification
    Cavalheiro, Lucca Portes
    Bernard, Simon
    Barddal, Jean Paul
    Heutte, Laurent
    [J]. STATISTICS AND COMPUTING, 2024, 34 (01)
  • [10] The high-dimension, low-sample-size geometric representation holds under mild conditions
    Ahn, Jeongyoun
    Marron, J. S.
    Muller, Keith M.
    Chi, Yueh-Yun
    [J]. BIOMETRIKA, 2007, 94 (03) : 760 - 766