Is there correlation between the estimated and true classification errors in small-sample settings?

Cited by: 0
Authors
Hanczar, Blaise [1]
Hua, B. Jianping [2]
Dougherty, Edward R. [1, 2]
Affiliations
[1] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA
[2] Translat Genom Res Inst, Comp Biol Div, Phoenix, AZ USA
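Keywords
error estimation; small-sample; classification
DOI
Not available
Chinese Library Classification (CLC) codes
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809
Abstract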
The validity of a classifier model, consisting of a trained classifier and its estimated error, depends upon the relationship between the estimated and true errors of the classifier. Absent a good error estimation rule, the classifier-error model lacks scientific meaning. This paper demonstrates that in high-dimensional feature-selection settings with small samples there can be virtually no correlation between the true and estimated errors. This conclusion has serious ramifications in the domain of high-throughput genomic classification, such as gene-expression classification, where the number of potential features (gene expressions) is usually in the tens of thousands and the number of sample points (microarrays) is often under one hundred.
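The kind of experiment the abstract describes can be sketched as a small Monte Carlo simulation. The abstract does not state the paper's data models, classifiers, or error estimators, so everything below is an illustrative assumption: a two-class Gaussian model, feature selection by difference of class means, a nearest-mean classifier, a leave-one-out estimate that repeats feature selection in every fold, and a "true" error approximated on a large independent test set. All function names and parameter values are hypothetical.

```python
import random


def make_sample(rng, n, n_features, n_informative, shift):
    """Draw n labeled points; only the first n_informative features carry signal."""
    points, labels = [], []
    for i in range(n):
        y = i % 2  # alternate labels so the two classes are balanced
        x = [rng.gauss(shift * y if j < n_informative else 0.0, 1.0)
             for j in range(n_features)]
        points.append(x)
        labels.append(y)
    return points, labels


def class_means(points, labels, feats):
    """Per-class mean of each feature listed in feats."""
    means = {}
    for c in (0, 1):
        rows = [p for p, y in zip(points, labels) if y == c]
        means[c] = [sum(r[j] for r in rows) / len(rows) for j in feats]
    return means


def train(points, labels, k):
    """Keep the k features with the largest class-mean gap, then fit class means."""
    all_feats = list(range(len(points[0])))
    m = class_means(points, labels, all_feats)
    ranked = sorted(all_feats, key=lambda j: -abs(m[0][j] - m[1][j]))
    feats = ranked[:k]
    return feats, class_means(points, labels, feats)


def predict(model, x):
    """Nearest-mean rule on the selected features."""
    feats, means = model
    dist = {c: sum((x[j] - mu) ** 2 for j, mu in zip(feats, means[c]))
            for c in (0, 1)}
    return 0 if dist[0] <= dist[1] else 1


def error_rate(model, points, labels):
    wrong = sum(predict(model, p) != y for p, y in zip(points, labels))
    return wrong / len(points)


def loo_estimate(points, labels, k):
    """Leave-one-out error, redoing feature selection inside every fold."""
    wrong = 0
    for i in range(len(points)):
        fold_model = train(points[:i] + points[i + 1:],
                           labels[:i] + labels[i + 1:], k)
        wrong += predict(fold_model, points[i]) != labels[i]
    return wrong / len(points)


def pearson(a, b):
    """Sample Pearson correlation; NaN if either series is constant."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / (va * vb) ** 0.5 if va > 0 and vb > 0 else float("nan")


def simulate(reps=30, n=20, n_features=50, n_informative=2, k=2,
             shift=1.0, n_test=200, seed=0):
    """Repeat: draw a small sample, train, record estimated and 'true' error."""
    rng = random.Random(seed)
    estimated, true = [], []
    for _ in range(reps):
        pts, ys = make_sample(rng, n, n_features, n_informative, shift)
        test_p, test_y = make_sample(rng, n_test, n_features,
                                     n_informative, shift)
        estimated.append(loo_estimate(pts, ys, k))
        # Hold-out error on a large test set stands in for the true error.
        true.append(error_rate(train(pts, ys, k), test_p, test_y))
    return estimated, true, pearson(estimated, true)


if __name__ == "__main__":
    est, tru, r = simulate()
    print(f"correlation between estimated and true error: {r:.3f}")
```

Raising `n_features` while holding `n` fixed moves the sketch toward the regime the abstract discusses. The toy dimensions here are kept small only so that the pure-Python loops finish quickly, and the resulting correlation should not be read as a reproduction of the paper's results.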
Pages: 16 / +
Page count: 2