Error Estimation Procedure for Large Dimensionality Data with Small Sample Sizes

被引:0
|
作者
Williams, Arnold [1 ]
Wagner, Gregory [1 ]
机构
[1] Raytheon Missile Syst, Tucson, AZ 85706 USA
来源
关键词
ATR Performance Evaluation; Bayes Error Estimation; Friedman-Rafsky tests; small sample size tests; INTRINSIC DIMENSIONALITY;
D O I
10.1117/12.819272
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Using multivariate data analysis to estimate the classification error rates and separability between sets of data samples is a useful tool for understanding the characteristics of data sets. By understanding the classifiability and separability of the data, one can better direct the appropriate resources and effort to achieve the desired performance. The following report describes our procedure for estimating the separability of given data sets. The multivariate tools described in this paper include calculating the intrinsic dimensionality estimates, Bayes error estimates, and the Friedman-Rafsky tests. These analysis techniques are based on previous work used to evaluate data for synthetic aperture radar (SAR) automatic target recognition (ATR), but the current work is unique in the methods used to analyze large dimensionality sets with a small number of samples. The results of this report show that our procedure can quantitatively measure the performance between two data sets in both the measure and feature space with the Bayes error estimator procedure and the Friedman-Rafsky test, respectively. Our procedure, which included the error estimation and Friedman-Rafsky test, is used to evaluate SAR data but can be used as effective ways to measure the classifiability of many other multidimensional data sets.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Error bounds for asymptotic approximations of the linear discriminant function when the sample sizes and dimensionality are large
    Fujikoshi, Y
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2000, 73 (01) : 1 - 17
  • [2] Cross-validation failure: Small sample sizes lead to large error bars
    Varoquaux, Gael
    [J]. NEUROIMAGE, 2018, 180 : 68 - 77
  • [3] Out-of-Sample Error Estimation: the Blessing of High Dimensionality
    Oneto, Luca
    Ghio, Alessandro
    Ridella, Sandro
    Ortiz, Jorge Luis Reyes
    Anguita, Davide
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 637 - 644
  • [4] Nonparametric analysis of clustered data in diagnostic trials: Estimation problems in small sample sizes
    Konietschke, Frank
    Brunner, Edgar
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2009, 53 (03) : 730 - 741
  • [5] Estimation of house prices in regions with small sample sizes
    Pereira, Luis Nobre
    Coelho, Pedro Simoes
    [J]. ANNALS OF REGIONAL SCIENCE, 2013, 50 (02): : 603 - 621
  • [6] Estimation of individual reference intervals in small sample sizes
    Hansen, Ase Marie
    Garde, Anne Helene
    Eller, Nanna Hurwitz
    [J]. INTERNATIONAL JOURNAL OF HYGIENE AND ENVIRONMENTAL HEALTH, 2007, 210 (3-4) : 471 - 478
  • [7] Estimation of house prices in regions with small sample sizes
    Luis Nobre Pereira
    Pedro Simões Coelho
    [J]. The Annals of Regional Science, 2013, 50 : 603 - 621
  • [8] Novel analytical methods to interpret large sequencing data from small sample sizes
    Lichou, Florence
    Orazio, Sebastien
    Dulucq, Stephanie
    Etienne, Gabriel
    Longy, Michel
    Hubert, Christophe
    Groppi, Alexis
    Monnereau, Alain
    Mahon, Francois-Xavier
    Turcq, Beatrice
    [J]. HUMAN GENOMICS, 2019, 13 (01) : 41
  • [9] Novel analytical methods to interpret large sequencing data from small sample sizes
    Florence Lichou
    Sébastien Orazio
    Stéphanie Dulucq
    Gabriel Etienne
    Michel Longy
    Christophe Hubert
    Alexis Groppi
    Alain Monnereau
    François-Xavier Mahon
    Béatrice Turcq
    [J]. Human Genomics, 13
  • [10] Assessing the dimensionality of item response matrices with small sample sizes and short test lengths
    De Champlain, A
    Gessaroli, ME
    [J]. APPLIED MEASUREMENT IN EDUCATION, 1998, 11 (03) : 231 - 253