An adaptive classifier design for high-dimensional data analysis with a limited training data set

被引:126
|
作者
Jackson, Q [1 ]
Landgrebe, DA [1 ]
机构
[1] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
来源
关键词
adaptive iterative classifier; high-dimensional data; labeled samples; limited training data set; semilabeled samples;
D O I
10.1109/36.975001
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In this paper, we propose a self-learning and self-improving adaptive classifier to mitigate the problem of small training sample size that can severely affect the recognition accuracy of classifiers when the dimensionality of the multispectral data is high. This proposed adaptive classifier utilizes classified samples (referred as semilabeled samples) in addition to original training samples iteratively. In order to control the influence of semilabeled samples, the proposed method gives full weight to the training samples and reduced weight to semilabeled samples. We show that by using additional semilabeled samples that are available without extra cost, the additional class label information may be extracted and utilized to enhance statistics estimation and hence improve the classifier performance, and therefore the Hughes phenomenon (peak phenomenon) may be mitigated. Experimental results show this proposed adaptive classifier can improve the classification accuracy as well as representation of estimated statistics significantly.
引用
收藏
页码:2664 / 2679
页数:16
相关论文
共 50 条
  • [31] Semi-supervised classifier ensemble model for high-dimensional data
    Niu, Xufeng
    Ma, Wenping
    INFORMATION SCIENCES, 2023, 643
  • [32] RBPCP: Visualization on Multi-set High-dimensional Data
    Xie, Weiqiang
    Wei, Yingmei
    Ma, Hao
    Du, Xiaolei
    2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), 2017, : 16 - 20
  • [33] Adaptive local Principal Component Analysis improves the clustering of high-dimensional data
    Migenda, Nico
    Moeller, Ralf
    Schenck, Wolfram
    PATTERN RECOGNITION, 2024, 146
  • [34] Data-adaptive test for high-dimensional multivariate analysis of variance problem
    Zhang, Mingjuan
    Zhou, Cheng
    He, Yong
    Liu, Bin
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2018, 60 (04) : 447 - 470
  • [35] A novel support vector classifier for longitudinal high-dimensional data and its application to neuroimaging data
    Chen S.
    Dubois Bowman F.
    Statistical Analysis and Data Mining, 2011, 4 (06): : 604 - 611
  • [36] FEATURE SELECTION FOR HIGH-DIMENSIONAL DATA ANALYSIS
    Verleysen, Michel
    NCTA 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON NEURAL COMPUTATION THEORY AND APPLICATIONS, 2011, : IS23 - IS25
  • [37] Stringing High-Dimensional Data for Functional Analysis
    Chen, Kun
    Chen, Kehui
    Mueller, Hans-Georg
    Wang, Jane-Ling
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (493) : 275 - 284
  • [38] QUADRATIC DISCRIMINANT ANALYSIS FOR HIGH-DIMENSIONAL DATA
    Wu, Yilei
    Qin, Yingli
    Zhu, Mu
    STATISTICA SINICA, 2019, 29 (02) : 939 - 960
  • [39] The Role Of Hubness in High-dimensional Data Analysis
    Tomasev, Nenad
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2014, 38 (04): : 387 - 388
  • [40] The role of hubness in high-dimensional data analysis
    Tomašev, Nenad
    Informatica (Slovenia), 2014, 38 (04): : 387 - 388