An adaptive classifier design for high-dimensional data analysis with a limited training data set

被引:126
|
作者
Jackson, Q [1 ]
Landgrebe, DA [1 ]
机构
[1] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
来源
关键词
adaptive iterative classifier; high-dimensional data; labeled samples; limited training data set; semilabeled samples;
D O I
10.1109/36.975001
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In this paper, we propose a self-learning and self-improving adaptive classifier to mitigate the problem of small training sample size that can severely affect the recognition accuracy of classifiers when the dimensionality of the multispectral data is high. This proposed adaptive classifier utilizes classified samples (referred as semilabeled samples) in addition to original training samples iteratively. In order to control the influence of semilabeled samples, the proposed method gives full weight to the training samples and reduced weight to semilabeled samples. We show that by using additional semilabeled samples that are available without extra cost, the additional class label information may be extracted and utilized to enhance statistics estimation and hence improve the classifier performance, and therefore the Hughes phenomenon (peak phenomenon) may be mitigated. Experimental results show this proposed adaptive classifier can improve the classification accuracy as well as representation of estimated statistics significantly.
引用
收藏
页码:2664 / 2679
页数:16
相关论文
共 50 条
  • [41] Information Analysis of High-Dimensional Data and Applications
    Yang, Xin-She
    Lee, Sanghyuk
    Lee, Sangmin
    Theera-Umpon, Nipon
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [42] Regularization techniques for high-dimensional data analysis
    Lu, Jiwen
    Peng, Xi
    Deng, Weihong
    Mian, Ajmal
    IMAGE AND VISION COMPUTING, 2017, 60 : 1 - 3
  • [43] FEATURE SELECTION FOR HIGH-DIMENSIONAL DATA ANALYSIS
    Verleysen, Michel
    ECTA 2011/FCTA 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON EVOLUTIONARY COMPUTATION THEORY AND APPLICATIONS AND INTERNATIONAL CONFERENCE ON FUZZY COMPUTATION THEORY AND APPLICATIONS, 2011,
  • [44] Supervised Feature Selection Method for High-Dimensional Data Classification in Photo-Thermal Infrared Imaging with Limited Training Data
    Zhang, Nian
    Leatham, Keenan
    2018 5TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT), 2018, : 593 - 598
  • [45] Adaptive Bernstein change detector for high-dimensional data streams
    Marco Heyden
    Edouard Fouché
    Vadim Arzamasov
    Tanja Fenn
    Florian Kalinke
    Klemens Böhm
    Data Mining and Knowledge Discovery, 2024, 38 : 1334 - 1363
  • [46] Adaptive Bernstein change detector for high-dimensional data streams
    Heyden, Marco
    Fouche, Edouard
    Arzamasov, Vadim
    Fenn, Tanja
    Kalinke, Florian
    Boehm, Klemens
    DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (03) : 1334 - 1363
  • [47] Adaptive quantization of the high-dimensional data for efficient KNN processing
    Cui, B
    Hu, J
    Shen, HT
    Yu, C
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2004, 2973 : 302 - 313
  • [48] Estimation of misclassification probability for a distance-based classifier in high-dimensional data
    Watanabe, Hiroki
    Hyodo, Masashi
    Yamada, Yuki
    Seo, Takashi
    HIROSHIMA MATHEMATICAL JOURNAL, 2019, 49 (02) : 175 - 193
  • [49] A U-classifier for high-dimensional data under non-normality
    Ahmad, M. Rauf
    Pavlenko, Tatjana
    JOURNAL OF MULTIVARIATE ANALYSIS, 2018, 167 : 269 - 283
  • [50] Classifier Ensemble Based on Multiview Optimization for High-Dimensional Imbalanced Data Classification
    Xu, Yuhong
    Yu, Zhiwen
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 870 - 883