Efficient Cluster-Based Boosting for Semisupervised Classification

被引:6
|
作者
Soares, Rodrigo G. F. [1 ]
Chen, Huanhuan [2 ]
Yao, Xin [3 ]
机构
[1] Univ Fed Rural Pernambuco, Dept Informat, BR-52171900 Recife, PE, Brazil
[2] Univ Sci & Technol China, Sch Comp Sci, UBRI, Hefei 230027, Anhui, Peoples R China
[3] Southern Univ Sci & Technol, Dept Comp Sci & Engn, Shenzhen Key Lab Computat Intelligence, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Cluster-based regularization; ensemble learning; multiclass classification; semisupervised classification;
D O I
10.1109/TNNLS.2018.2809623
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semisupervised classification (SSC) consists of using both labeled and unlabeled data to classify unseen instances. Due to the large number of unlabeled data typically available, SSC algorithms must be able to handle large-scale data sets. Recently, various ensemble algorithms have been introduced with improved generalization performance when compared to single classifiers. However, existing ensemble methods are not able to handle typical large-scale data sets. We propose efficient cluster-based boosting (ECB), a multiclass SSC algorithm with cluster-based regularization that avoids generating decision boundaries in high-density regions. A semisupervised selection procedure reduces time and space complexities by selecting only the most informative unlabeled instances for the training of each base learner. We provide evidences to demonstrate that ECB is able to achieve good performance with small amounts of selected data and a relatively small number of base learners. Our experiments confirmed that ECB scales to large data sets while delivering comparable generalization to state-of-the-art methods.
引用
收藏
页码:5667 / 5680
页数:14
相关论文
共 50 条
  • [1] A Cluster-Based Semisupervised Ensemble for Multiclass Classification
    Soares, Rodrigo G. F.
    Chen, Huanhuan
    Yao, Xin
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2017, 1 (06): : 408 - 420
  • [2] Cluster-Based Boosting
    Miller, L. Dee
    Soh, Leen-Kiat
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (06) : 1491 - 1504
  • [3] Semisupervised Hyperspectral Image Classification With Cluster-Based Conditional Generative Adversarial Net
    Zhao, Wenzhi
    Chen, Xuehong
    Bo, Yanchen
    Chen, Jiage
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (03) : 539 - 543
  • [4] CUSBoost: Cluster-based Under-sampling with Boosting for Imbalanced Classification
    Rayhan, Farshid
    Ahmed, Sajid
    Mahbub, Asif
    Jani, Md. Rafsan
    Shatabda, Swakkhar
    Farid, Dewan Md.
    [J]. 2017 2ND INTERNATIONAL CONFERENCE ON COMPUTATIONAL SYSTEMS AND INFORMATION TECHNOLOGY FOR SUSTAINABLE SOLUTION (CSITSS-2017), 2017, : 70 - 75
  • [5] Cluster-based adaptive metric classification
    Giotis, Ioannis
    Petkov, Nicolai
    [J]. NEUROCOMPUTING, 2012, 81 : 33 - 40
  • [6] Cluster-based data relabelling for classification
    Wan, Huan
    Wang, Hui
    Scotney, Bryan
    Liu, Jun
    Wei, Xin
    [J]. INFORMATION SCIENCES, 2023, 648
  • [7] Semisupervised Classification With Cluster Regularization
    Soares, Rodrigo G. F.
    Chen, Huanhuan
    Yao, Xin
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (11) : 1779 - 1792
  • [8] Decoupling of clustering and classification steps in a cluster-based classification
    Hashemi, RR
    Bahar, M
    Childers, C
    Tyler, AA
    [J]. ICMLA 2005: FOURTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2005, : 285 - 290
  • [9] Efficient cluster-based portfolio optimization
    Bnouachir, Najla
    Mkhadri, Abdallah
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2021, 50 (11) : 3241 - 3255
  • [10] Cluster-Based Tensorial Semisupervised Discriminant Analysis for Feature Extraction of SAR Images
    Wu, Xiaoying
    Wen, Xianbin
    Yuan, Liming
    Guo, Changlun
    Xu, Haixia
    [J]. IEEE ACCESS, 2019, 7 : 84318 - 84332