Combining Committee-Based Semi-Supervised Learning and Active Learning

Authors
Mohamed Farouk Abdel Hady
Friedhelm Schwenker
Affiliation
[1] University of Ulm, Institute of Neural Information Processing
Keywords
data mining; classification; active learning; co-training; semi-supervised learning; ensemble learning; random subspace method; decision tree; nearest neighbor classifier
DOI
Not available
Abstract
Many data mining applications have a large amount of data, but labeling it is usually difficult, expensive, or time-consuming because it requires human experts for annotation. Semi-supervised learning addresses this problem by using unlabeled data together with labeled data during training. Co-Training is a popular semi-supervised learning algorithm that assumes each example is represented by multiple sets of features (views) and that each view is sufficient for learning and independent of the others given the class. However, these assumptions are strong and are not satisfied in many real-world domains. In this paper, a single-view variant of Co-Training, called Co-Training by Committee (CoBC), is proposed, in which an ensemble of diverse classifiers is used instead of redundant and independent views. We introduce a new labeling confidence measure for unlabeled examples, based on estimating the local accuracy of the committee members in the neighborhood of each example. We then introduce two new learning algorithms, QBC-then-CoBC and QBC-with-CoBC, which combine the merits of committee-based semi-supervised learning and active learning. The random subspace method is applied to both C4.5 decision trees and 1-nearest neighbor classifiers to construct the diverse ensembles used for semi-supervised learning and active learning. Experiments show that these two combinations can outperform other non-committee-based ones.
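The committee-based self-labeling idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (one_nn, build_committee, vote, cobc_sketch) and all parameters are hypothetical. Confidence here is the fraction of committee members that agree, a simple stand-in for the paper's local-accuracy measure, whose details the abstract does not give. The committee is built with the random subspace method over 1-nearest neighbor classifiers, as the abstract mentions.

```python
import random
from collections import Counter


def one_nn(train, features):
    """Return a 1-NN classifier that only looks at the given feature indices."""
    def predict(x):
        def dist(row):
            return sum((row[0][f] - x[f]) ** 2 for f in features)
        return min(train, key=dist)[1]
    return predict


def build_committee(labeled, n_members, subspace_size, rng):
    """Random subspace method: each member sees a random subset of features."""
    n_features = len(labeled[0][0])
    committee = []
    for _ in range(n_members):
        feats = rng.sample(range(n_features), subspace_size)
        committee.append(one_nn(list(labeled), feats))
    return committee


def vote(committee, x):
    """Majority label and agreement ratio (assumed confidence measure)."""
    counts = Counter(member(x) for member in committee)
    label, n = counts.most_common(1)[0]
    return label, n / len(committee)


def cobc_sketch(labeled, unlabeled, rounds=3, per_round=2,
                n_members=5, subspace_size=1, seed=0):
    """Each round: retrain the committee, then move the most confidently
    voted unlabeled examples into the labeled set with their voted label."""
    rng = random.Random(seed)
    labeled, unlabeled = list(labeled), list(unlabeled)
    for _ in range(rounds):
        if not unlabeled:
            break
        committee = build_committee(labeled, n_members, subspace_size, rng)
        ranked = sorted(unlabeled, key=lambda x: vote(committee, x)[1],
                        reverse=True)
        for x in ranked[:per_round]:
            labeled.append((x, vote(committee, x)[0]))
            unlabeled.remove(x)
    final = build_committee(labeled, n_members, subspace_size, rng)
    return final, labeled


if __name__ == "__main__":
    seed_set = [((0.0, 0.0), 0), ((10.0, 10.0), 1)]
    pool = [(1.0, 1.0), (0.5, 1.5), (9.0, 9.0), (9.5, 8.5)]
    committee, grown = cobc_sketch(seed_set, pool)
    print(len(grown), vote(committee, (2.0, 2.0)))
```

An active learning step in the spirit of Query by Committee would rank the pool the other way, sending the examples with the lowest agreement to a human annotator. How the paper interleaves the two steps in QBC-then-CoBC and QBC-with-CoBC is not specified in the abstract and is not modeled here.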
Pages: 681 - 698
Page count: 17
Related papers
50 in total
  • [1] Combining Committee-Based Semi-Supervised Learning and Active Learning
    Mohamed Farouk Abdel Hady
    Friedhelm Schwenker
    [J]. Journal of Computer Science & Technology, 2010, 25 (04) : 681 - 698
  • [2] Combining Committee-Based Semi-Supervised Learning and Active Learning
    Hady, Mohamed Farouk Abdel
    Schwenker, Friedhelm
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2010, 25 (04) : 681 - 698
  • [3] Combining Committee-Based Semi-supervised and Active Learning and Its Application to Handwritten Digits Recognition
    Hady, Mohamed Farouk Abdel
    Schwenker, Friedhelm
    [J]. MULTIPLE CLASSIFIER SYSTEMS, PROCEEDINGS, 2010, 5997 : 225 - 234
  • [4] Acoustic model training using committee-based active and semi-supervised learning for speech recognition
    Tsutaoka, Takuya
    Shinoda, Koichi
    [J]. 2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012
  • [5] Network Security Monitoring by Combining Semi-Supervised Learning and Active Learning
    Pan, Yun
    [J]. INTERNATIONAL JOURNAL OF INFORMATION SYSTEM MODELING AND DESIGN, 2022, 13 (02)
  • [6] Semi-supervised learning combining co-training with active learning
    Zhang, Yihao
    Wen, Junhao
    Wang, Xibin
    Jiang, Zhuo
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (05) : 2372 - 2378
  • [7] Combining active learning and semi-supervised learning to construct SVM classifier
    Leng, Yan
    Xu, Xinyan
    Qi, Guanghui
    [J]. KNOWLEDGE-BASED SYSTEMS, 2013, 44 : 121 - 131
  • [8] Combining Active Learning and Semi-supervised Learning by Using Selective Label Spreading
    Chen, Xu
    Wang, Tao
    [J]. 2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2017), 2017 : 850 - 857
  • [9] Combining Semi-Supervised and Active Learning for Hyperspectral Image Classification
    Li, Mingzhi
    Wang, Rui
    Tang, Ke
    [J]. 2013 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING (CIDM), 2013 : 89 - 94
  • [10] Combining active and semi-supervised learning for spoken language understanding
    Tur, G
    Hakkani-Tür, D
    Schapire, RE
    [J]. SPEECH COMMUNICATION, 2005, 45 (02) : 171 - 186