An Efficient Approach to Select Instances in Self-Training and Co-Training Semi-Supervised Methods

被引:10
|
作者
Ovidio Vale, Karliane Medeiros [1 ]
Gorgonio, Arthur Costa [2 ]
Gorgonio, Flavius Da Luz E. [1 ]
De Paula Canuto, Anne Magaly [2 ]
机构
[1] Univ Fed Rio Grande do Norte, Dept Computat & Technol, BR-59300000 Natal, RN, Brazil
[2] Univ Fed Rio Grande do Norte, Dept Informat & Appl Math, BR-59078970 Natal, RN, Brazil
关键词
Semisupervised learning; Training; Labeling; Classification algorithms; Prediction algorithms; Machine learning; Supervised learning; Artificial intelligence; machine learning; semi-supervised learning; self-training semi-supervised method; co-training semi-supervised method; CLASSIFICATION;
D O I
10.1109/ACCESS.2021.3138682
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semi-supervised learning is a machine learning approach that integrates supervised and unsupervised learning mechanisms. In this learning, most of labels in the training set are unknown, while there is a small part of data that has known labels. The semi-supervised learning is attractive due to its potential to use labeled and unlabeled data to perform better than supervised learning. This paper consists of a study in the field of semi-supervised learning and implements changes on two well-known semi-supervised learning algorithms: self-training and co-training. In the literature, it is common to develop researches that change the structure of these algorithms, however, none of them proposes automating the labeling process of unlabeled instances, which is the main purpose of this work. In order to achieve this goal, three methods are proposed: FlexCon-G, FlexCon and FlexCon-C. The main difference among these methods is the way in which the confidence rate is calculated and the strategy used to select a label in each iteration. In order to evaluate the proposed methods' performance, an empirical analysis is conducted, in which the performance of these methods has been evaluated on 30 datasets with different characteristics. The obtained results indicate that all three proposed methods perform better than the original self-training and co-training methods, in most analysed cases.
引用
收藏
页码:7254 / 7276
页数:23
相关论文
共 50 条
  • [21] Semi-supervised learning combining co-training with active learning
    Zhang, Yihao
    Wen, Junhao
    Wang, Xibin
    Jiang, Zhuo
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (05) : 2372 - 2378
  • [22] Analysis of training data using clustering to improve semi-supervised self-training
    Piroonsup, N.
    Sinthupinyo, S.
    [J]. KNOWLEDGE-BASED SYSTEMS, 2018, 143 : 65 - 80
  • [23] Semi-supervised Co-training Algorithm Based on Assisted Learning
    Wang, Hong-li
    Cui, Rong-yi
    [J]. APPLIED INFORMATICS AND COMMUNICATION, PT 2, 2011, 225 : 538 - 545
  • [24] A Mutually Attentive Co-Training Framework for Semi-Supervised Recognition
    Min, Shaobo
    Chen, Xuejin
    Xie, Hongtao
    Zha, Zheng-Jun
    Zhang, Yongdong
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 899 - 910
  • [25] Semi-supervised Learning with Multi-Head Co-Training
    Chen, Mingcai
    Du, Yuntao
    Zhang, Yi
    Qian, Shuwei
    Wang, Chongjun
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 6278 - 6286
  • [26] Semi-supervised Co-training Algorithm Based on Assisted Learning
    Wang, Hong-li
    Cui, Rong-yi
    [J]. 2010 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION (PACIIA2010), VOL II, 2010, : 326 - 329
  • [27] Semi-supervised Learning Based on Improved Co-training by Committee
    Liu, Kun
    Guo, Yuwei
    Wang, Shuang
    Wu, Linsheng
    Yue, Bo
    Hou, Biao
    [J]. INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: BIG DATA AND MACHINE LEARNING TECHNIQUES, ISCIDE 2015, PT II, 2015, 9243 : 413 - 421
  • [28] CO-ADAPTATION: ADAPTIVE CO-TRAINING FOR SEMI-SUPERVISED LEARNING
    Tur, Gokhan
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3721 - 3724
  • [29] Semi-supervised hyperspectral classification from a small number of training samples using a co-training approach
    Romaszewski, Michal
    Glomb, Przemyslaw
    Cholewa, Michal
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 121 : 60 - 76
  • [30] Semi-supervised learning with ensemble self-training for cancer classification
    Wang, Qingyong
    Xia, Liang-Yong
    Chai, Hua
    Zhou, Yun
    [J]. 2018 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2018, : 796 - 803