CoLAL: Co-learning Active Learning for Text Classification

被引:0
|
作者
Le, Linh [1 ]
Zhao, Genghong [2 ]
Zhang, Xia [3 ]
Zuccon, Guido [1 ]
Demartini, Gianluca [1 ]
机构
[1] Univ Queensland, St Lucia, Qld, Australia
[2] Neusoft Res Intelligent Healthcare Technol Co Ltd, Shenyang, Peoples R China
[3] Neusoft Corp, Shenyang, Peoples R China
基金
瑞士国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the machine learning field, the challenge of effectively learning with limited data has become increasingly crucial. Active Learning (AL) algorithms play a significant role in this by enhancing model performance. We introduce a novel AL algorithm, termed Co-learning (CoLAL), designed to select the most diverse and representative samples within a training dataset. This approach utilizes noisy labels and predictions made by the primary model on unlabeled data. By leveraging a probabilistic graphical model, we combine two multi-class classifiers into a binary one. This classifier determines if both the main and the peer models agree on a prediction. If they do, the unlabeled sample is assumed to be easy to classify and is thus not beneficial to increase the target model's performance. We prioritize data that represents the unlabeled set without overlapping decision boundaries. The discrepancies between these boundaries can be estimated by the probability that two models result in the same prediction. Through theoretical analysis and experimental validation, we reveal that the integration of noisy labels into the peer model effectively identifies target model's potential inaccuracies. We evaluated the CoLAL method across seven benchmark datasets: four text datasets (AGNews, DBPedia, PubMed, SST-2) and text-based state-of-the-art (SOTA) baselines, and three image datasets (CIFAR100, MNIST, OpenML-155) and computer vision SOTA baselines. The results show that our CoLAL method significantly outperforms existing SOTA in text-based AL, and is competitive with SOTA image-based AL techniques.
引用
收藏
页码:13337 / 13345
页数:9
相关论文
共 50 条
  • [21] Deep Active Learning for Text Classification with Diverse Interpretations
    Liu, Qiang
    Zhu, Yanqiao
    Liu, Zhaocheng
    Zhang, Yufeng
    Wu, Shu
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3263 - 3267
  • [22] Barrage Text Classification with Improved Active Learning and CNN
    Qiu, Ningjia
    Cong, Lin
    Zhou, Sicheng
    Wang, Peng
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2019, 23 (06) : 980 - 989
  • [23] Human and Smart Machine Co-Learning
    Lee, Chang-Shing
    Wang, Mei-Hui
    Ko, Li-Wei
    Kubota, Naoyuki
    Lin, Lu-An
    Kitaoka, Shinya
    Wang, Yu-Te
    Su, Shun-Feng
    IEEE SYSTEMS MAN AND CYBERNETICS MAGAZINE, 2018, 4 (02): : 6 - 13
  • [24] Co-learning of Functions by Probabilistic Algorithms
    Kucevalovs, Ilja
    Balodis, Kaspars
    Freivalds, Rusins
    PROCEEDINGS OF THE 2ND INTERNATIONAL SYMPOSIUM ON COMPUTER, COMMUNICATION, CONTROL AND AUTOMATION, 2013, 68 : 71 - 73
  • [25] Small-Text: Active Learning for Text Classification in Python']Python
    Schroeder, Christopher
    Mueller, Lydia
    Niekler, Andreas
    Potthast, Martin
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 84 - 95
  • [26] Knowledge based Learning Action Analysis in Online Co-Learning
    Xu, Bin
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL II, PROCEEDINGS, 2008, : 162 - 166
  • [27] Robust Label and Feature Space Co-Learning for Multi-Label Classification
    Liu, Zhifeng
    Tang, Chuanjing
    Abhadiomhen, Stanley Ebhohimhen
    Shen, Xiang-Jun
    Li, Yangyang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (11) : 11846 - 11859
  • [28] On Learning and Co-Learning Effective Strategies in Iterated Travelers' Dilemma
    Tosic, Predrag T.
    2016 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2016), 2016, : 674 - 677
  • [29] Co-learning and co-teaching in a newly introduced research learning community
    Claessen, Roy J. M.
    van Ede, Annelies E.
    van Gils, Merel
    Reuzel, Rob P. B.
    van Woezik, Tamara E. T.
    van Gurp, Petra J. M.
    CLINICAL TEACHER, 2024, 21 (03):
  • [30] INTERGENERATIONAL CO-LEARNING STRATEGIES AT THE UNIVERSITY LEVEL
    Donorfio, L. K.
    Chapman, B. G.
    GERONTOLOGIST, 2010, 50 : 20 - 21