Active Learning Based on Transfer Learning Techniques for Text Classification

被引:8
|
作者
Onita, Daniela [1 ,2 ]
机构
[1] Univ Bucharest, Dept Comp Sci, Bucharest 050663, Romania
[2] 1 Decembrie 1918 Univ Alba Iulia, Dept Comp Sci, Alba Iulia 515900, Romania
关键词
Active learning; active transfer learning; text classification; transfer learning;
D O I
10.1109/ACCESS.2023.3260771
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text preprocessing is a common task in machine learning applications that involves hand-labeling sets. Although automatic and semi-automatic annotation of text data is a growing field, researchers need to develop models that use resources as efficiently as possible for a learning task. The goal of this work was to learn faster with fewer resources. In this paper, the combination of active and transfer learning was examined with the purpose of developing an effective text categorization method. These two forms of learning have proven their efficiency and capacity to train correct models with substantially less training data. We considered three types of criteria for selecting training points: random selection, uncertainty sampling criterion and active transfer selection. Experimental evaluation was performed on five data sets from different domains. The findings of the experiments suggest that by combining active and transfer learning, the algorithm performs better with fewer labels than random selection of training points.
引用
收藏
页码:28751 / 28761
页数:11
相关论文
共 50 条
  • [21] Deep active learning for multi label text classification
    Wang, Qunbo
    Zhang, Hangu
    Zhang, Wentao
    Dai, Lin
    Liang, Yu
    Shi, Haobin
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [22] Using Active Learning in Text Classification of Quranic Sciences
    Goudjil, Mohamed
    Bedda, Mouldi
    Koudil, Mouloud
    Ghoggali, Noureddine
    2013 TAIBAH UNIVERSITY INTERNATIONAL CONFERENCE ON ADVANCES IN INFORMATION TECHNOLOGY FOR THE HOLY QURAN AND ITS SCIENCES, 2013, : 209 - 213
  • [23] Active Learning for Text Classification and Fake News Detection
    Sahan, Marko
    Smidl, Vaclav
    Marik, Radek
    2021 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROLS (ISCSIC 2021), 2021, : 87 - 94
  • [24] Deep Active Learning for Text Classification with Diverse Interpretations
    Liu, Qiang
    Zhu, Yanqiao
    Liu, Zhaocheng
    Zhang, Yufeng
    Wu, Shu
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3263 - 3267
  • [25] Barrage Text Classification with Improved Active Learning and CNN
    Qiu, Ningjia
    Cong, Lin
    Zhou, Sicheng
    Wang, Peng
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2019, 23 (06) : 980 - 989
  • [26] Small-Text: Active Learning for Text Classification in Python']Python
    Schroeder, Christopher
    Mueller, Lydia
    Niekler, Andreas
    Potthast, Martin
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 84 - 95
  • [27] Active Learning for Biomedical Text Classification Based on Automatically Generated Regular Expressions
    Flores, Christopher A.
    Figueroa, Rosa L.
    Pezoa, Jorge E.
    Flores, Christopher A. (christopher.flores@biomedica.udec.cl), 1600, Institute of Electrical and Electronics Engineers Inc. (09): : 38767 - 38777
  • [28] Stopping Active Learning based on Predicted Change of F Measure for Text Classification
    Altschuler, Michael
    Bloodgood, Michael
    2019 13TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2019, : 47 - 54
  • [29] Active Learning for Biomedical Text Classification Based on Automatically Generated Regular Expressions
    Flores, Christopher A.
    Figueroa, Rosa L.
    Pezoa, Jorge E.
    IEEE ACCESS, 2021, 9 : 38767 - 38777
  • [30] Lost in Transduction: Transductive Transfer Learning in Text Classification
    Moreo, Alejandro
    Esuli, Andrea
    Sebastiani, Fabrizio
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (01)