Text classification with active learning

Cited by: 4
Authors
Novak, B [1]
Mladenic, D [1]
Grobelnik, M [1]
Affiliations
[1] Jozef Stefan Inst, Jamova 39, Ljubljana 1000, Slovenia
DOI
10.1007/3-540-31314-1_48
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In many real-world machine learning tasks, labeled training examples are expensive to obtain, while many unlabeled examples are readily available. Text classification is one such class of learning problems. Active learning strives to reduce the required labeling effort while retaining accuracy by intelligently selecting the examples to be labeled. However, very few comparisons exist between different active learning methods. The effect of the ratio of positive to negative examples on the accuracy of such algorithms has also received very little attention. This paper presents a comparison of the two most promising methods and their performance on a range of categories from the Reuters Corpus Vol. I news article dataset.
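
The abstract does not name the two methods it compares, so the following is only an illustrative sketch of how an active learner might "intelligently select" examples. It assumes pool-based uncertainty sampling for a binary category, which may or may not be one of the methods studied in the paper. The function `select_most_uncertain`, the stand-in classifier `toy_model`, and the toy documents are all hypothetical.

```python
from typing import Callable, List, Sequence


def select_most_uncertain(
    pool: Sequence[str],
    predict_positive_proba: Callable[[str], float],
    batch_size: int = 1,
) -> List[int]:
    """Return indices of the pool documents whose predicted probability
    of the positive class is closest to 0.5 (the least confident ones)."""
    # Distance from 0.5: 0.0 is maximally uncertain, 0.5 is fully confident.
    scored = [
        (abs(predict_positive_proba(doc) - 0.5), i)
        for i, doc in enumerate(pool)
    ]
    scored.sort()  # ties are broken by position in the pool
    return [i for _, i in scored[:batch_size]]


# Hypothetical stand-in for a trained classifier: each occurrence of the
# word "profit" pushes the positive-class probability up.
def toy_model(doc: str) -> float:
    return min(1.0, 0.2 + 0.3 * doc.lower().split().count("profit"))


pool = [
    "profit profit profit rises",   # confident positive
    "weather report for tuesday",   # fairly confident negative
    "quarterly profit unchanged",   # borderline
]

# Indices of the two documents an annotator would be asked to label next.
queries = select_most_uncertain(pool, toy_model, batch_size=2)
print(queries)
```

In a full loop, the selected documents would be labeled, added to the training set, and the classifier retrained before the next selection round. When positive examples are rare, as in many news categories, the borderline region can contain very few true positives, which is one reason the positive-to-negative ratio may matter for such strategies.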
Pages: 398 / +
Page count: 2