Text classification with active learning

Cited by: 4

Authors:

Novak, B [1]

Mladenic, D [1]

Grobelnik, M [1]

Affiliations:

[1] Jozef Stefan Inst, Jamova 39, Ljubljana 1000, Slovenia

Source:

FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING | 2006

Keywords:

DOI:

10.1007/3-540-31314-1_48

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

In many real-world machine learning tasks, labeled training examples are expensive to obtain, while large numbers of unlabeled examples are readily available. Text classification is one such class of learning problems. Active learning strives to reduce the required labeling effort while retaining accuracy by intelligently selecting the examples to be labeled. However, very few comparisons of different active learning methods exist. The effect of the ratio of positive to negative examples on the accuracy of such algorithms has also received very little attention. This paper presents a comparison of the two most promising methods and their performance on a range of categories from the Reuters Corpus Vol. I news article dataset.
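
The abstract does not name the two methods that the paper compares, so the following is only a minimal sketch of one common selection strategy, uncertainty sampling, and should not be read as the authors' method. It assumes a binary task (1 = positive category, 0 = negative) and uses a toy naive Bayes classifier; the function names `train`, `prob_positive`, and `select_most_uncertain` and the example texts are hypothetical and chosen for illustration only.

```python
import math
from collections import Counter


def train(labeled):
    """Fit a tiny multinomial naive Bayes model on (text, label) pairs."""
    counts = {0: Counter(), 1: Counter()}  # word counts per class
    docs = {0: 0, 1: 0}                    # document counts per class
    for text, label in labeled:
        counts[label].update(text.split())
        docs[label] += 1
    vocab = set(counts[0]) | set(counts[1])
    return counts, docs, vocab


def prob_positive(model, text):
    """Estimate P(label = 1 | text) with Laplace smoothing."""
    counts, docs, vocab = model
    total_docs = docs[0] + docs[1]
    log_score = {}
    for label in (0, 1):
        total_words = sum(counts[label].values())
        score = math.log((docs[label] + 1) / (total_docs + 2))
        for word in text.split():
            if word in vocab:  # words never seen in training are ignored
                score += math.log(
                    (counts[label][word] + 1) / (total_words + len(vocab))
                )
        log_score[label] = score
    # Normalise the two log scores into a probability.
    top = max(log_score.values())
    e0 = math.exp(log_score[0] - top)
    e1 = math.exp(log_score[1] - top)
    return e1 / (e0 + e1)


def select_most_uncertain(model, pool, k=1):
    """Return the k unlabeled texts whose P(positive) is closest to 0.5."""
    return sorted(pool, key=lambda t: abs(prob_positive(model, t) - 0.5))[:k]


# Two labeled seed examples and a small unlabeled pool (made-up data).
labeled = [("stocks fall on profit warning", 1), ("team wins cup final", 0)]
pool = [
    "profit rises for bank",
    "cup final replay",
    "cup sponsor issues profit warning",
]

model = train(labeled)
query = select_most_uncertain(model, pool, k=1)
print(query)  # the text that would be sent to a human annotator next
```

In a full active learning loop, the selected text would be labeled by an annotator, moved from `pool` to `labeled`, and the model retrained before the next query. With only one seed example per class, the model starts from a balanced prior; a skewed positive-to-negative ratio, the factor the abstract highlights, would shift these probabilities and therefore which examples get queried.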

Citation

Pages: 398 / +

Page count: 2
