Multicore based least confidence query sampling strategy to speed up active learning approach for named entity recognition

被引：0

作者：

Ankit Agrawal

Sarsij Tripathi

Manu Vardhan

机构：

[1] National Institute of Technology Raipur,

[2] Motilal Nehru National Institute of Technology Allahabad,undefined

来源：

Computing | 2023年 / 105卷

关键词：

Least confidence; Active learning; Named entity recognition; Speed up; 68U15; 68U01; 68W10; 68T50;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In the present era, there is a large amount of new data available readily from different sources to collect and store. One of the main problems is to label these new data for various machine learning applications correctly. The active learning approach presents a unique case of machine learning which is widely used to solve the above problem by significantly minimizing the need for labeled data. It aims to select the most appropriate samples from the unlabeled data to be correctly labeled by the oracle and is passed to train the active learner incrementally. There are several different query sampling strategies that exist using which the appropriate samples are selected. One of the main problems with the active learning approach is that it is very time-consuming. So in this research work, a new multi-core-based algorithm is proposed to speed up the active learning approach, which can utilize the complete computational resources present in the system. The experiments have been performed for the problem of named entity recognition which deals with labeling the sequences of words in an unstructured text by classifying them into pre-existing categories. The proposed algorithm is evaluated in terms of both: the performance and execution time over three named entity recognition corpus of distinct biomedical domains. The evaluation results shows considerable improvement in terms of execution time for the proposed active learning algorithm than the existing active learning approach.

引用

页码：979 / 997

页数：18

共 35 条

[1] Multicore based least confidence query sampling strategy to speed up active learning approach for named entity recognition
Agrawal, Ankit
Tripathi, Sarsij
Vardhan, Manu
COMPUTING, 2023, 105 (05) : 979 - 997
[2] Active learning approach using a modified least confidence sampling strategy for named entity recognition
Agrawal, Ankit
Tripathi, Sarsij
Vardhan, Manu
PROGRESS IN ARTIFICIAL INTELLIGENCE, 2021, 10 (02) : 113 - 128
[3] Active learning approach using a modified least confidence sampling strategy for named entity recognition
Ankit Agrawal
Sarsij Tripathi
Manu Vardhan
Progress in Artificial Intelligence, 2021, 10 : 113 - 128
[4] Uncertainty query sampling strategies for active learning of named entity recognition task
Agrawal, Ankit
Tripathi, Sarsij
Vardhan, Manu
INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2021, 15 (01): : 99 - 114
[5] A Variance Based Active Learning Approach for Named Entity Recognition
Hassanzadeh, Hamed
Keyvanpour, MohammadReza
INTELLIGENT COMPUTING AND INFORMATION SCIENCE, PT II, 2011, 135 : 347 - +
[6] LTP: A New Active Learning Strategy for CRF-Based Named Entity Recognition
Liu, Mingyi
Tu, Zhiying
Zhang, Tong
Su, Tonghua
Xu, Xiaofei
Wang, Zhongjie
NEURAL PROCESSING LETTERS, 2022, 54 (03) : 2433 - 2454
[7] LTP: A New Active Learning Strategy for CRF-Based Named Entity Recognition
Mingyi Liu
Zhiying Tu
Tong Zhang
Tonghua Su
Xiaofei Xu
Zhongjie Wang
Neural Processing Letters, 2022, 54 : 2433 - 2454
[8] The Named Entity Recognition of Chinese Cybersecurity Using an Active Learning Strategy
Xie, Bo
Shen, Guowei
Guo, Chun
Cui, Yunhe
WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
[9] Loss-based Active Learning for Named Entity Recognition
Linh, Le Thai
Nguyen, Minh-Tien
Zuccon, Guido
Demartini, Gianluca
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[10] Subsequence Based Deep Active Learning for Named Entity Recognition
Radmard, Puria
Fathullah, Yassir
Lipani, Aldo
59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4310 - 4321

← 1 2 3 4 →