Multicore based least confidence query sampling strategy to speed up active learning approach for named entity recognition

被引:0
|
作者
Ankit Agrawal
Sarsij Tripathi
Manu Vardhan
机构
[1] National Institute of Technology Raipur,
[2] Motilal Nehru National Institute of Technology Allahabad,undefined
来源
Computing | 2023年 / 105卷
关键词
Least confidence; Active learning; Named entity recognition; Speed up; 68U15; 68U01; 68W10; 68T50;
D O I
暂无
中图分类号
学科分类号
摘要
In the present era, there is a large amount of new data available readily from different sources to collect and store. One of the main problems is to label these new data for various machine learning applications correctly. The active learning approach presents a unique case of machine learning which is widely used to solve the above problem by significantly minimizing the need for labeled data. It aims to select the most appropriate samples from the unlabeled data to be correctly labeled by the oracle and is passed to train the active learner incrementally. There are several different query sampling strategies that exist using which the appropriate samples are selected. One of the main problems with the active learning approach is that it is very time-consuming. So in this research work, a new multi-core-based algorithm is proposed to speed up the active learning approach, which can utilize the complete computational resources present in the system. The experiments have been performed for the problem of named entity recognition which deals with labeling the sequences of words in an unstructured text by classifying them into pre-existing categories. The proposed algorithm is evaluated in terms of both: the performance and execution time over three named entity recognition corpus of distinct biomedical domains. The evaluation results shows considerable improvement in terms of execution time for the proposed active learning algorithm than the existing active learning approach.
引用
收藏
页码:979 / 997
页数:18
相关论文
共 35 条
  • [1] Multicore based least confidence query sampling strategy to speed up active learning approach for named entity recognition
    Agrawal, Ankit
    Tripathi, Sarsij
    Vardhan, Manu
    COMPUTING, 2023, 105 (05) : 979 - 997
  • [2] Active learning approach using a modified least confidence sampling strategy for named entity recognition
    Agrawal, Ankit
    Tripathi, Sarsij
    Vardhan, Manu
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2021, 10 (02) : 113 - 128
  • [3] Active learning approach using a modified least confidence sampling strategy for named entity recognition
    Ankit Agrawal
    Sarsij Tripathi
    Manu Vardhan
    Progress in Artificial Intelligence, 2021, 10 : 113 - 128
  • [4] Uncertainty query sampling strategies for active learning of named entity recognition task
    Agrawal, Ankit
    Tripathi, Sarsij
    Vardhan, Manu
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2021, 15 (01): : 99 - 114
  • [5] A Variance Based Active Learning Approach for Named Entity Recognition
    Hassanzadeh, Hamed
    Keyvanpour, MohammadReza
    INTELLIGENT COMPUTING AND INFORMATION SCIENCE, PT II, 2011, 135 : 347 - +
  • [6] LTP: A New Active Learning Strategy for CRF-Based Named Entity Recognition
    Liu, Mingyi
    Tu, Zhiying
    Zhang, Tong
    Su, Tonghua
    Xu, Xiaofei
    Wang, Zhongjie
    NEURAL PROCESSING LETTERS, 2022, 54 (03) : 2433 - 2454
  • [7] LTP: A New Active Learning Strategy for CRF-Based Named Entity Recognition
    Mingyi Liu
    Zhiying Tu
    Tong Zhang
    Tonghua Su
    Xiaofei Xu
    Zhongjie Wang
    Neural Processing Letters, 2022, 54 : 2433 - 2454
  • [8] The Named Entity Recognition of Chinese Cybersecurity Using an Active Learning Strategy
    Xie, Bo
    Shen, Guowei
    Guo, Chun
    Cui, Yunhe
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [9] Loss-based Active Learning for Named Entity Recognition
    Linh, Le Thai
    Nguyen, Minh-Tien
    Zuccon, Guido
    Demartini, Gianluca
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [10] Subsequence Based Deep Active Learning for Named Entity Recognition
    Radmard, Puria
    Fathullah, Yassir
    Lipani, Aldo
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4310 - 4321