BERT-Based Transfer-Learning Approach for Nested Named-Entity Recognition Using Joint Labeling

Cited by: 21

Authors:

Agrawal, Ankit [1]

Tripathi, Sarsij [2]

Vardhan, Manu [1]

Sihag, Vikas [3]

Choudhary, Gaurav [4]

Dragoni, Nicola [4]

Affiliations:

[1] Natl Inst Technol Raipur, Dept Comp Sci & Engn, Raipur 492010, Chhattisgarh, India

[2] Motilal Nehru Natl Inst Technol Allahabad, Dept Comp Sci & Engn, Prayagraj 211004, Uttar Pradesh, India

[3] Sardar Patel Univ Police, Dept Cyber Secur, Secur & Criminal Justice, Jodhpur 342037, Rajasthan, India

[4] Tech Univ Denmark DTU, Dept Appl Math & Comp Sci, DTU Comp, DK-2800 Lyngby, Denmark

Source:

APPLIED SCIENCES-BASEL | 2022 / Vol. 12 / Issue 03

Keywords:

named-entity recognition; transfer learning; BERT model; conditional random field; pre-trained model; fine-tuning

DOI:

10.3390/app12030976

Chinese Library Classification (CLC) number:

O6 [Chemistry]

Discipline classification code:

0703

Abstract:

Named-entity recognition (NER) is a primary component of many natural language processing tasks, such as relation extraction, information retrieval, and question answering. Most research deals with flat entities, yet entities are often embedded within other entities. Most current state-of-the-art models address embedded (nested) entity recognition with very complex neural network architectures. In this work, we propose solving nested named-entity recognition with a transfer-learning approach: different variants of pretrained BERT-based language models are fine-tuned for the task using the joint-labeling modeling technique. Two nested NER datasets, GENIA and GermEval 2014, with four and two levels of annotation, respectively, were used for the experiments. Experiments were also performed on the JNLPBA dataset, which has flat annotation. Performance was measured using the F1-score, the standard metric for evaluating NER models, and the proposed approach was compared with conditional random field (CRF) and Bi-LSTM-CRF models. The fine-tuned, pretrained BERT-based models outperformed the other models significantly without requiring any external resources or feature extraction. The results were also compared with various existing approaches. The best-performing BERT-based model achieved F1-scores of 74.38, 85.29, and 80.68 on the GENIA, GermEval 2014, and JNLPBA datasets, respectively. These results indicate that the transfer-learning approach (i.e., pretrained BERT models after fine-tuning) performs well on nested named-entity recognition and is more general than many existing approaches.
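
The abstract names the joint-labeling modeling technique but does not spell it out. As a rough illustration only, the sketch below assumes the common reading of joint labeling: the BIO tags from every annotation level are concatenated into one composite tag per token, so that a standard single-label token classifier can be trained on nested data. The function names, the `+` separator, and the example tags are hypothetical and are not taken from the paper.

```python
from typing import List


def join_labels(levels: List[List[str]], sep: str = "+") -> List[str]:
    """Merge per-level BIO tags into one composite tag per token.

    `levels[k][i]` is the tag of token i at annotation level k
    (assumed layout; level 0 is the outermost entity layer).
    """
    if not levels:
        return []
    n_tokens = len(levels[0])
    if any(len(level) != n_tokens for level in levels):
        raise ValueError("all annotation levels must cover the same tokens")
    return [sep.join(tags) for tags in zip(*levels)]


def split_labels(joint: List[str], sep: str = "+") -> List[List[str]]:
    """Inverse of join_labels: recover one tag sequence per level."""
    if not joint:
        return []
    return [list(level) for level in zip(*(tag.split(sep) for tag in joint))]


# Hypothetical two-level example: "IL-2 gene" is an outer DNA entity
# that contains the inner protein entity "IL-2".
tokens = ["IL-2", "gene"]
outer = ["B-DNA", "I-DNA"]
inner = ["B-protein", "O"]

joint = join_labels([outer, inner])
print(list(zip(tokens, joint)))
```

Under this assumption, the composite tags form the label set of an ordinary token-classification head, and predictions are split back into levels before per-entity scoring. The cost is a larger and sparser label space, since each distinct combination of level tags becomes its own class.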

Pages: 20