Named Entity Recognition in Aviation Products Domain Based on BERT

Cited by: 0
Authors
Yang, Mingye [1]
Namoano, Bernadin [1]
Farsi, Maryam [1]
Erkoyuncu, John Ahmet [1]
Affiliations
[1] Cranfield Univ, Ctr Digital Engn & Mfg, Cranfield MK43 0AL, England
Source
IEEE Access | 2024 / Vol. 12
Funding
UK Engineering and Physical Sciences Research Council (EPSRC)
Keywords
Hidden Markov models; Data models; Named entity recognition; Knowledge graphs; Atmospheric modeling; Feature extraction; Data mining; Ontologies; Encoding; Biological system modeling; Aviation; named entity recognition (NER); knowledge graph; bidirectional encoder representations from transformers (BERT); bidirectional long short-term memory network (Bi-LSTM);
DOI
10.1109/ACCESS.2024.3516390
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The aviation product manufacturing industry is undergoing a profound shift towards intelligent manufacturing, and building a knowledge graph for the aviation domain has become a core step in achieving cognitive intelligence. Named entity recognition (NER) is a key step in knowledge graph construction and one of the main tasks of knowledge extraction. Because aviation product text is highly specialised and its contextual dependencies span long distances, existing models often perform poorly at entity extraction. This paper proposes an NER method tailored to the aviation product domain (BBC-Ap) that combines a domain-specific ontology with deep learning to improve the accuracy and efficiency of entity extraction from complex technical documents. The method first establishes an ontology model of aviation products and annotates the relevant text data to form a dataset for training the NER model. It then adopts a multi-level model structure based on BERT: BERT generates word vector representations, a bidirectional long short-term memory network (BiLSTM) acts as an encoder to extract semantic features, and a conditional random field (CRF) acts as a decoder to achieve optimal label assignment. In experiments on the constructed aviation product dataset, the model achieved a precision of 91.74%, a recall of 92.46%, and an F1 score of 92.1%. Compared with the baseline models, the F1 score improved by 0.9% to 1.5%. The model also performs well on standard datasets such as CoNLLpp, with a precision of 92.87%, a recall of 92.54%, and an F1 score of 92.70%. Finally, the model was used to construct a knowledge graph in Neo4j reflecting the relationships between aviation products, further demonstrating the effectiveness and practicality of the method.
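The CRF decoding step mentioned in the abstract can be illustrated with a minimal Viterbi sketch. This is not the authors' implementation: the label set (`B-PART`, `I-PART`, `O`), the emission scores, and the single transition penalty are all hypothetical values chosen for illustration. In a model of the kind described, the emission scores would come from the BiLSTM over BERT embeddings and the transition scores would be learned.

```python
from typing import Dict, List, Sequence


def viterbi_decode(
    emissions: Sequence[Dict[str, float]],
    transitions: Dict[str, Dict[str, float]],
    labels: Sequence[str],
) -> List[str]:
    """Return the label sequence with the highest total emission + transition score."""
    if not emissions:
        return []
    # best[t][y]: score of the best path that ends in label y at position t
    best = [{y: emissions[0][y] for y in labels}]
    back = []  # back[t-1][y]: previous label on the best path ending in y at t
    for t in range(1, len(emissions)):
        scores, pointers = {}, {}
        for y in labels:
            prev = max(labels, key=lambda p: best[-1][p] + transitions[p][y])
            scores[y] = best[-1][prev] + transitions[prev][y] + emissions[t][y]
            pointers[y] = prev
        best.append(scores)
        back.append(pointers)
    # Follow the back-pointers from the best final label.
    path = [max(labels, key=lambda y: best[-1][y])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return path


# Hypothetical label set and scores (assumptions, not taken from the paper).
labels = ["O", "B-PART", "I-PART"]
transitions = {p: {y: 0.0 for y in labels} for p in labels}
transitions["O"]["I-PART"] = -1e4  # an I- tag may not directly follow O

# Per-token emission scores for a three-token sentence, e.g. "turbine blade cracked".
emissions = [
    {"O": 1.0, "B-PART": 0.8, "I-PART": 0.1},
    {"O": 0.2, "B-PART": 0.5, "I-PART": 2.0},
    {"O": 2.0, "B-PART": 0.1, "I-PART": 0.1},
]

decoded = viterbi_decode(emissions, transitions, labels)
greedy = [max(labels, key=lambda y: e[y]) for e in emissions]
```

Here `greedy` picks the top-scoring label per token independently, which can yield an `I-PART` directly after `O`. `decoded` scores the whole sequence, so the transition penalty steers it to a well-formed tagging. That sequence-level choice is what "optimal label assignment" refers to.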
Pages: 189710-189721
Page count: 12
Related papers
50 in total
  • [31] A Hybrid Named Entity Recognition System for Aviation Text
    Bharathi, A.
    Ramdin, Robin
    Babu, Preeja
    Menon, Vijay Krishna
    Jayaramakrishnan, Chandrasekhar
    Lakshmikumar, Sudarsan
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2024, 11 (01)
  • [32] Arabic Named Entity Recognition: A BERT-BGRU Approach
    Alsaaran, Norah
    Alrabiah, Maha
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (01): 471-485
  • [33] Named Entity Recognition for Drone Forensic Using BERT and DistilBERT
    Silalahi, Swardiantara
    Ahmad, Tohari
    Studiawan, Hudan
    2022 INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ITS APPLICATIONS (ICODSA), 2022: 53-58
  • [34] Named Entity Recognition of Enterprise Annual Report Integrated with BERT
    Zhang J.
    He G.
    Dai Z.
    Liu Y.
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2021, 55 (02): 117-123
  • [35] Named entity recognition of agricultural based entity-level masking BERT and BiLSTM-CRF
    Wei Z.
    Song L.
    Hu X.
    Chen N.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2022, 38 (15): 195-203
  • [36] Named Entity Recognition for Chinese Aviation Security Incident Based on BiLSTM and CRF
    Zhao, Yan
    Liu, Hu
    Chen, Zhen
    2021 2ND ASIA CONFERENCE ON COMPUTERS AND COMMUNICATIONS (ACCC 2021), 2021: 89-94
  • [37] Chinese clinical named entity recognition with variant neural structures based on BERT methods
    Li, Xiangyang
    Zhang, Huan
    Zhou, Xiao-Hua
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 107
  • [38] ABioNER: A BERT-Based Model for Arabic Biomedical Named-Entity Recognition
    Boudjellal, Nada
    Zhang, Huaping
    Khan, Asif
    Ahmad, Arshad
    Naseem, Rashid
    Shang, Jianyun
    Dai, Lin
    COMPLEXITY, 2021, 2021
  • [39] Named Entity Recognition Method for Power Equipment Based on BERT-BiLSTM-CRF
    Hu, Jiangyi
    Yang, Wenqing
    Yang, Huafei
    Wei, Shanming
    Sun, Zhen
    2022 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2022: 694-699
  • [40] A Named Entity Recognition Model for Manufacturing Process Based on the BERT Language Model Scheme
    Shrivastava, Manu
    Seri, Kota
    Wagatsuma, Hiroaki
    SOCIAL ROBOTICS, ICSR 2022, PT I, 2022, 13817: 576-587