Aviation-BERT-NER: Named Entity Recognition for Aviation Safety Reports

被引:0
|
作者
Chandra, Chetan [1 ]
Ojima, Yuga [1 ]
Bendarkar, Mayank V. [1 ]
Mavris, Dimitri N. [1 ]
机构
[1] Georgia Inst Technol, Aerosp Syst Design Lab ASDL, Atlanta, GA 30332 USA
关键词
natural language processing; text mining; aviation safety; NTSB; ASRS; named entity recognition;
D O I
10.3390/aerospace11110890
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
This work introduces Aviation-BERT-NER, a Named Entity Recognition (NER) system tailored for aviation safety reports, building on the Aviation-BERT base model developed at the Georgia Institute of Technology's Aerospace Systems Design Laboratory. This system integrates aviation domain-specific data, including aircraft types, manufacturers, quantities, and aviation terminology, to identify named entities critical for aviation safety analysis. A key innovation of Aviation-BERT-NER is its template-based approach to fine-tuning, which utilizes structured datasets to generate synthetic training data that mirror the complexity of real-world aviation safety reports. This method significantly improves the model's generalizability and adaptability, enabling rapid updates and customization to meet evolving domain-specific requirements. The development process involved careful data preparation, including the synthesis of entity types and the generation of labeled datasets through template filling. Testing on real-world narratives from the National Transportation Safety Board (NTSB) database highlighted Aviation-BERT-NER's robustness, with a precision of 95.34%, recall of 94.62%, and F1 score of 94.78% when evaluated over 50 manually annotated (BIO tagged) paragraphs. This work addresses a critical gap in English language NER models for aviation safety, promising substantial improvements in the analysis and understanding of aviation safety reports.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Named Entity Recognition in Aviation Products Domain Based on BERT
    Yang, Mingye
    Namoano, Bernadin
    Farsi, Maryam
    Erkoyuncu, John Ahmet
    IEEE ACCESS, 2024, 12 : 189710 - 189721
  • [2] A Hybrid Named Entity Recognition System for Aviation Text
    Bharathi, A.
    Ramdin, Robin
    Babu, Preeja
    Menon, Vijay Krishna
    Jayaramakrishnan, Chandrasekhar
    Lakshmikumar, Sudarsan
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2024, 11 (01)
  • [3] Named Entity Recognition (NER) for Nepali
    Maharjan, Gopal
    Bal, Bal Krishna
    Regmi, Santosh
    CREATIVITY IN INTELLIGENT TECHNOLOGIES AND DATA SCIENCE, PT II, 2019, 1084 : 71 - 80
  • [4] Continual learning framework of named entity recognition in aviation assembly domain
    Liu P.-F.
    Qian L.
    Zhao X.-W.
    Tao B.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2023, 57 (06): : 1186 - 1194+1266
  • [5] Named Entity Recognition for Chinese Aviation Security Incident Based on BiLSTM and CRF
    Zhao, Yan
    Liu, Hu
    Chen, Zhen
    2021 2ND ASIA CONFERENCE ON COMPUTERS AND COMMUNICATIONS (ACCC 2021), 2021, : 89 - 94
  • [6] Telugu named entity recognition using bert
    Gorla, SaiKiranmai
    Tangeda, Sai Sharan
    Neti, Lalita Bhanu Murthy
    Malapati, Aruna
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2022, 14 (02) : 127 - 140
  • [7] Telugu named entity recognition using bert
    SaiKiranmai Gorla
    Sai Sharan Tangeda
    Lalita Bhanu Murthy Neti
    Aruna Malapati
    International Journal of Data Science and Analytics, 2022, 14 : 127 - 140
  • [8] Named Entity Recognition Approaches and Their Comparison for Custom NER Model
    Shelar H.
    Kaur G.
    Heda N.
    Agrawal P.
    Science and Technology Libraries, 2020, 39 (03): : 324 - 337
  • [9] KA-NER: Knowledge Augmented Named Entity Recognition
    Nie, Binling
    Li, Chenyang
    Wang, Honglie
    KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: KNOWLEDGE GRAPH EMPOWERS NEW INFRASTRUCTURE CONSTRUCTION, 2021, 1466 : 60 - 75
  • [10] Chinese named entity recognition model based on BERT
    Liu, Hongshuai
    Jun, Ge
    Zheng, Yuanyuan
    2020 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE COMMUNICATION AND NETWORK SECURITY (CSCNS2020), 2021, 336