Pretrained Models with Adversarial Training for Named Entity Recognition in Scientific Text

被引:2
|
作者
Ma, Hangchao [1 ]
Zhang, You [1 ]
Wang, Jin [1 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
scientific text entity recognition; pretrained models; adversarial training;
D O I
10.1109/IALP57159.2022.9961309
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named entity recognition (NER) is an important fundamental task in natural language processing (NLP). This paper describes a method for named entity recognition based on pretrained models and adversarial training in scientific text. The scientific entity recognition task requires the model to identify 7 different scientific term entities. There are some issues with a given dataset, such as imbalanced label classes, excessively long entity bounds, and inconsistent entity labeling. To address these issues, we proposed to use focal loss instead of existing cross-entropy loss. Further, we used one of the common adversarial training methods, i.e., Fast Gradient Method (FGM) to perform semi-supervised NER. The experimental results show that our adversarial training method considerably enhances the performance of the model, and the method used in this study achieves the highest F-1-score of 88.92 %. Moreover, our results also prove that SciBERT is better suited to the task of named entity recognition in scientific text and that the focal loss successfully solves the problem of data imbalance.
引用
收藏
页码:259 / 264
页数:6
相关论文
共 50 条
  • [21] Context-aware Adversarial Training for Name Regularity Bias in Named Entity Recognition
    Ghaddar, Abbas
    Langlais, Philippe
    Rashid, Ahmad
    Rezagholizadeh, Mehdi
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2021, 9 : 586 - 604
  • [22] Creating Training Data for Scientific Named Entity Recognition with Minimal Human Effort
    Tchoua, Roselyne B.
    Ajith, Aswathy
    Hong, Zhi
    Ward, Logan T.
    Chard, Kyle
    Belikov, Alexander
    Audus, Debra J.
    Patel, Shrayesh
    de Pablo, Juan J.
    Foster, Ian T.
    COMPUTATIONAL SCIENCE - ICCS 2019, PT I, 2019, 11536 : 398 - 411
  • [23] Adversarial Active Learning for Named Entity Recognition in Cybersecurity
    Li, Tao
    Hu, Yongjin
    Ju, Ankang
    Hu, Zhuoran
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 66 (01): : 407 - 420
  • [24] GeoNER: Geological Named Entity Recognition with Enriched Domain Pre-Training Model and Adversarial Training
    Ma, Kai
    Hu, Xinxin
    Tian, Miao
    Tan, Yongjian
    Zheng, Shuai
    Tao, Liufeng
    Qiu, Qinjun
    ACTA GEOLOGICA SINICA-ENGLISH EDITION, 2024, 98 (05) : 1404 - 1417
  • [25] Textual adversarial attacks in cybersecurity named entity recognition
    Jiang, Tian
    Liu, Yunqi
    Cui, Xiaohui
    COMPUTERS & SECURITY, 2025, 150
  • [26] Adversarial Named Entity Recognition with POS label embedding
    Bai, Yuxuan
    Wang, Yu
    Xia, Bin
    Li, Yun
    Zhu, Ziye
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [27] One Class per Named Entity: Exploiting Unlabeled Text for Named Entity Recognition
    Wong, Yingchuan
    Ng, Hwee Tou
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 1763 - 1768
  • [28] GeoNER:Geological Named Entity Recognition with Enriched Domain Pre-Training Model and Adversarial Training
    MA Kai
    HU Xinxin
    TIAN Miao
    TAN Yongjian
    ZHENG Shuai
    TAO Liufeng
    QIU Qinjun
    Acta Geologica Sinica(English Edition), 2024, 98 (05) : 1404 - 1417
  • [29] Chinese named entity recognition method for the finance domain based on enhanced features and pretrained language models
    Zhang, Han
    Wang, Xinyu
    Liu, Junxiu
    Zhang, Lei
    Ji, Lixia
    INFORMATION SCIENCES, 2023, 625 : 385 - 400
  • [30] Robustness of Named Entity Recognition Models
    Walkowiak, Pawel
    SYSTEM DEPENDABILITY-THEORY AND APPLICATIONS, DEPCOS-RELCOMEX 2024, 2024, 1026 : 306 - 315