Adversarial training for named entity recognition of rail fault text

Authors
Qu, J. [1]
Su, S. [1, 2]
Li, R. [1]
Wang, G. [3]
Affiliations
[1] Beijing Jiaotong Univ, State Key Lab Traff Control & Safety, Beijing, Peoples R China
[2] Beijing Jiaotong Univ, Frontiers Sci Ctr Smart High Speed Railway Syst, Beijing, Peoples R China
[3] Rutgers State Univ, Dept Comp Sci, Piscataway, NJ 08854 USA
Keywords
Rail fault texts; Named entity recognition; Adversarial training
DOI
10.1109/ITSC48978.2021.9565087
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
At present, most rail faults in metro systems are recorded as text. Because effective mining and analysis tools are lacking, the information in this massive body of textual data is not fully utilized. Learning from past fault texts and identifying their key concepts is essential for analyzing faults and supporting decision making. This paper proposes a word-enhanced adversarial training model (AT-BiLSTM-CRF) to address this problem. In this model, named entity recognition (NER) is performed by a bi-directional long short-term memory (BiLSTM) network with a conditional random field (CRF). In parallel, a Chinese word segmentation (CWS) task is introduced and trained adversarially with the NER task. The adversarial training structure is designed to make full use of the boundary information while filtering out the noise introduced by the CWS task. Experiments are conducted on five different train fault datasets from the rail field. The results show that the model outperforms the state-of-the-art baselines, indicating its potential to lay a foundation for textual data analysis in the rail field.
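To illustrate why a CWS task can supply boundary information to NER, the following minimal sketch labels the same character sequence for both tasks. It is not taken from the paper: the BMES and BIO tag schemes, the example sentence, the entity type `EQUIP`, and the helper names `bmes_tags` and `bio_tags` are all assumptions made for illustration.

```python
# Hypothetical illustration: character-level labels for CWS and NER
# over the same sentence. Tag schemes and data are assumptions.

def bmes_tags(words):
    """Return one BMES segmentation tag per character of the given words."""
    tags = []
    for word in words:
        if len(word) == 1:
            tags.append("S")  # single-character word
        else:
            # Begin, zero or more Middle, End
            tags.extend(["B"] + ["M"] * (len(word) - 2) + ["E"])
    return tags


def bio_tags(n_chars, entities):
    """Return one BIO entity tag per character.

    `entities` holds (start, end, label) spans with `end` exclusive.
    """
    tags = ["O"] * n_chars
    for start, end, label in entities:
        tags[start] = f"B-{label}"
        for i in range(start + 1, end):
            tags[i] = f"I-{label}"
    return tags


# Made-up sentence, roughly "signal fault caused train delay".
words = ["信号机", "故障", "导致", "列车", "晚点"]
entities = [(0, 3, "EQUIP")]  # hypothetical entity type for "信号机"

cws = bmes_tags(words)
ner = bio_tags(sum(len(w) for w in words), entities)

for char, seg, ent in zip("".join(words), cws, ner):
    print(char, seg, ent)
```

In this made-up example the entity span starts on a `B` segmentation tag and ends on an `E` tag, which is the kind of shared boundary signal the abstract refers to. Segmentation tags on non-entity characters carry no entity information, which is one way to read the "noise" the adversarial structure is meant to filter.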
Pages: 1353-1358
Number of pages: 6