A Chinese Named Entity Recognition Model of Maintenance Records for Power Primary Equipment Based on Progressive Multitype Feature Fusion

Cited by: 5
Authors
He, Lanfei [1]
Zhang, Xuefei [1]
Li, Zhiwei [1]
Xiao, Peng [2]
Wei, Ziming [2]
Cheng, Xu [3]
Qu, Shaocheng [2]
Affiliations
[1] Hubei Elect Power Co State Grid, Econ & Tech Res Inst, Wuhan 430000, Peoples R China
[2] Cent China Normal Univ, Coll Phys Sci & Technol, Dept Elect & Informat Engn, Wuhan 430000, Peoples R China
[3] Wuhan Esmorning S&T Co Ltd, Wuhan 430023, Peoples R China
Keywords
CRF;
DOI
10.1155/2022/8114217
Chinese Library Classification number
O1 [Mathematics];
Discipline classification code
0701 ; 070101 ;
Abstract
The State Grid Corporation of China has accumulated a large volume of maintenance records for power primary equipment. Most of these records are unstructured, which makes them difficult to analyze and use. Natural language processing and deep learning methods offer a way to handle such unstructured text. This paper proposes a progressive multitype feature fusion model for Chinese named entity recognition in unstructured maintenance records for power primary equipment. First, the textual characteristics and word segmentation difficulties of maintenance records are analyzed; 7 main entity categories of power technical terms are then selected from the unstructured records, and 3452 maintenance records are labeled with these categories to form the EPE-MR training dataset. Second, the standard test reports, standard maintenance, and fault analysis reports for three types of power primary equipment (main transformer, circuit breaker, and isolating switch) are used as a corpus to train character embeddings, so that the embeddings gain some ability to represent the words used in maintenance records. Next, a progressive multilevel radical feature extraction module is designed to capture fine-grained semantic information hierarchically. The radical feature representation and the character embedding are then concatenated and fed to a BiLSTM module, which extracts contextual information to improve Chinese entity recognition. A CRF layer models the dependencies among predicted labels and outputs the optimal prediction sequence, from which structured data can readily be obtained from the maintenance records. Finally, comparative experiments on the public MSRA dataset, the China People's Daily corpus, and the EPE-MR dataset show the effectiveness of the proposed method.
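The CRF step in the abstract, which outputs the optimal label sequence given dependencies among labels, is commonly realized with Viterbi decoding. The following is a minimal sketch of that decoding step only, not the authors' implementation. The tag names (`O`, `B-EQ`, `I-EQ`), the score values, and the function name `viterbi_decode` are hypothetical and chosen for illustration; in the described model the per-character scores would come from the BiLSTM.

```python
from typing import Dict, List, Tuple


def viterbi_decode(
    emissions: List[Dict[str, float]],
    transitions: Dict[Tuple[str, str], float],
    tags: List[str],
) -> List[str]:
    """Return the highest-scoring tag sequence.

    emissions[t][tag] is the score of `tag` at position t.
    transitions[(prev, cur)] is the score of moving from `prev` to `cur`;
    pairs that are not listed default to 0.0.
    """
    if not emissions:
        return []
    # best[t][tag]: best score of any path that ends in `tag` at position t.
    best = [{tag: emissions[0][tag] for tag in tags}]
    back: List[Dict[str, str]] = []
    for t in range(1, len(emissions)):
        scores: Dict[str, float] = {}
        pointers: Dict[str, str] = {}
        for cur in tags:
            prev = max(
                tags,
                key=lambda p: best[-1][p] + transitions.get((p, cur), 0.0),
            )
            scores[cur] = (
                best[-1][prev]
                + transitions.get((prev, cur), 0.0)
                + emissions[t][cur]
            )
            pointers[cur] = prev
        best.append(scores)
        back.append(pointers)
    # Follow the back-pointers from the best final tag.
    path = [max(tags, key=lambda tag: best[-1][tag])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return path


# Hypothetical toy example: an inside tag may not follow an outside tag.
TAGS = ["O", "B-EQ", "I-EQ"]
TRANSITIONS = {("O", "I-EQ"): -10.0}
EMISSIONS = [
    {"O": 1.0, "B-EQ": 0.8, "I-EQ": 0.0},
    {"O": 0.0, "B-EQ": 0.0, "I-EQ": 1.0},
]
print(viterbi_decode(EMISSIONS, TRANSITIONS, TAGS))
```

Picking the top-scoring tag at each position independently would give `O` followed by `I-EQ` in this toy input. The transition penalty makes the decoder prefer `B-EQ` followed by `I-EQ`, which illustrates why a label-dependency layer is placed after the BiLSTM.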
Pages: 11