Knowledge enhancement BERT based on domain dictionary mask

被引:0
|
作者
Cao, Xianglin [1 ]
Xiao, Hong [1 ]
Jiang, Wenchao [1 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou, Peoples R China
关键词
Intelligent customer service; dictionary mask; BERT; data preprocessing;
D O I
10.3233/JHS-222013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic matching is one of the critical technologies for intelligent customer service. Since Bidirectional Encoder Representations from Transformers (BERT) is proposed, fine-tuning on a large-scale pre-training language model becomes a general method to implement text semantic matching. However, in practical application, the accuracy of the BERT model is limited by the quantity of pre-training corpus and proper nouns in the target domain. An enhancement method for knowledge based on domain dictionary to mask input is proposed to solve the problem. Firstly, for modul input, we use keyword matching to recognize and mask the word in domain. Secondly, using self-supervised learning to inject knowledge of the target domain into the BERT model. Thirdly, we fine-tune the BERT model with public datasets LCQMC and BQboost. Finally, we test the model's performance with a financial company's user data. The experimental results show that after using our method and BQboost, accuracy increases by 12.12% on average in practical applications.
引用
收藏
页码:121 / 128
页数:8
相关论文
共 50 条
  • [21] Common Dictionary and Domain-Specific Dictionary based Cross-Domain Image Classification
    Zhang, Kangkang
    Yuan, Meigui
    Xiong, Youling
    Qu, Lei
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 2824 - 2829
  • [22] Sentiment Classification Algorithm Based on the Cascade of BERT Model and Adaptive Sentiment Dictionary
    Duan, Ruixue
    Huang, Zhuofan
    Zhang, Yangsen
    Liu, Xiulei
    Dang, Yue
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [23] Sentiment Classification Algorithm Based on the Cascade of BERT Model and Adaptive Sentiment Dictionary
    Duan, Ruixue
    Huang, Zhuofan
    Zhang, Yangsen
    Liu, Xiulei
    Dang, Yue
    Wireless Communications and Mobile Computing, 2021, 2021
  • [24] Chinese Named Entity Recognition in the Geoscience Domain Based on BERT
    Lv, Xia
    Xie, Zhong
    Xu, Dexin
    Jin, Xiangguo
    Ma, Kai
    Tao, Liufeng
    Qiu, Qinjun
    Pan, Yongsheng
    EARTH AND SPACE SCIENCE, 2022, 9 (03)
  • [25] Named Entity Recognition in Aviation Products Domain Based on BERT
    Yang, Mingye
    Namoano, Bernadin
    Farsi, Maryam
    Erkoyuncu, John Ahmet
    IEEE ACCESS, 2024, 12 : 189710 - 189721
  • [26] Design of Multimodal Retrieval Model for Translation Domain Based on BERT
    Sheng, Xia
    PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON MACHINE INTELLIGENCE AND DIGITAL APPLICATIONS, MIDA2024, 2024, : 168 - 172
  • [27] Cross-Domain Text Classification Based on BERT Model
    Zhang, Kuan
    Hei, Xinhong
    Fei, Rong
    Guo, Yufan
    Jiao, Rui
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS: DASFAA 2021 INTERNATIONAL WORKSHOPS, 2021, 12680 : 197 - 208
  • [28] A dictionary-based approach to normalizing gene names in one domain of knowledge from the biomedical literature
    Galvez, Carmen
    de Moya-Anegon, Felix
    JOURNAL OF DOCUMENTATION, 2012, 68 (01) : 5 - 30
  • [29] Prostate segmentation by feature enhancement using domain knowledge and adaptive region based operations
    Nanayakkara, ND
    Samarabandu, J
    Fenster, A
    PHYSICS IN MEDICINE AND BIOLOGY, 2006, 51 (07): : 1831 - 1848
  • [30] Open-domain Multi-turn Dialogue Model Based on Knowledge Enhancement
    Xu F.
    Xu J.-M.
    Ma Y.
    Wang M.-W.
    Zhou G.-D.
    Ruan Jian Xue Bao/Journal of Software, 2024, 35 (02): : 758 - 772