Knowledge enhancement BERT based on domain dictionary mask

被引:0
|
作者
Cao, Xianglin [1 ]
Xiao, Hong [1 ]
Jiang, Wenchao [1 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou, Peoples R China
关键词
Intelligent customer service; dictionary mask; BERT; data preprocessing;
D O I
10.3233/JHS-222013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic matching is one of the critical technologies for intelligent customer service. Since Bidirectional Encoder Representations from Transformers (BERT) is proposed, fine-tuning on a large-scale pre-training language model becomes a general method to implement text semantic matching. However, in practical application, the accuracy of the BERT model is limited by the quantity of pre-training corpus and proper nouns in the target domain. An enhancement method for knowledge based on domain dictionary to mask input is proposed to solve the problem. Firstly, for modul input, we use keyword matching to recognize and mask the word in domain. Secondly, using self-supervised learning to inject knowledge of the target domain into the BERT model. Thirdly, we fine-tune the BERT model with public datasets LCQMC and BQboost. Finally, we test the model's performance with a financial company's user data. The experimental results show that after using our method and BQboost, accuracy increases by 12.12% on average in practical applications.
引用
收藏
页码:121 / 128
页数:8
相关论文
共 50 条
  • [31] Supervised single-channel speech enhancement using ratio mask with joint dictionary learning
    Zhang, Long
    Bao, Guangzhao
    Zhang, Jing
    Ye, Zhongfu
    SPEECH COMMUNICATION, 2016, 82 : 38 - 52
  • [32] Entity and relation extraction with rule-guided dictionary as domain knowledge
    Xinzhi Wang
    Jiahao Li
    Ze Zheng
    Yudong Chang
    Min Zhu
    Frontiers of Engineering Management, 2022, 9 : 610 - 622
  • [33] Entity and relation extraction with rule-guided dictionary as domain knowledge
    Xinzhi WANG
    Jiahao LI
    Ze ZHENG
    Yudong CHANG
    Min ZHU
    Frontiers of Engineering Management, 2022, 9 (04) : 610 - 622
  • [34] Entity and relation extraction with rule-guided dictionary as domain knowledge
    Wang, Xinzhi
    Li, Jiahao
    Zheng, Ze
    Chang, Yudong
    Zhu, Min
    FRONTIERS OF ENGINEERING MANAGEMENT, 2022, 9 (04) : 610 - 622
  • [35] A FEATURE DICTIONARY SUPPORTING A MULTI-DOMAIN MEDICAL KNOWLEDGE BASE
    NAEYMIRAD, F
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 1989, 30 (2-3) : 217 - 228
  • [36] Constructing Chinese Historical Literature Knowledge Graph Based on BERT
    Guo, Qingyan
    Sun, Yang
    Liu, Guanzhong
    Wang, Zijun
    Ji, Zijing
    Shen, Yuxin
    Wang, Xin
    WEB INFORMATION SYSTEMS AND APPLICATIONS (WISA 2021), 2021, 12999 : 323 - 334
  • [37] Unsupervised Multitarget Domain Adaptation With Dictionary-Bridged Knowledge Exploitation
    Tian, Qing
    Ma, Chuang
    Cao, Meng
    Wan, Jun
    Lei, Zhen
    Chen, Songcan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3464 - 3477
  • [38] Unsupervised model based image segmentation using domain knowledge based fuzzy logic and edge enhancement
    Nanayakkara, ND
    Samarabandu, J
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 577 - 580
  • [39] Improved Speech Enhancement using a Time-Domain GAN with Mask Learning
    Lin, Ju
    Niu, Sufeng
    van Wijngaarden, Adriaan J.
    McClendon, Jerome L.
    Smith, Melissa C.
    Wang, Kuang-Ching
    INTERSPEECH 2020, 2020, : 3286 - 3290
  • [40] Speech enhancement using sparse dictionary learning in wavelet packet transform domain
    Mavaddaty, Samira
    Ahadi, Seyed Mohammad
    Seyedin, Sanaz
    COMPUTER SPEECH AND LANGUAGE, 2017, 44 : 22 - 47