Application of the transformer model algorithm in chinese word sense disambiguation: a case study in chinese language

被引:1
|
作者
Li, Linlin [1 ]
Li, Juxing [2 ]
Wang, Hongli [3 ]
Nie, Jianing [4 ]
机构
[1] Sichuan Univ, Coll Literature & Journalism, Chengdu 610000, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Journalism & New Media, Xian 710049, Peoples R China
[3] Tiangong Univ, Sch Artificial Intelligence, Tianjin 300000, Peoples R China
[4] Wuhan Text Univ, Coll Int Business & Econ, Sch Art, Wuhan 430000, Peoples R China
关键词
Transformer model algorithm; Chinese language; BiLSTM; Word sense disambiguation; Root mean squared error;
D O I
10.1038/s41598-024-56976-5
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This study aims to explore the research methodology of applying the Transformer model algorithm to Chinese word sense disambiguation, seeking to resolve word sense ambiguity in the Chinese language. The study introduces deep learning and designs a Chinese word sense disambiguation model based on the fusion of the Transformer with the Bi-directional Long Short-Term Memory (BiLSTM) algorithm. By utilizing the self-attention mechanism of Transformer and the sequence modeling capability of BiLSTM, this model efficiently captures semantic information and context relationships in Chinese sentences, leading to accurate word sense disambiguation. The model's evaluation is conducted using the PKU Paraphrase Bank, a Chinese text paraphrase dataset. The results demonstrate that the model achieves a precision rate of 83.71% in Chinese word sense disambiguation, significantly outperforming the Long Short-Term Memory algorithm. Additionally, the root mean squared error of this algorithm is less than 17, with a loss function value remaining around 0.14. Thus, this study validates that the constructed Transformer-fused BiLSTM-based Chinese word sense disambiguation model algorithm exhibits both high accuracy and robustness in identifying word senses in the Chinese language. The findings of this study provide valuable insights for advancing the intelligent development of word senses in Chinese language applications.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] An improved algorithm on word sense disambiguation
    Serban, G
    Tatar, D
    INTELLIGENT INFORMATION PROCESSING AND WEB MINING, 2003, : 199 - 208
  • [32] A trigram statistical language model algorithm for chinese word segmentation
    Mao, Jun
    Cheng, Gang
    He, Yanxiang
    Xing, Zehuan
    FRONTIERS IN ALGORITHMICS, PROCEEDINGS, 2007, 4613 : 271 - +
  • [33] Word sense disambiguation using evolutionary algorithms - Application to Arabic language
    Menai, Mohamed El Bachir
    COMPUTERS IN HUMAN BEHAVIOR, 2014, 41 : 92 - 103
  • [34] The Application of Hidden Markov Model based on Semantic Case Amelioration in Chinese Word Sense Tagging
    Fang, Hao
    Ding, Yimin
    Yang, Min
    SECOND INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING: WGEC 2008, PROCEEDINGS, 2008, : 340 - +
  • [35] Query expansion based on word sense disambiguation in Chinese question answering system
    Jia, Keliang
    Journal of Computational Information Systems, 2010, 6 (01): : 181 - 187
  • [36] Symmetric Trends: Optimal Local Context Window in Chinese Word Sense Disambiguation
    Li, Gang
    Kou, Guangzeng
    Zhou, Ercui
    Zhang, Ling
    HIS 2009: 2009 NINTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, VOL 3, PROCEEDINGS, 2009, : 151 - +
  • [37] A semantics-enhanced language model for unsupervised word sense disambiguation
    Lin, Shou-De
    Verspoor, Karin
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2008, 4919 : 287 - +
  • [38] A Modified Technique for Word Sense Disambiguation Using Lesk Algorithm in Hindi Language
    Sawhney, Radhike
    Kaur, Arvinder
    2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2014, : 2745 - 2749
  • [39] Case Based Interpretation Model for Word Sense Disambiguation in Gurmukhi
    Walia, Himdweep
    Rana, Ajay
    Kansal, Vineet
    2019 9TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2019), 2019, : 359 - 364
  • [40] Harmony Search Algorithm for Word Sense Disambiguation
    Abed, Saad Adnan
    Tiun, Sabrina
    Omar, Nazlia
    PLOS ONE, 2015, 10 (09):