WORD SENSE DISAMBIGUATION: A STRUCTURED LEARNING PERSPECTIVE

被引:0
|
作者
Zhou, Yun [1 ]
Wang, Ting [1 ]
Wang, Zhiyuan [1 ,2 ]
机构
[1] Natl Univ Def Technol, Sch Comp, Changsha 410073, Peoples R China
[2] Natl Univ Def Technol, State Key Lab High Performance Comp, Changsha 410073, Peoples R China
基金
中国国家自然科学基金;
关键词
Word sense disambiguation; structured learning; hidden Markov model; conditional random field; parallelization; approximate training;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper explores the application of structured learning methods (SLMs) to word sense disambiguation (WSD). On one hand, the semantic dependencies between polysemous words in the sentence can be encoded in SLMs. On the other hand, SLMs obtained significant achievements in natural language processing, and so it is a natural idea to apply them to WSD. However, there are many theoretical and practical problems when SLMs are applied to WSD, due to characteristics of WSD. Beginning with the method based on hidden Markov model, this paper proposes for the first time a comprehensive and unified solution for WSD based on maximum entropy Markov model, conditional random field and tree-structured conditional random field, and reduces the time complexity and running time of the proposed methods to a reasonable level by beam search, approximate training, and parallel training. The update of models brings performance improvement, the introduction of one step dependency improves performance by 1-5 percent, the adoption of non-independent features improves performance by 2-3 percent, and the extension of underlying structure to dependency parsing tree improves performance by about 1 percent. On the English all-words WSD dataset of Senseval-2004, the method based on tree-structured conditional random field outperforms the best attendee system significantly. Nevertheless, almost all machine learning methods suffer from data sparseness due to the scarcity of sense tagged data, and so do SLMs. Besides improving structured learning methods according to the characteristics of WSD, another approach to improve disambiguation performance is to mine disambiguation knowledge from all kinds of sources, such as Wikipedia, parallel corpus, and to alleviate knowledge acquisition bottleneck of WSD.
引用
收藏
页码:1257 / 1288
页数:32
相关论文
共 50 条
  • [1] Word sense disambiguation for vocabulary learning
    Kulkarni, Anagha
    Heilman, Michael
    Eskenazi, Maxine
    Callan, Jamie
    [J]. INTELLIGENT TUTORING SYSTEM, PROCEEDINGS, 2008, 5091 : 500 - 509
  • [2] Learning Sense Representation from Word Representation for Unsupervised Word Sense Disambiguation
    Wang, Jie
    Fu, Zhenxin
    Li, Moxin
    Zhang, Haisong
    Zhao, Dongyan
    Yan, Rui
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13947 - 13948
  • [3] Effect of Supervised Sense Disambiguation Model Using Machine Learning Technique and Word Embedding in Word Sense Disambiguation
    Mahajan, Rupesh
    Kokane, Chandrakant
    Pathak, Kishor
    Kodmelwar, Manohar
    Wagh, Kapil
    Bhandari, Mahesh
    [J]. JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (01) : 436 - 443
  • [4] Word sense disambiguation by semi-supervised learning
    Niu, ZY
    Ji, DH
    Tan, CL
    Yang, LP
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 238 - 241
  • [5] Word sense disambiguation of Thai language with unsupervised learning
    Pongpinigpinyo, S
    Rivepiboon, W
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2005, 3681 : 1275 - 1283
  • [6] Assamese Word Sense Disambiguation using Supervised Learning
    Borah, Pranjal Protim
    Talukdar, Gitimoni
    Baruah, Arup
    [J]. 2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 946 - 950
  • [7] Word sense disambiguation by learning from unlabeled data
    Park, SB
    Zhang, BT
    Kim, YT
    [J]. 38TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2000, : 547 - 554
  • [8] Learning rules for large vocabulary word sense disambiguation
    Paliouras, G
    Karkaletsis, V
    Spyropoulos, CD
    [J]. IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, 1999, : 674 - 679
  • [9] Sense Space for Word Sense Disambiguation
    Kang, Myung Yun
    Min, Tae Hong
    Lee, Jae Sung
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 669 - 672
  • [10] Word sense disambiguation based on word sense clustering
    Anaya-Sanchez, Henry
    Pons-Porrata, Aurora
    Berlanga-Llavori, Rafael
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA-SBIA 2006, PROCEEDINGS, 2006, 4140 : 472 - 481