WORD SENSE DISAMBIGUATION: A STRUCTURED LEARNING PERSPECTIVE

被引：0

作者：

Zhou, Yun ^{[1
]}

Wang, Ting ^{[1
]}

Wang, Zhiyuan ^{[1
,2
]}

机构：

[1] Natl Univ Def Technol, Sch Comp, Changsha 410073, Peoples R China

[2] Natl Univ Def Technol, State Key Lab High Performance Comp, Changsha 410073, Peoples R China

来源：

COMPUTING AND INFORMATICS | 2015年 / 34卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Word sense disambiguation; structured learning; hidden Markov model; conditional random field; parallelization; approximate training;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper explores the application of structured learning methods (SLMs) to word sense disambiguation (WSD). On one hand, the semantic dependencies between polysemous words in the sentence can be encoded in SLMs. On the other hand, SLMs obtained significant achievements in natural language processing, and so it is a natural idea to apply them to WSD. However, there are many theoretical and practical problems when SLMs are applied to WSD, due to characteristics of WSD. Beginning with the method based on hidden Markov model, this paper proposes for the first time a comprehensive and unified solution for WSD based on maximum entropy Markov model, conditional random field and tree-structured conditional random field, and reduces the time complexity and running time of the proposed methods to a reasonable level by beam search, approximate training, and parallel training. The update of models brings performance improvement, the introduction of one step dependency improves performance by 1-5 percent, the adoption of non-independent features improves performance by 2-3 percent, and the extension of underlying structure to dependency parsing tree improves performance by about 1 percent. On the English all-words WSD dataset of Senseval-2004, the method based on tree-structured conditional random field outperforms the best attendee system significantly. Nevertheless, almost all machine learning methods suffer from data sparseness due to the scarcity of sense tagged data, and so do SLMs. Besides improving structured learning methods according to the characteristics of WSD, another approach to improve disambiguation performance is to mine disambiguation knowledge from all kinds of sources, such as Wikipedia, parallel corpus, and to alleviate knowledge acquisition bottleneck of WSD.

引用

页码：1257 / 1288

页数：32

共 50 条

[1] Word sense disambiguation for vocabulary learning
Kulkarni, Anagha
Heilman, Michael
Eskenazi, Maxine
Callan, Jamie
[J]. INTELLIGENT TUTORING SYSTEM, PROCEEDINGS, 2008, 5091 : 500 - 509
[2] Learning Sense Representation from Word Representation for Unsupervised Word Sense Disambiguation
Wang, Jie
Fu, Zhenxin
Li, Moxin
Zhang, Haisong
Zhao, Dongyan
Yan, Rui
[J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13947 - 13948
[3] Effect of Supervised Sense Disambiguation Model Using Machine Learning Technique and Word Embedding in Word Sense Disambiguation
Mahajan, Rupesh
Kokane, Chandrakant
Pathak, Kishor
Kodmelwar, Manohar
Wagh, Kapil
Bhandari, Mahesh
[J]. JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (01) : 436 - 443
[4] Word sense disambiguation by semi-supervised learning
Niu, ZY
Ji, DH
Tan, CL
Yang, LP
[J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 238 - 241
[5] Word sense disambiguation of Thai language with unsupervised learning
Pongpinigpinyo, S
Rivepiboon, W
[J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2005, 3681 : 1275 - 1283
[6] Assamese Word Sense Disambiguation using Supervised Learning
Borah, Pranjal Protim
Talukdar, Gitimoni
Baruah, Arup
[J]. 2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 946 - 950
[7] Word sense disambiguation by learning from unlabeled data
Park, SB
Zhang, BT
Kim, YT
[J]. 38TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2000, : 547 - 554
[8] Learning rules for large vocabulary word sense disambiguation
Paliouras, G
Karkaletsis, V
Spyropoulos, CD
[J]. IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, 1999, : 674 - 679
[9] Sense Space for Word Sense Disambiguation
Kang, Myung Yun
Min, Tae Hong
Lee, Jae Sung
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 669 - 672
[10] Word sense disambiguation based on word sense clustering
Anaya-Sanchez, Henry
Pons-Porrata, Aurora
Berlanga-Llavori, Rafael
[J]. ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA-SBIA 2006, PROCEEDINGS, 2006, 4140 : 472 - 481

← 1 2 3 4 5 →