Retraining: The Semi-Supervised Learning of the Word Sense Disambiguation

被引:0
|
作者
Suarez, Armando [1 ]
Palomar, Manuel [1 ]
Rigau, German [2 ]
机构
[1] Univ Alicante, Dep Lenguajes & Sistemas Informat, Aptd 99, E-03080 Alicante, Spain
[2] Euskal Herriko Unibertsitatea, Grupo IXA, Donostia San Sebastian, Spain
来源
关键词
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
This paper presents re-training, a bootstrapping algorithm that automatically acquires semantically annotated data, ensuring high levels of precision. This algorithm uses a corpus-based system of word sense disambiguation that relies on maximum entropy probability models. The re-training method consists of the iterative feeding of training-classification cycles with new and high-confidence examples. The process relies on several filters that ensure the accuracy of the disambiguation by discarding uncertain classifications. This new method is inspired by co-training algorithms, but it makes stronger assumptions on when to assign a label to a linguistic context.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Word sense disambiguation by semi-supervised learning
    Niu, ZY
    Ji, DH
    Tan, CL
    Yang, LP
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 238 - 241
  • [2] Investigating problems of semi-supervised learning for word sense disambiguation
    Le, Anh-Cuong
    Shimazu, Akira
    Nguyen, Le-Minh
    [J]. COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 482 - +
  • [3] Word sense disambiguation based on semi-supervised ensemble learning
    Zhang C.
    Xiong J.
    Gao X.
    [J]. Harbin Gongcheng Daxue Xuebao/Journal of Harbin Engineering University, 2020, 41 (08): : 1216 - 1222
  • [4] Semi-supervised learning integrated with classifier combination for word sense disambiguation
    Le, Anh-Cuong
    Shimazu, Akira
    Huynh, Van-Nam
    Nguyen, Le-Minh
    [J]. COMPUTER SPEECH AND LANGUAGE, 2008, 22 (04): : 330 - 345
  • [5] Semi-supervised approach for Persian word sense disambiguation
    Mahmoodvand, Mohamadreza
    Hourali, Maryam
    [J]. PROCEEDINGS OF THE 2017 7TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2017, : 104 - 110
  • [6] Semi-Supervised Method for Chinese Word Sense Disambiguation
    Zhang C.
    Xu Z.
    Gao X.
    [J]. Xinan Jiaotong Daxue Xuebao/Journal of Southwest Jiaotong University, 2019, 54 (02): : 408 - 414
  • [7] Semi-supervised Learning with Induced Word Senses for State of the Art Word Sense Disambiguation
    Baskaya, Osman
    Jurgens, David
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2016, 55 : 1025 - 1058
  • [8] A Novel Word Sense Disambiguation Algorithm Based on Semi-Supervised Statistical Learning
    Huang, Zhehuang
    Chen, Yidong
    Shi, Xiaodong
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS & STATISTICS, 2013, 43 (13): : 452 - 458
  • [9] Semi-supervised Word Sense Disambiguation Using the Web as Corpus
    Guzman-Cabrera, Rafael
    Rosso, Paolo
    Montes-y-Gomez, Manuel
    Villasenor-Pineda, Luis
    Pinto-Avendano, David
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2009, 5449 : 256 - +
  • [10] Deep analysis of word sense disambiguation via semi-supervised learning and neural word representations
    Duarte, Jose Marcio
    Sousa, Samuel
    Milios, Evangelos
    Berton, Lilian
    [J]. INFORMATION SCIENCES, 2021, 570 : 278 - 297