Word clustering for collocation-based word sense disambiguation

被引:0
|
作者
Jin, Peng [1 ]
Sun, Xu [1 ]
Wu, Yunfang [1 ]
Yu, Shiwen [1 ]
机构
[1] Peking Univ, Inst Computat Linguist, Dept Comp Sci & Technol, Beijing 100871, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The main disadvantage of collocation-based word sense disambiguation is that the recall is low, with relatively high precision. How to improve the recall without decrease the precision? In this paper, we investigate a word-class approach to extend the collocation list which is constructed from the manually sense-tagged corpus. But the word classes are obtained from a larger scale corpus which is not sense tagged. The experiment results have shown that the F-measure is improved to 71% compared to 54% of the baseline system where the word-class is not considered, although the precision decreases slightly. Further study discovers the relationship between the F-measure and the number of word-class trained from the various sizes of corpus.
引用
收藏
页码:267 / +
页数:3
相关论文
共 50 条
  • [1] Word sense disambiguation based on word sense clustering
    Anaya-Sanchez, Henry
    Pons-Porrata, Aurora
    Berlanga-Llavori, Rafael
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA-SBIA 2006, PROCEEDINGS, 2006, 4140 : 472 - 481
  • [2] Unsupervised Word Sense Disambiguation based on Word Embedding and Collocation
    Han, Shangzhuang
    Shirai, Kiyoaki
    [J]. ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 1218 - 1225
  • [3] Collocation analysis for UMLS knowledge-based word sense disambiguation
    Antonio Jimeno-Yepes
    Bridget T Mclnnes
    Alan R Aronson
    [J]. BMC Bioinformatics, 12
  • [4] Collocation analysis for UMLS knowledge-based word sense disambiguation
    Jimeno-Yepes, Antonio
    McInnes, Bridget T.
    Aronson, Alan R.
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [5] A clustering-based Approach for Unsupervised Word Sense Disambiguation
    Martin-Wanton, Tamara
    Berlanga-Llavori, Rafael
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2012, (49): : 49 - 56
  • [6] Cross-Lingual Word Sense Clustering for Sense Disambiguation
    Casteleiro, Joao
    da Silva, Joaquim Ferreira
    Lopes, Gabriel Pereira
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE-BK, 2015, 9273 : 747 - 758
  • [7] Tovel: Distributed Graph Clustering for Word Sense Disambiguation
    Guerrieri, Alessio
    Rahimian, Fatemeh
    Girdzijauskas, Sarunas
    Montresor, Alberto
    [J]. 2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 623 - 630
  • [8] Correlation Based Word Sense Disambiguation
    Agarwal, Madhavi
    Bajpai, Jyoti
    [J]. 2014 SEVENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2014, : 382 - 386
  • [9] WordNet Based Word Sense Disambiguation
    Sieminski, Andrzej
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, PT II: THIRD INTERNATIONAL CONFERENCE, ICCCI 2011, 2011, 6923 : 405 - 414
  • [10] Graph Based Word Sense Disambiguation
    Koppula, Neeraja
    Rani, B. Padmaja
    Rao, Koppula Srinivas
    [J]. PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS, ICCII 2016, 2017, 507 : 665 - 670