Unsupervised translation disambiguation based on web indirect association of bilingual word

被引:1
|
作者
Liu P.-Y. [1 ]
Zhao T.-J. [2 ]
机构
[1] Institute of Computational Linguistics, Peking University
[2] College of Computer Science and Technology, Harbin Institute of Technology
来源
Ruan Jian Xue Bao/Journal of Software | 2010年 / 21卷 / 04期
关键词
Knowledge acquisition; Unsupervised translation disambiguation; Web indirect association; WSD;
D O I
10.3724/SP.J.1001.2010.03574
中图分类号
学科分类号
摘要
To solve the problems of data sparseness and knowledge acquisition in translation disambiguation and WSD (word sense disambiguation), this paper introduces a fully unsupervised method, which is based on Web mining and Web indirect association of bilingual words. It provides new knowledge of translation disambiguation. It assumes that word sense can be determined by indirect association of bilingual words. Based on Web, this paper revises four common methods of indirect association, and designs three decision methods. These methods are evaluated on a gold standard Multilingual Chinese English Lexical Sample Task dataset of SemEval- 2007. The experimental results show that the model gets the state-of-the-art results (Pmar=44.4%) and outperforms the best system in SemEval-2007. © by Institute of Software, the Chinese Academy of Sciences. All rights reserved.
引用
收藏
页码:575 / 585
页数:10
相关论文
共 32 条
  • [1] Edmonds P., Cotton S., Senseval-2: Overview, Proc. of the 2nd Int'l Workshop on Evaluating Word Sense Disambiguation Systems, pp. 1-5, (2001)
  • [2] Mihalcea R., Edmonds P., Senseval-3, Proc. of the 3rd Int'l Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Association for Computational Linguistics Conf. (ACL 2004), pp. 1-17, (2004)
  • [3] Jin P., Wu Y.F., Yu S.W., SemEval-2007 task 5: Multilingual Chinese-English lexical sample, Proc. of the 4th Int'l Workshop on Semantic Evaluations (SemEval-2007), pp. 19-23, (2007)
  • [4] Yuret D., KU: Word sense disambiguation by substitution, Proc. of the 4th Int'l Workshop on Semantic Evaluations (SemEval-2007), pp. 207-214, (2007)
  • [5] Veronis J., A study of polysemy judgements and Inter-annotator agreement, Proc. of the Programme and Advanced Papers of the Senseval Workshop, (1998)
  • [6] Ng H.T., Lim C.Y., Foo S.K., A case study on inter-annotator agreement for word sense disambiguation, Proc. of the Siglex-ACL Workshop on Standardizing Lexical Resources, pp. 9-13, (1999)
  • [7] Li H., Li C., Word translation disambiguation using bilingual bootstrapping, Computational Linguistics, 20, 4, pp. 563-596, (2004)
  • [8] Yarowsky D., Decision lists for lexical ambiguity resolution: Application to accent restoration in Spanish and French, Proc. of the 32nd Annual Meeting of the Association for Computational Linguistics, pp. 88-95, (1994)
  • [9] Niu Z.Y., Ji D.H., Tan C.L., Pakhomov S., Word sense disambiguation using label propagation based semi-supervised learning, Proc. of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL), pp. 395-402, (2005)
  • [10] Gale W.A., Church K.W., Yarowsky D., Using bilingual materials to develop word sense disambiguation methods, Proc. of the Int'l Conf. on Theoretical and Methodological Issues in Machine Translation, pp. 101-112, (1992)