Sense Prediction Study: Two corpus-driven linguistic Approaches

被引:0
|
作者
Hong, Jia-Fei [1 ]
Ker, Sue-Jin [2 ]
Ahrens, Kathleen [1 ,5 ]
Huang, Chu-Ren [3 ,4 ]
机构
[1] Natl Taiwan Univ, Grad Inst Linguist, Taipei, Taiwan
[2] Soochow Univ, Dept Comp Sci & Informat Management, Taipei, Taiwan
[3] Acad Sinica, Inst Linguist, Taipei, Taiwan
[4] Hong Kong Polytech Univ, Fac Humanities, Hong Kong, Hong Kong, Peoples R China
[5] Hong Kong Baptist Univ, Language Ctr, Hong Kong, Hong Kong, Peoples R China
关键词
Lexical ambiguity; Sense prediction; Corpus-based approach; Character similarity clustering approach; Concept similarity clustering approach; Evaluation; DISAMBIGUATION;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this study, we propose to use two corpus-driven linguistic approaches for a sense prediction study. We will concentrate on the character similarity clustering approach and concept similarity clustering approach to predict the senses of non-assigned words by using corpora and tools, such as Chinese Gigaword Corpus, and HowNet.. In this study, we would then like to evaluate their predictions via the sense divisions of Chinese Wordnet (CWN) and Xiandai Hanyu Cidian (Xian Han). Using these corpora, we will determine their clusters of our four target words - chi I "eat", wan2 "play", huan4 "change" and shao I "burn" in order to predict their all possible senses and evaluate them. This requirement will demonstrate the visibility of the corpus-based approaches.
引用
收藏
页码:236 / 243
页数:8
相关论文
共 9 条
  • [1] [Anonymous], LANGUAGE LINGUISTICS
  • [2] Chen Hao, 2005, COMPUTATIONAL LINGUI, V10, P473
  • [3] Measuring Semantic Similarity between Words Using HowNet
    Dai, Liuling
    Liu, Bin
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, 2008, : 601 - +
  • [4] Jin P, 2007, LECT NOTES COMPUT SC, V4394, P267
  • [5] Li Wanyin, 2005, COMPUT LINGUIST, V10, P123
  • [6] Liu Qun, 2002, P 3 C CHIN LEX TAIP
  • [7] Word sense disambiguation of WordNet glosses
    Moldovan, D
    Novischi, A
    [J]. COMPUTER SPEECH AND LANGUAGE, 2004, 18 (03): : 301 - 317
  • [8] Xue Nianwen, 2006, P COLING ACL 2006 MA, P921
  • [9] [No title captured]