The Role Collocation in Corpus Word Sense Annotation

被引:0
|
作者
Liu, Jing [1 ]
Yang, Li-jiao [1 ]
Liu, Zhi-ying [1 ]
机构
[1] Beijing Normal Univ, Inst Chinese Informat Proc, Beijing 100875, Peoples R China
关键词
Word sense annotation; Corpus; Collocation;
D O I
暂无
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
Annotating words' senses in a corpus plays an important role in discriminating the senses of polysemants. When sense annotation based on dictionary there will be two obstacles: The first one is the ambiguous of dictionary sense distinctions, the second one is that senses in dictionary cannot match all usages of polysemants in corpus. Basing on the sentiment annotations of International Chinese Education Dynamic Corpus, this paper designs two kinds of collocation: lexical collocation and grammatical collocation and brings up that grammatical collocation can help finding the senses which aren't in dictionary, while lexical collocation can be used to solve annotation disagreements. The result of our experiment confirms the hypothesis and shows that those two collocations can be used to improve the accuracy of word sense annotation.
引用
下载
收藏
页码:100 / 104
页数:5
相关论文
共 50 条
  • [21] Word Alignment Annotation in a Japanese-Chinese Parallel Corpus
    Zhang, Yujie
    Wang, Zhulong
    Uchimoto, Kiyotaka
    Ma, Qing
    Isahara, Hitoshi
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1025 - 1029
  • [22] Word-based Partial Annotation for Efficient Corpus Construction
    Neubig, Graham
    Mori, Shinsuke
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 2723 - 2727
  • [23] Topic Modeling and Word Sense Disambiguation on the Ancora corpus
    Izquierdo, Ruben
    Postma, Marten
    Vossen, Piek
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2015, (55): : 15 - 22
  • [24] A Sense Annotated Corpus for All-Words Urdu Word Sense Disambiguation
    Saeed, Ali
    Nawab, Rao Muhammad Adeel
    Stevenson, Mark
    Rayson, Paul
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2019, 18 (04)
  • [25] Building Sense Tagged Corpus Using Wikipedia for Supervised Word Sense Disambiguation
    Saif, Abdulgabbar
    Omar, Nazlia
    Zainodin, Ummi Zakiah
    Ab Aziz, Mohd Juziaddin
    8TH ANNUAL INTERNATIONAL CONFERENCE ON BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES, BICA 2017 (EIGHTH ANNUAL MEETING OF THE BICA SOCIETY), 2018, 123 : 403 - 412
  • [26] Collocation analysis for UMLS knowledge-based word sense disambiguation
    Antonio Jimeno-Yepes
    Bridget T Mclnnes
    Alan R Aronson
    BMC Bioinformatics, 12
  • [27] Collocation analysis for UMLS knowledge-based word sense disambiguation
    Jimeno-Yepes, Antonio
    McInnes, Bridget T.
    Aronson, Alan R.
    BMC BIOINFORMATICS, 2011, 12
  • [28] Building a Pediatric Medical Corpus: Word Segmentation and Named Entity Annotation
    Zan Hongying
    Li Wenxin
    Zhang Kunli
    Ye Yajuan
    Chang Baobao
    Sui Zhifang
    CHINESE LEXICAL SEMANTICS (CLSW 2020), 2021, 12278 : 652 - 664
  • [29] Word sense disambiguation using a second language monolingual corpus
    Dagan, Ido
    Itai, Alon
    Computational Linguistics, 1994, 20 (04)
  • [30] Corpus-based ontology learning for word sense disambiguation
    Kang, SJ
    PACLIC 17: Language, Information and Computation, Proceedings, 2003, : 399 - 407