A Multi-domain Corpus of Swedish Word Sense Annotation

被引:0
|
作者
Johansson, Richard [1 ,2 ]
Adesam, Yvonne [2 ]
Bouma, Gerlof [2 ]
Hedberg, Karin [2 ]
机构
[1] Univ Gothenburg, Dept Comp Sci & Engn, Gothenburg, Sweden
[2] Univ Gothenburg, Sprakbanken, Gothenburg, Sweden
基金
瑞典研究理事会;
关键词
word sense; annotation; Swedish; AGREEMENT;
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
We describe the word sense annotation layer in Eukalyptus, a freely available five-domain corpus of contemporary Swedish with several annotation layers. The annotation uses the SALDO lexicon to define the sense inventory, and allows word sense annotation of compound segments and multiword units. We give an overview of the new annotation tool developed for this project, and finally present an analysis of the inter-annotator agreement between two annotators.
引用
下载
收藏
页码:3019 / 3022
页数:4
相关论文
共 50 条
  • [1] Word Sense Annotation Based on Corpus
    Bai, Linnan
    Yang, Lijiao
    Liu, Zhiying
    2013 ASIAN CONFERENCE ON THE SOCIAL SCIENCES (ACSS 2013), PT 2, 2013, 4 : 137 - 142
  • [2] A Chinese corpus with word sense annotation
    Wu, Yunfang
    Jin, Peng
    Zhang, Yangsen
    Yu, Shimen
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 414 - +
  • [3] The Role Collocation in Corpus Word Sense Annotation
    Liu, Jing
    Yang, Li-jiao
    Liu, Zhi-ying
    INTERNATIONAL CONFERENCE ON HUMANITY AND SOCIAL SCIENCE (ICHSS 2014), 2014, : 100 - 104
  • [4] Mitigating Vocabulary Mismatch on Multi-domain Corpus using Word Embeddings and Thesaurus
    Yadav, Nagesh
    Dibari, Alessandro
    Wei, Miao
    Segrave-Daly, John
    Cullen, Conor
    Moga, Denisa
    Scalvini, Jillian
    Hennessy, Ciaran
    Kristiansen, Morten
    O'Sullivan, Omar
    ICAART: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2020, : 441 - 445
  • [5] Neural Paraphrase Generation with Multi-domain Corpus
    Qiao, Lin
    Li, Yida
    Zhong, ChenLi
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT I, 2021, 12891 : 54 - 66
  • [6] The Word Sense Annotation Based on Corpus of Teaching Chinese as a Second Language
    Wang, Jing
    Liao, Mingfu
    Hu, Renfen
    Wang, Shuanghong
    CHINESE LEXICAL SEMANTICS (CLSW 2015), 2015, 9332 : 234 - 243
  • [7] Automatic Word Sense Disambiguation and Construction Identification Based on Corpus Multilevel Annotation
    Lyashevskaya, Olga
    Mitrofanova, Olga
    Grachkova, Maria
    Romanov, Sergey
    Shimorina, Anastasia
    Shurygina, Alexandra
    TEXT, SPEECH AND DIALOGUE, TSD 2011, 2011, 6836 : 80 - 90
  • [8] Annotation transfer for genomics: Measuring functional divergence in multi-domain proteins
    Hegyi, H
    Gerstein, M
    GENOME RESEARCH, 2001, 11 (10) : 1632 - 1640
  • [9] Domain Adaptation in Multilingual and Multi-Domain Monolingual Settings for Complex Word Identification
    Zaharia, George-Eduard
    Smadu, Razvan-Alexandru
    Cercel, Dumitru-Clementin
    Dascalu, Mihai
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 70 - 80
  • [10] A Neural Word Embeddings Approach for Multi-Domain Sentiment Analysis
    Dragoni, Mauro
    Petrucci, Giulio
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2017, 8 (04) : 457 - 470