A Multi-domain Corpus of Swedish Word Sense Annotation

被引:0
|
作者
Johansson, Richard [1 ,2 ]
Adesam, Yvonne [2 ]
Bouma, Gerlof [2 ]
Hedberg, Karin [2 ]
机构
[1] Univ Gothenburg, Dept Comp Sci & Engn, Gothenburg, Sweden
[2] Univ Gothenburg, Sprakbanken, Gothenburg, Sweden
基金
瑞典研究理事会;
关键词
word sense; annotation; Swedish; AGREEMENT;
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
We describe the word sense annotation layer in Eukalyptus, a freely available five-domain corpus of contemporary Swedish with several annotation layers. The annotation uses the SALDO lexicon to define the sense inventory, and allows word sense annotation of compound segments and multiword units. We give an overview of the new annotation tool developed for this project, and finally present an analysis of the inter-annotator agreement between two annotators.
引用
下载
收藏
页码:3019 / 3022
页数:4
相关论文
共 50 条
  • [21] WENETSPEECH: A 10000+HOURS MULTI-DOMAIN MANDARIN CORPUS FOR SPEECH RECOGNITION
    Zhang, Binbin
    Lv, Hang
    Guo, Pengcheng
    Shao, Qijie
    Yang, Chao
    Xie, Lei
    Xu, Xin
    Bu, Hui
    Chen, Xiaoyu
    Zeng, Chenchen
    Wu, Di
    Peng, Zhendong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6182 - 6186
  • [22] Adaptation of Multi-domain Corpus Learned Seeds and Polarity Lexicon for Sentiment Analysis
    Sanagar, Swati
    Gupta, Deepa
    2015 INTERNATIONAL CONFERENCE ON COMPUTING AND NETWORK COMMUNICATIONS (COCONET), 2015, : 50 - 58
  • [23] Knowledge Graph Extension for Word Sense Annotation
    Simov, Kiril
    Popov, Alexander
    Osenova, Petya
    INNOVATIVE APPROACHES AND SOLUTIONS IN ADVANCED INTELLIGENT SYSTEMS, 2016, 648 : 151 - 166
  • [24] Multi-domain clinical natural language processing with MedCAT: The Medical Concept Annotation Toolkit
    Kraljevic, Zeljko
    Searle, Thomas
    Shek, Anthony
    Roguski, Lukasz
    Noor, Kawsar
    Bean, Daniel
    Mascio, Aurelie
    Zhu, Leilei
    Folarin, Amos A.
    Roberts, Angus
    Bendayan, Rebecca
    Richardson, Mark P.
    Stewart, Robert
    Shah, Anoop D.
    Wong, Wai Keong
    Ibrahim, Zina
    Teo, James T.
    Dobson, Richard J. B.
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2021, 117
  • [25] Exploring Discriminative Word-Level Domain Contexts for Multi-Domain Neural Machine Translation
    Su, Jinsong
    Zeng, Jiali
    Xie, Jun
    Wen, Huating
    Yin, Yongjing
    Liu, Yang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) : 1530 - 1545
  • [26] Word-Based Domain Feature-Sensitive Multi-domain Neural Machine Translation
    Huang Z.
    Man Z.
    Zhang Y.
    Xu J.
    Chen Y.
    Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2023, 59 (01): : 1 - 10
  • [27] Verb Sense Annotation in News Texts in the CSTNews Corpus
    Sobrevilla Cabezudo, Marco Antonio
    Maziero, Erick Galani
    da Cruz Souza, Jackson Wilke
    Dias, Myrcio de Souza
    Figueira Cardoso, Paula Christina
    Balage Filho, Pedro Paulo
    Agostini, Veronica
    Asevedo Nobrega, Fernando Antonio
    de Barros, Claudia Dias
    Di Felippo, Ariani
    Salgueiro Pardo, Thiago Alexandre
    REVISTA DE ESTUDOS DA LINGUAGEM, 2015, 23 (03) : 797 - 832
  • [28] Adaptive word sense tagging on Chinese corpus
    Ker, Sue-Jin
    Chen, Jen-Nan
    PACLIC 18: Proceedings of the 18th Pacific Asia Conference on Language, Information and Computation, 2004, : 267 - 273
  • [29] AspectEmo: Multi-Domain Corpus of Consumer Reviews for Aspect-Based Sentiment Analysis
    Kocon, Jan
    Radom, Jarema
    Kaczmarz-Wawryk, Ewa
    Wabnic, Kamil
    Zajaczkowska, Ada
    Zasko-Zielinska, Monika
    21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 166 - 173
  • [30] rHDP: An Aspect Sharing-Enhanced Hierarchical Topic Model for Multi-Domain Corpus
    Zhang, Yitao
    Wan, Changxuan
    Xiao, Keli
    Wan, Qizhi
    Liu, Dexi
    Liu, Xiping
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (03)