Interannotator Agreement for Lexico-Semantic Annotation of a Corpus

Author
Hajnicz, Elzbieta [1]
Affiliation
[1] Polish Acad Sci, Inst Comp Sci, Ul Jana Kazimierza 5, PL-01248 Warsaw, Poland
Keywords
corpus; lexico-semantic annotation; interannotator agreement; Polish
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
This paper examines the procedure for lexico-semantic annotation of the Basic Corpus of Polish Metaphors, which is the first step towards annotating the metaphoric expressions occurring in it. The procedure involves correcting the morphosyntactic annotation of the part of the corpus that was annotated automatically at the morphosyntactic level. The main procedure concerns the annotation of adjectives, adverbs, nouns and verbs (including gerunds and participles), as well as abbreviations of words belonging to these classes. It is composed of three steps: deciding whether a particular occurrence of a word is asemantic (e.g. anaphoric or strictly grammatical), deciding whether we are dealing with a multi-word expression, and treating reciprocal usages of the się marker and pluralia tantum, which may involve annotating a single token with two lexical units (having two different lemmas). We propose an interannotator agreement statistic adequate for this procedure. Finally, we discuss the preliminary results of annotating a fragment of the corpus.
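
The abstract does not define the agreement statistic the paper proposes. For orientation only, the following is a minimal sketch of Cohen's kappa, a standard chance-corrected agreement measure for two annotators; it is not the author's statistic. The function name `cohen_kappa` and the label strings (`asemantic`, `mwe`, `lu:...`) are hypothetical and chosen purely for illustration.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators who labelled the same tokens.

    Standard textbook measure, shown as a baseline only; it is NOT the
    statistic proposed in the paper.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("both annotators must label the same non-empty token list")
    n = len(labels_a)
    # Observed agreement: share of tokens given identical labels.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Expected agreement by chance, from each annotator's label distribution.
    freq_a = Counter(labels_a)
    freq_b = Counter(labels_b)
    expected = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    if expected == 1.0:
        # Both annotators used one and the same label throughout; kappa is
        # undefined here, and returning 1.0 is a convention assumed for this sketch.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical annotations of six tokens (illustrative labels, not corpus data).
annotator_1 = ["asemantic", "lu:zamek-1", "lu:zamek-2", "mwe", "lu:zamek-1", "asemantic"]
annotator_2 = ["asemantic", "lu:zamek-1", "lu:zamek-1", "mwe", "lu:zamek-1", "lu:zamek-2"]
kappa = cohen_kappa(annotator_1, annotator_2)
print(round(kappa, 3))  # 0.538
```

A plain kappa of this kind assumes exactly one label per token, so it would presumably need adapting for tokens annotated with two lexical units, which is the kind of case the abstract singles out.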
Pages: 1842-1848
Number of pages: 7