The annotation of the modal particles in the GeWiss corpus A syntactic and semantic-pragmatic analysis of the PTKMA annotation

被引:0
|
作者
Storo, Sven Robert [1 ]
机构
[1] Tech Nat Wissensch Univ Norwegens, NTNU, Inst Sprache & Literatur, N-7491 Trondheim, Norway
来源
DEUTSCHE SPRACHE | 2022年 / 50卷 / 02期
关键词
Annotation; POS-Tagging; Modalpartikeln; PTKMA; Gesprochene Wissenschaftssprache; GeWiss-Korpus; Prufungsgesprache;
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
This article examines the automatic annotation of eight German modal particles in two sub-corpora of the GeWiss Corpus, which originate from the oral examinations of L1 and L2 examinees in a German academic context. Because of the poor reliability of automatic methods for POS tagging of spoken language data, the modal particles were checked manually for correctness using lists of criteria. In addition, the linguistic units ja, eben, halt, einfach, aber, mal, doch and denn which did not have a modal particle annotation, but had been automatically annotated with a different POS tag, were also checked for incorrect annotations, their modal particle properties were examined and the uses of these as modal particles were annotated. The results show that the POS tagging system has a very high error rate of 19,2% in the automatic annotations of the above-mentioned modal particles, and that it annotates the particles with widely varying reliability, ranging from 100% incorrect to 100% correct. Checking the non-PTKMA (modal and modulating particles) types ja, eben, halt, einfach, aber, mal, doch and denn for MP properties shows that several tokens exhibited this property.
引用
收藏
页码:124 / 149
页数:26
相关论文
共 50 条
  • [31] Annotation of a Corpus of Tweets for Sentiment Analysis
    dos Santos, Allisfrank
    Barros Junior, Jorge Daniel
    Camargo, Heloisa de Arruda
    [J]. COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2018, 2018, 11122 : 294 - 302
  • [32] Interrogative sentences in contradictory function - A semantic-pragmatic analysis
    Smirnova, M
    [J]. DEUTSCHE SPRACHE, 2001, 29 (01): : 46 - 62
  • [33] Computational Methods for Corpus Annotation and Analysis
    Lei, Lei
    [J]. INTERNATIONAL JOURNAL OF CORPUS LINGUISTICS, 2016, 21 (01) : 133 - 138
  • [34] Looking behind the scenes of syntactic dependency corpus annotation: Towards a motivated annotation schema of surface-syntax in Spanish
    Burga, Alicia
    Mille, Simon
    Wanner, Leo
    [J]. Frontiers in Artificial Intelligence and Applications, 2013, 258 : 26 - 46
  • [35] A Scalable Architecture for Cross-Modal Semantic Annotation and Retrieval
    Moeller, Manuel
    Sintek, Michael
    [J]. KI 2008: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2008, 5243 : 391 - 392
  • [36] Explicit Fine Grained Syntactic and Semantic Annotation of the Idafa Construction in Arabic
    Hawwari, Abdelati
    Attia, Mohammed
    Ghoneim, Mahmoud
    Diab, Mona
    [J]. LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 3569 - 3577
  • [37] TOWARD THE MORPHO-SYNTACTIC ANNOTATION OF AN OLD ENGLISH CORPUS WITH UNIVERSAL DEPENDENCIES
    Arista, Javier Martin
    [J]. REVISTA DE LINGUISTICA Y LENGUAS APLICADAS, 2022, 17 : 85 - 97
  • [38] When CORDIAL Becomes Friendly: Endowing the CORDIAL Corpus with a Syntactic Annotation Layer
    Magro, Catarina
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 3705 - 3711
  • [39] Verbs of opinion among the parenthetical verbs and weak verbs direction: syntactic and semantic-pragmatic aspects
    Gonzalez Ruiz, Ramon
    [J]. CIRCULO DE LINGUISTICA APLICADA A LA COMUNICACION, 2015, (62): : 148 - 173
  • [40] UNSUPERVISED SPORTS VIDEO PARTICLES ANNOTATION BASED ON SOCIAL LATENT SEMANTIC ANALYSIS
    Ntalianis, Klimis
    Tsapatsoulis, Nicolas
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 222 - 226