The annotation of the modal particles in the GeWiss corpus: A syntactic and semantic-pragmatic analysis of the PTKMA annotation

Author
Storo, Sven Robert [1]
Affiliation
[1] Tech Nat Wissensch Univ Norwegens, NTNU, Inst Sprache & Literatur, N-7491 Trondheim, Norway
Source
DEUTSCHE SPRACHE | 2022 / Vol. 50 / Issue 02
Keywords
Annotation; POS-Tagging; Modalpartikeln; PTKMA; Gesprochene Wissenschaftssprache; GeWiss-Korpus; Prüfungsgespräche
DOI
Not available
Chinese Library Classification (CLC) number
H [Language, Writing]
Discipline classification code
05
Abstract
This article examines the automatic annotation of eight German modal particles in two sub-corpora of the GeWiss corpus, which consist of oral examinations of L1 and L2 examinees in a German academic context. Because automatic POS tagging of spoken language data is unreliable, the modal particle annotations were checked manually for correctness against lists of criteria. In addition, occurrences of ja, eben, halt, einfach, aber, mal, doch and denn that had been automatically annotated with a POS tag other than the modal particle tag were checked for incorrect annotations; their modal particle properties were examined, and their uses as modal particles were annotated. The results show that the POS tagging system has a high error rate of 19.2% in its automatic annotation of these modal particles, and that its reliability varies widely from particle to particle, ranging from 100% incorrect to 100% correct. Checking the occurrences of ja, eben, halt, einfach, aber, mal, doch and denn not tagged as PTKMA (modal and modulating particles) for modal particle (MP) properties shows that several tokens exhibit these properties.
Pages: 124-149
Page count: 26