Syntactic annotation of spoken French: ORFEO's choices

Authors
Kahane, Sylvain [1, 2]
Gerdes, Kim [3, 4]
Affiliations
[1] Univ Paris Nanterre, Nanterre, France
[2] Modyco CNRS UMR 7114, Nanterre, France
[3] Univ Sorbonne Nouvelle, Paris, France
[4] LPP CNRS UMR 7018, Paris, France
Keywords
dependency treebank; annotated corpus; spoken language; list; macrosyntax
DOI
Not available
Chinese Library Classification
H0 [Linguistics]
Discipline classification codes
030303; 0501; 050102
Abstract
This article presents the syntactic annotation choices made for the ORFEO project. A corpus of spoken French of more than 180,000 words was manually annotated in dependency syntax, and a 3-million-word corpus was then parsed automatically. The annotation choices are compared with those of the Rhapsodie project, which preceded ORFEO; with Universal Dependencies (UD), which started shortly after ORFEO; and with Surface-Syntactic UD (SUD), which synthesizes the choices of ORFEO and UD. ORFEO is characterized by its attention to macrosyntax and list phenomena, and by a restricted tag set that allowed quick and more easily reproducible annotation.
Pages: 69 / +
Page count: 19