Reflexive pronouns in Spanish Universal Dependencies: from annotation to automatic morphosyntactic analysis

被引:0
|
作者
Degraeuwe, Jasper [1 ]
Goethals, Patrick [1 ]
机构
[1] Univ Ghent, Ghent, Belgium
来源
关键词
reflexive pronouns; se; Universal Dependencies; morphosyntactic tagging and parsing;
D O I
10.26342/2022-69-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this follow-up article of Degraeuwe and Goethals (2020), we present the annotation scheme used to reannotate the 7298 potentially reflexive pronouns included in the Universal Dependencies Spanish AnCora v2.6 treebank, which resulted in significant modifications for the "Case" feature (100% changed) and dependency relations (87% changed). Next, we evaluate the performance of spaCy v3.2.2 and Stanza v1.3.0 (both trained on AnCora v2.8, and thus based on our reannotations) on the AnCora v2.8 test set, which yielded weighted F1 scores up to 0.88 and 0.98 for the "Case" and "Reflex" features, respectively, and up to 0.71 for the dependency relations. Finally, the error analysis of the spaCy results underlines the (generalisation) potential of the model, but also reveals some of the remaining issues in the automatic morphosyntactic analysis of reflexive pronouns in Spanish, such as determining if expletive relations denote an impersonal, passive or inherently reflexive use.
引用
收藏
页码:63 / 72
页数:10
相关论文
共 9 条
  • [1] Reflexive pronouns in Spanish Universal Dependencies
    Degraeuwe, Jasper
    Goethals, Patrick
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2020, (64): : 77 - 84
  • [2] Morphosyntactic variation in dialects of Spanish: an analysis from the diacronia
    Buenafuentes de la Mata, Cristina
    [J]. ZEITSCHRIFT FUR ROMANISCHE PHILOLOGIE, 2015, 131 (02): : 383 - 407
  • [3] Contrastive analysis of the infinitive in Spanish and French from a morphosyntactic perspective
    Quintero Ramirez, Sara
    [J]. LENGUA Y HABLA, 2012, 16 : 150 - 171
  • [5] Automatic analysis of global spinal alignment from simple annotation of vertebral bodies
    Doerr, Sophia A.
    De Silva, Tharindu
    Vijayan, Rohan
    Han, Runze
    Uneri, Ali
    Ketcha, Michael D.
    Zhang, Xiaoxuan
    Khanna, Nishanth
    Westbroek, Erick
    Jiang, Bowen
    Zygourakis, Corinna
    Aygun, Nafi
    Theodore, Nicholas
    Siewerdsen, Jeffrey H.
    [J]. JOURNAL OF MEDICAL IMAGING, 2020, 7 (03)
  • [7] Automatic Analysis of Emotions from the Voices/Speech in Spanish TV Debates
    deVelasco, Mikel
    Justo, Raquel
    Lopez Zorrilla, Asier
    Ines Torres, M.
    [J]. ACTA POLYTECHNICA HUNGARICA, 2022, 19 (05) : 149 - 171
  • [8] Linguistic contact mapuzugun/Spanish. Linguistic, historical and social aspects. Bibliographic review and analysis proposal from the morphosyntactic and typological dimensions
    Olate Vinet, Aldo
    [J]. ONOMAZEIN, 2017, (36): : 122 - 158
  • [9] Automatic detection of hypernasal speech of children with cleft lip and palate from spanish vowels and words using classical measures and nonlinear analysis
    Rafael Orozco-Arroyave, Juan
    Francisco Vargas-Bonilla, Jesus
    Camilo Vasquez-Correa, Juan
    German Castellanos-Dominguez, Cesar
    Noth, Elmar
    [J]. REVISTA FACULTAD DE INGENIERIA-UNIVERSIDAD DE ANTIOQUIA, 2016, (80): : 109 - 123