Building Corpora for the Development of a Dependency Parser for Spanish Using Maltparser

Citations: 0
Authors
Herrera, Jesus [1]
Gervas, Pablo [2]
Moriano, Pedro J. [2]
Munoz, Alfonso [2]
Romero, Luis [2]
Affiliations
[1] Univ Nacl Educ Distancia, Dept Lenguajes & Sistemas Informat, C Juan del Rosal 16, E-28040 Madrid, Spain
[2] Univ Complutense Madrid, Dept Ingn Software & Inteligencia Artificial, E-28040 Madrid, Spain
Source
Keywords
Dependency parsing; training corpus; syntactic function label; Maltparser; JBeaver
DOI
Not available
Chinese Library Classification (CLC) number
H0 [Linguistics]
Discipline classification codes
030303; 0501; 050102
Abstract
This paper details the process followed to create training and test corpora for a dependency parser generator (Maltparser). The starting point is the Cast3LB corpus, which contains constituency analyses of Spanish texts. These constituency analyses are automatically transformed into dependency analyses. The paper also describes how a set of syntactic function labels for the training corpus was obtained empirically and semiautomatically. The result of this process is a dependency parser for Spanish that achieves 91% precision when determining dependencies.
Pages: 181-186
Page count: 6
Related papers
50 in total
  • [21] Building ASR corpora using Eyra
    Gudnason, Jon
    Petursson, Matthias
    Kjaran, Robert
    Klupfel, Simon
    Nikulasdottir, Anna Bjork
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017 : 2173 - 2177
  • [22] Towards a Dependency Parser for Greek Using a Small Training Data Set
    Herrera, Jesus
    Gervas, Pablo
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2008, (41) : 29 - 36
  • [23] Research Report: Building a File Observatory for Secure Parser Development
    Allison, Tim
    Burke, Wayne
    Mattmann, Chris
    Mensikova, Anastasija
    Southam, Philip
    Stonebraker, Ryan
    2021 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2021), 2021 : 121 - 127
  • [24] A Feasibility Study on Low Level Techniques for Improving Parsing Accuracy for Spanish Using Maltparser
    Ballesteros, Miguel
    Herrera, Jesus
    Francisco, Virginia
    Gervas, Pablo
    ARTIFICIAL INTELLIGENCE: THEORIES, MODELS AND APPLICATIONS, PROCEEDINGS, 2010, 6040 : 39 - +
  • [25] Learning relations from biomedical corpora using dependency trees
    Katrenko, Sophia
    Adriaans, Pieter
    KNOWLEDGE DISCOVERY AND EMERGENT COMPLEXITY IN BIOINFORMATICS, 2007, 4366 : 61 - +
  • [26] Unsupervised Word Sense Disambiguation Using Markov Random Field and Dependency Parser
    Chaplot, Devendra Singh
    Bhattacharyya, Pushpak
    Paranjape, Ashwin
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015 : 2217 - 2223
  • [27] Automatic Extraction of Hypernym & Meronym Relations in English Sentences Using Dependency Parser
    Sheena, N.
    Jasmine, Smitha M.
    Joseph, Shelbi
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING AND COMMUNICATIONS, 2016, 93 : 539 - 546
  • [28] It Depends: Dependency Parser Comparison Using A Web-based Evaluation Tool
    Choi, Jinho D.
    Tetreault, Joel
    Stent, Amanda
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015 : 387 - 396
  • [29] Research Report: Building a Wide Reach Corpus for Secure Parser Development
    Allison, Tim
    Burke, Wayne
    Constantinou, Valentino
    Goh, Edwin
    Mattmann, Chris
    Mensikova, Anastasija
    Southam, Philip
    Stonebraker, Ryan
    Timmaraju, Virisha
    2020 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2020), 2020 : 318 - 326
  • [30] Research Report: Progress on Building a File Observatory for Secure Parser Development
    Allison, Tim
    Burke, Wayne
    Graf, Dustin
    Mattmann, Chris
    Mensikova, Anastasija
    Milano, Mike
    Southam, Philip
    Stonebraker, Ryan
    2022 43RD IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2022), 2022 : 168 - 175