Building a UD treebank using existing resources from related languages: the case of Galician

被引:0
|
作者
Garcia, Marcos [1 ]
Gomez-Rodriguez, Carlos [2 ]
Alonso, Miguel A. [2 ]
机构
[1] Univ A Coruna, Grp LyS, Dept Galego Portugues Frances & Linguist, La Coruna, Spain
[2] Univ A Coruna, Dept Comp, Grp LyS, La Coruna, Spain
来源
关键词
parsing; treebank; universal dependencies; Galician;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
This paper presents a novel strategy for creating a Universal Dependencies (UD) treebank of a low-resource language. The method consists of adapting and combining different UD treebanks from related varieties in order to train a parser for the target language. More precisely, the paper explores the influence of three different levels for the selection and adaptation of the source treebanks: (i) the relatedness of the linguistic varieties, (ii) the adaptation of features based on lexical and spelling data, and (iii) the agreement in annotation criteria between different treebanks. The proposed strategy allowed us to train a parser for analyzing, with promising results, a small Galician corpus without previous availability of labeled data for this language. After a few bootstrapping iterations, we obtained a UD gold-standard corpus, used for proving the effectiveness of the proposed method.
引用
收藏
页码:33 / 40
页数:8
相关论文
共 50 条
  • [1] A methodology of estimation to accumulated resources and dismantling materials from the existing building stock
    Weng, Chia-Liang
    Yashiro, Tomanori
    [J]. FOURTH INTERNATIONAL SYMPOSIUM ON ENVIRONMENTALLY CONSCIOUS DESIGN AND INVERSE MANUFACTURING, PROCEEDINGS, 2005, : 840 - 841
  • [2] Learning Languages from Bounded Resources: The Case of the DFA and the Balls of Strings
    de la Higuera, Colin
    Janodet, Jean-Christophe
    Tantini, Frederic
    [J]. GRAMMATICAL INFERENCE: ALGORITHMS AND APPLICATIONS, PROCEEDINGS, 2008, 5278 : 43 - 56
  • [3] Retrofit of an Existing School Building: A Case Study from Hyderabad, India
    Srivastav, Vertika
    Puchalapalli, Swati
    Manu, Sanyogita
    [J]. SMART AND HEALTHY WITHIN THE TWO-DEGREE LIMIT (PLEA 2018), VOL 3, 2018, : 1012 - 1014
  • [4] New Algorithm for Building Ontology from Existing Rules: A Case Study
    Kharbat, Faten
    Ghalayini, Haya
    [J]. 2009 INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT AND ENGINEERING, PROCEEDINGS, 2009, : 12 - +
  • [5] From local building practices to vulnerability reduction: building resilience through existing resources, knowledge and know-how
    Moles, Olivier
    Caimi, Annalisa
    Islam, Mohammad Shariful
    Hossain, Tahsin Reza
    Podder, Ratan Kumar
    [J]. 4TH INTERNATIONAL CONFERENCE ON BUILDING RESILIENCE, INCORPORATING THE 3RD ANNUAL CONFERENCE OF THE ANDROID DISASTER RESILIENCE NETWORK, 2014, 18 : 932 - 939
  • [6] Are Existing LCIA Methods Related to Mineral and Metal Resources Relevant for an AESA Approach Applied to the Building Sector? Case Study on the Construction of New Buildings in France
    Bendahmane, Nada
    Gondran, Natacha
    Chevalier, Jacques
    [J]. SUSTAINABILITY, 2024, 16 (03)
  • [7] From existing conventional building towards LEED certified green building: case study in Bangladesh
    Iqbal, Ashik
    Jahan, Ismat
    Al Wasiew, Qudrati
    Emu, Imtiaz Ahmed
    Chowdhury, Dipta
    [J]. FRONTIERS IN BUILT ENVIRONMENT, 2023, 9
  • [8] Indoor Localisation using Existing WiFi Infrastructure - A Case Study at a University Building
    Toh, Cornelius
    Lau, Sian Lun
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON VIRTUAL SYSTEMS AND MULTIMEDIA (VSMM), 2016, : 404 - 408
  • [9] Building taxonomies using organizational resources: A case of business consulting environment
    Chaudhry, AS
    Ling, GH
    [J]. KNOWLEDGE ORGANIZATION, 2005, 32 (01): : 25 - 46
  • [10] International Existing Building Code Implementation and Associated Challenges: Case Studies Related to the Repair of Damaged Structures
    Smith, Ross J.
    Lewis, Matthew E.
    [J]. Structures Congress 2015, 2015, : 690 - 701