Improvements to Dependency Parsing Using Automatic Simplification of Data

Author
Jelinek, Tomas [1]
Affiliation
[1] Charles Univ Prague, Prague, Czech Republic
Keywords
dependency parsing; text simplification; syntax
DOI
Not available
Chinese Library Classification number
H0 [Linguistics]
Discipline classification codes
030303; 0501; 050102
Abstract
In dependency parsing, much effort is devoted to developing new methods of language modeling and better feature settings. Less attention is paid to the actual linguistic data and how appropriate they are for automatic parsing: linguistic data can be too complex for a given parser, morphological tags may not reflect the syntactic properties of words well, and a detailed, complex annotation scheme may be ill-suited to automatic parsing. In this paper, I present a study of this problem on the following case: automatic dependency parsing of the Prague Dependency Treebank data with two dependency parsers, MSTParser and MaltParser. I show that small, reversible simplifications of the text and of the annotation yield a considerable improvement in parsing accuracy. To facilitate the language modeling performed by the parser, I reduce the variability of lemmas and word forms in the text. I modify the system of morphological annotation to make it more suitable for parsing. Finally, the dependency annotation scheme is also partially modified. All such modifications are automatic and fully reversible: after parsing is done, the original data and structures are automatically restored. With MaltParser, I achieve an 8.3% error rate reduction.
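The idea of a reversible simplification can be illustrated with a minimal sketch. It is not the procedure used in the paper: the choice of numeric tokens as the thing to simplify, the `<NUM>` placeholder, and the function names `simplify`, `restore`, and `relative_error_reduction` are all assumptions made for illustration. The sketch reduces word-form variability before parsing, restores the original forms afterwards, and shows how a relative error rate reduction is usually computed from two accuracy scores.

```python
def simplify(tokens):
    """Replace numeric tokens with one placeholder (hypothetical rule).

    Returns the simplified tokens and a map from token position to the
    original form, which is all that is needed to undo the change.
    """
    simplified, originals = [], {}
    for i, tok in enumerate(tokens):
        # Treat tokens such as "1995", "3.5" or "1,000" as numbers.
        if tok.replace(",", "").replace(".", "").isdigit():
            originals[i] = tok
            simplified.append("<NUM>")
        else:
            simplified.append(tok)
    return simplified, originals


def restore(tokens, originals):
    """Put the original word forms back after parsing."""
    return [originals.get(i, tok) for i, tok in enumerate(tokens)]


def relative_error_reduction(baseline_accuracy, new_accuracy):
    """Share of the baseline errors removed by the new system."""
    baseline_error = 1.0 - baseline_accuracy
    new_error = 1.0 - new_accuracy
    return (baseline_error - new_error) / baseline_error


sentence = ["In", "1995", ",", "3.5", "percent"]
simplified, originals = simplify(sentence)
# The parser would run on `simplified`; the tree is then mapped back.
assert restore(simplified, originals) == sentence
```

Because the position map is kept outside the text, the simplification loses no information: the parser sees fewer distinct word forms, while the output can still be reported on the original tokens. The accuracy values passed to `relative_error_reduction` would come from an evaluation run; none are given here.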
Pages: 5