Nucleus Composition in Transition-based Dependency Parsing

被引:1
|
作者
Nivre, Joakim [1 ]
Basirat, Ali [2 ]
Duerich, Luise [1 ]
Moss, Adam [3 ]
机构
[1] Uppsala Univ, RISE Res Inst Sweden, Dept Linguist & Philol, Uppsala, Sweden
[2] Linkoping Univ, Dept Comp & Informat Sci, Linkoping, Sweden
[3] Uppsala Univ, Dept Linguist & Philol, Uppsala, Sweden
基金
瑞典研究理事会;
关键词
Syntactics;
D O I
10.1162/coli_a_00450
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dependency-based approaches to syntactic analysis assume that syntactic structure can be analyzed in terms of binary asymmetric dependency relations holding between elementary syntactic units. Computational models for dependency parsing almost universally assume that an elementary syntactic unit is a word, while the influential theory of Lucien Tesniere instead posits a more abstract notion of nucleus, which may be realized as one or more words. In this article, we investigate the effect of enriching computational parsing models with a concept of nucleus inspired by Tesniere. We begin by reviewing how the concept of nucleus can be defined in the framework of Universal Dependencies, which has become the de facto standard for training and evaluating supervised dependency parsers, and explaining how composition functions can be used to make neural transition-based dependency parsers aware of the nuclei thus defined. We then perform an extensive experimental study, using data from 20 languages to assess the impact of nucleus composition across languages with different typological characteristics, and utilizing a variety of analytical tools including ablation, linear mixed-effects models, diagnostic classifiers, and dimensionality reduction. The analysis reveals that nucleus composition gives small but consistent improvements in parsing accuracy for most languages, and that the improvement mainly concerns the analysis of main predicates, nominal dependents, clausal dependents, and coordination structures. Significant factors explaining the rate of improvement across languages include entropy in coordination structures and frequency of certain function words, in particular determiners. Analysis using dimensionality reduction and diagnostic classifiers suggests that nucleus composition increases the similarity of vectors representing nuclei of the same syntactic type.
引用
收藏
页码:849 / 886
页数:38
相关论文
共 50 条
  • [1] Bidirectional Transition-Based Dependency Parsing
    Yuan, Yunzhe
    Jiang, Yong
    Tu, Kewei
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 7434 - 7441
  • [2] Transition-Based Dependency Parsing Exploiting Supertags
    Ouchi, Hiroki
    Duh, Kevin
    Shindo, Hiroyuki
    Matsumoto, Yuji
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (11) : 2059 - 2068
  • [3] Transition-based dependency parsing with topological fields
    de Kok, Daniel
    Hinrichs, Erhard
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, 2016, : 1 - 7
  • [4] Transition-Based Parsing for Deep Dependency Structures
    Zhang, Xun
    Du, Yantao
    Sun, Weiwei
    Wan, Xiaojun
    COMPUTATIONAL LINGUISTICS, 2016, 42 (03) : 353 - 389
  • [5] Undirected Dependency Structures for Transition-Based Parsing
    Gomez-Rodriguez, Carlos
    Fernandez-Gonzalez, Daniel
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2012, (48): : 43 - 50
  • [6] Vietnamese Transition-based Dependency Parsing with Supertag Features
    Nguyen, Kiet V.
    Ngan Luu-Thuy Nguyen
    2016 EIGHTH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE), 2016, : 175 - 180
  • [7] Transition-Based Dependency Parsing with Long Distance Collocations
    Zhu, Chenxi
    Qiu, Xipeng
    Huang, Xuanjing
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2015, 2015, 9362 : 12 - 24
  • [8] Greedy Transition-Based Dependency Parsing with Stack LSTMs
    Ballesteros, Miguel
    Dyer, Chris
    Goldberg, Yoav
    Smith, Noah A.
    COMPUTATIONAL LINGUISTICS, 2017, 43 (02) : 311 - 347
  • [9] Exploring Automatic Feature Selection for Transition-Based Dependency Parsing
    Ballesteros, Miguel
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2013, (51): : 119 - 126
  • [10] Global Transition-based Non-projective Dependency Parsing
    Gomez-Rodriguez, Carlos
    Shi, Tianze
    Lee, Lillian
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 2664 - 2675