Nucleus Composition in Transition-based Dependency Parsing

Cited by: 1
Authors
Nivre, Joakim [1]
Basirat, Ali [2]
Dürlich, Luise [1]
Moss, Adam [3]
Affiliations
[1] Uppsala Univ, RISE Res Inst Sweden, Dept Linguist & Philol, Uppsala, Sweden
[2] Linkoping Univ, Dept Comp & Informat Sci, Linkoping, Sweden
[3] Uppsala Univ, Dept Linguist & Philol, Uppsala, Sweden
Funding
Swedish Research Council;
Keywords
Syntactics;
DOI
10.1162/coli_a_00450
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Dependency-based approaches to syntactic analysis assume that syntactic structure can be analyzed in terms of binary asymmetric dependency relations holding between elementary syntactic units. Computational models for dependency parsing almost universally assume that an elementary syntactic unit is a word, while the influential theory of Lucien Tesnière instead posits a more abstract notion of nucleus, which may be realized as one or more words. In this article, we investigate the effect of enriching computational parsing models with a concept of nucleus inspired by Tesnière. We begin by reviewing how the concept of nucleus can be defined in the framework of Universal Dependencies, which has become the de facto standard for training and evaluating supervised dependency parsers, and explaining how composition functions can be used to make neural transition-based dependency parsers aware of the nuclei thus defined. We then perform an extensive experimental study, using data from 20 languages to assess the impact of nucleus composition across languages with different typological characteristics, and utilizing a variety of analytical tools including ablation, linear mixed-effects models, diagnostic classifiers, and dimensionality reduction. The analysis reveals that nucleus composition gives small but consistent improvements in parsing accuracy for most languages, and that the improvement mainly concerns the analysis of main predicates, nominal dependents, clausal dependents, and coordination structures. Significant factors explaining the rate of improvement across languages include entropy in coordination structures and frequency of certain function words, in particular determiners. Analysis using dimensionality reduction and diagnostic classifiers suggests that nucleus composition increases the similarity of vectors representing nuclei of the same syntactic type.
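As a rough illustration of how nuclei might be read off a Universal Dependencies tree, the sketch below groups each function word with the content word it attaches to. It assumes that a nucleus is delimited by the UD functional relations (aux, case, cc, clf, cop, det, mark); the abstract does not list the relations used, so that set, the function name `group_nuclei`, and the input layout are all assumptions made for this example and not the authors' implementation.

```python
# Assumption: the UD functional relations serve as nucleus-internal relations.
FUNCTIONAL_RELATIONS = {"aux", "case", "cc", "clf", "cop", "det", "mark"}


def group_nuclei(heads, deprels):
    """Group tokens into nuclei.

    heads   -- 1-based head index for each token (0 marks the root)
    deprels -- UD relation label for each token (subtypes like 'aux:pass' allowed)
    Returns a dict mapping each nucleus core (token id) to its member token ids.
    """

    def core_of(token):
        # Follow functional relations upward until a content word is reached.
        visited = set()
        while (
            deprels[token - 1].split(":")[0] in FUNCTIONAL_RELATIONS
            and heads[token - 1] != 0
            and token not in visited
        ):
            visited.add(token)
            token = heads[token - 1]
        return token

    nuclei = {}
    for token in range(1, len(heads) + 1):
        nuclei.setdefault(core_of(token), []).append(token)
    return nuclei


# "The dog has barked": The -det-> dog, dog -nsubj-> barked, has -aux-> barked
example = group_nuclei([2, 4, 4, 0], ["det", "nsubj", "aux", "root"])
print(example)
```

In this toy sentence the determiner joins the noun and the auxiliary joins the main verb, giving two nuclei. A composition function in a parser would then combine the vectors of the members of each group; how that combination is computed is not specified in the abstract and is left out here.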
Pages: 849-886
Page count: 38