Nucleus Composition in Transition-based Dependency Parsing

被引:1
|
作者
Nivre, Joakim [1 ]
Basirat, Ali [2 ]
Duerich, Luise [1 ]
Moss, Adam [3 ]
机构
[1] Uppsala Univ, RISE Res Inst Sweden, Dept Linguist & Philol, Uppsala, Sweden
[2] Linkoping Univ, Dept Comp & Informat Sci, Linkoping, Sweden
[3] Uppsala Univ, Dept Linguist & Philol, Uppsala, Sweden
基金
瑞典研究理事会;
关键词
Syntactics;
D O I
10.1162/coli_a_00450
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dependency-based approaches to syntactic analysis assume that syntactic structure can be analyzed in terms of binary asymmetric dependency relations holding between elementary syntactic units. Computational models for dependency parsing almost universally assume that an elementary syntactic unit is a word, while the influential theory of Lucien Tesniere instead posits a more abstract notion of nucleus, which may be realized as one or more words. In this article, we investigate the effect of enriching computational parsing models with a concept of nucleus inspired by Tesniere. We begin by reviewing how the concept of nucleus can be defined in the framework of Universal Dependencies, which has become the de facto standard for training and evaluating supervised dependency parsers, and explaining how composition functions can be used to make neural transition-based dependency parsers aware of the nuclei thus defined. We then perform an extensive experimental study, using data from 20 languages to assess the impact of nucleus composition across languages with different typological characteristics, and utilizing a variety of analytical tools including ablation, linear mixed-effects models, diagnostic classifiers, and dimensionality reduction. The analysis reveals that nucleus composition gives small but consistent improvements in parsing accuracy for most languages, and that the improvement mainly concerns the analysis of main predicates, nominal dependents, clausal dependents, and coordination structures. Significant factors explaining the rate of improvement across languages include entropy in coordination structures and frequency of certain function words, in particular determiners. Analysis using dimensionality reduction and diagnostic classifiers suggests that nucleus composition increases the similarity of vectors representing nuclei of the same syntactic type.
引用
收藏
页码:849 / 886
页数:38
相关论文
共 50 条
  • [21] Minimalist Grammar Transition-Based Parsing
    Stanojevic, Milos
    LOGICAL ASPECTS OF COMPUTATIONAL LINGUISTICS: CELEBRATING 20 YEARS OF LACL (1996-2016), 2016, 10054 : 273 - 290
  • [22] Improving Transition-Based Dependency Parsing of Hindi and Urdu by Modeling Syntactically Relevant Phenomena
    Bhat, Riyaz Ahmad
    Bhat, Irshad Ahmad
    Sharma, Dipti Misra
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2017, 16 (03)
  • [23] Digital document analytics using logistic regressive and deep transition-based dependency parsing
    Rekha, D.
    Sangeetha, J.
    Ramaswamy, V.
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (02): : 2580 - 2596
  • [24] Digital document analytics using logistic regressive and deep transition-based dependency parsing
    D. Rekha
    J. Sangeetha
    V. Ramaswamy
    The Journal of Supercomputing, 2022, 78 : 2580 - 2596
  • [25] Improving multi-pass transition-based dependency parsing using enhanced shift actions
    Zhu, Chenxi
    Qiu, Xipeng
    Huang, Xuanjing
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8801 : 13 - 22
  • [26] Improving Multi-pass Transition-Based Dependency Parsing Using Enhanced Shift Actions
    Zhu, Chenxi
    Qiu, Xipeng
    Huang, Xuanjing
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, CCL 2014, 2014, 8801 : 13 - 22
  • [27] Efficient Disfluency Detection with Transition-based Parsing
    Wu, Shuangzhi
    Zhang, Dongdong
    Zhou, Ming
    Zhao, Tiejun
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015, : 495 - 503
  • [28] Transition-based Parsing with Stack-Transformers
    Astudillo, Ramon Fernandez
    Ballesteros, Miguel
    Naseem, Tahira
    Blodget, Austin
    Florian, Radu
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 1001 - 1007
  • [29] Transition-Based Korean Dependency Parsing Using Hybrid Word Representations of Syllables and Morphemes with LSTMs
    Na, Seung-Hoon
    Li, Jianri
    Shin, Jong-Hoon
    Kim, Kangil
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2019, 18 (02)
  • [30] Unlexicalized Transition-based Discontinuous Constituency Parsing
    Coavoux, Maximin
    Crabbe, Benoit
    Cohen, Shay B.
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2019, 7 : 73 - 89