Integrating Selectional Constraints and Subcategorization Frames in a Dependency Parser

被引:2
|
作者
Mirroshandel, Seyed Abolghasem [1 ]
Nasr, Alexis [2 ]
机构
[1] Univ Guilan, Fac Engn, Dept Comp Engn, Rasht, Iran
[2] Univ Aix Marseille, CNRS, Lab Informat Fondamentale, Marseille, France
关键词
D O I
10.1162/COLI_a_00242
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Statistical parsers are trained on treebanks that are composed of a few thousand sentences. In order to prevent data sparseness and computational complexity, such parsers make strong independence hypotheses on the decisions that are made to build a syntactic tree. These independence hypotheses yield a decomposition of the syntactic structures into small pieces, which in turn prevent the parser from adequately modeling many lexico-syntactic phenomena like selectional constraints and subcategorization frames. Additionally, treebanks are several orders of magnitude too small to observe many lexico-syntactic regularities, such as selectional constraints and subcategorization frames. In this article, we propose a solution to both problems: how to account for patterns that exceed the size of the pieces that are modeled in the parser and how to obtain subcategorization frames and selectional constraints from raw corpora and incorporate them in the parsing process. The method proposed was evaluated on French and on English. The experiments on French showed a decrease of 41.6% of selectional constraint violations and a decrease of 22% of erroneous subcategorization frame assignment. These figures are lower for English: 16.21% in the first case and 8.83% in the second.
引用
下载
收藏
页码:55 / 90
页数:36
相关论文
共 50 条
  • [1] DILUCT: An open-source Spanish dependency parser based on rules, heuristics, and selectional preferences
    Calvo, Hiram
    Gelbukh, Alexander
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PROCEEDINGS, 2006, 3999 : 164 - 175
  • [2] Text Completion using a Context-Integrating Dependency Parser
    Salama, Amr Rekaby
    Alacam, Oezge
    Menzel, Wolfgang
    REPRESENTATION LEARNING FOR NLP, 2018, : 41 - 49
  • [3] ON THE SEMANTIC CONTENT OF SUBCATEGORIZATION FRAMES
    FISHER, C
    GLEITMAN, H
    GLEITMAN, LR
    COGNITIVE PSYCHOLOGY, 1991, 23 (03) : 331 - 392
  • [4] A Dependency Parser for Thai
    Tongchim, Shisanu
    Altmeyer, Randolf
    Sornlertlamvanich, Virach
    Isahara, Hitoshi
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 136 - 139
  • [5] Two approaches for incorporating linguistic constraints to improve the usability of Telugu dependency parser
    Rao, R. Rajeswara
    Kumari, B. Venkata Seshu
    INTERNATIONAL JOURNAL OF APPLIED PATTERN RECOGNITION, 2016, 3 (02) : 135 - 144
  • [6] Automatic Acquisition of Hungarian Subcategorization Frames
    Simon, Eszter
    Sereny, Andras
    Babarczy, Anna
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : E7 - E11
  • [7] Automatic extraction of subcategorization frames for Italian
    Ienco, Dino
    Villata, Serena
    Bosco, Cristina
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 2094 - 2100
  • [8] Integrating NLP Tools in a Distributed Environment: A Case Study Chaining a Tagger with a Dependency Parser
    Rubino, Francesco
    Frontini, Francesca
    Quochi, Valeria
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 2125 - 2131
  • [9] Dependency Parser for Sanskrit Verses
    Kulkarni, Amba
    Vikram, Sanal
    Sriram, K.
    PROCEEDINGS OF THE 6TH INTERNATIONAL SANSKRIT COMPUTATIONAL LINGUISTICS SYMPOSIUM (ISCLS 2019), 2019, : 15 - 28
  • [10] Improving Malt Dependency Parser using a Simple Grammar-Driven Unlexicalised Dependency Parser
    Eragani, Anil Krishna
    Kuchibhotla, Varun
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2014), 2014, : 211 - 214