Deep learning can contrast the minimal pairs of syntactic data

被引:1
|
作者
Park, Kwonsik [1 ]
Park, Myung-Kwan [2 ]
Song, Sanghoun [1 ]
机构
[1] Korea Univ, Dept Linguist, 145 Anam Ro, Seoul 02841, South Korea
[2] Dongguk Univ, Dept English, 30,1 Gil, Seoul 04620, South Korea
基金
新加坡国家研究基金会;
关键词
deep learning; BERT; syntactic judgment; minimal pair; contrast;
D O I
10.17250/khisli.38.2.202106.008
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
The present work aims to assess the feasibility of using deep learning as a useful tool to investigate syntactic phenomena. To this end, the present study concerns three research questions: (i) whether deep learning can detect syntactically inappropriate constructions, (ii) whether deep learning's acceptability judgments are accountable, and (iii) whether deep learning's aspects of acceptability judgments are similar to human judgments. As a proxy for a deep learning language model, this study chooses BERT. The current paper comprises syntactically contrasted pairs of English sentences which come from the three test suites already available. The first one is 196 grammatical -ungrammatical minimal pairs from DeKeyser (2000). The second one is examples in four published syntax textbooks excerpted from Warstadt et al. (2019). The last one is extracted from Sprouse et al. (2013), which collects the examples reported in a theoretical linguistics journal, Linguistic Inquiry. The BERT models, base BERT and large BERT, are assessed by judging acceptability of items in the test suites with an evaluation metric, surprisal, which is used to measure how `surprised' a model is when encountering a word in a sequence of words, i.e., a sentence. The results are analyzed in the two frameworks: directionality and repulsion. The results of directionality reveals that the two versions of BERT are overall competent at distinguishing ungrammatical sentences from grammatical ones. The statistical results of both repulsion and directionality also reveal that the two variants of BERT do not differ significantly. Regarding repulsion, correct judgments and incorrect ones are significantly different. Additionally, the repulsion of the first test suite, which is excerpted from the items for testing learners' grammaticality judgments, is higher than the other test suites, which are excerpted from the syntax textbooks and published literature. This study compares BERT's acceptability judgments with magnitude estimation results reported in Sprouse et al. (2013) in order to examine if deep learning's syntactic knowledge is akin to human knowledge. The error analyses on incorrectly judged items reveal that there are some syntactic constructions that the two BERTs have trouble learning, which indicates that BERT's acceptability judgments are distributed not randomly.
引用
收藏
页码:395 / 424
页数:30
相关论文
共 50 条
  • [31] An Enhanced Deep Learning Model for Duplicate Question Pairs Recognition
    Abishek, K.
    Hariharan, Basuthkar Rajaram
    Valliyammai, C.
    SOFT COMPUTING IN DATA ANALYTICS, SCDA 2018, 2019, 758 : 769 - 777
  • [32] A Deep Reinforcement Learning Floorplanning Algorithm Based on Sequence Pairs
    Yu, Shenglu
    Du, Shimin
    Yang, Chang
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [33] Analysis of Evaluation Metrics with the Distance between Positive Pairs and Negative Pairs in Deep Metric Learning
    Oi, Hajime
    Kawakami, Rei
    Naemura, Takeshi
    PROCEEDINGS OF 17TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA 2021), 2021,
  • [34] A deep learning approach for virtual contrast enhancement in Contrast Enhanced Spectral Mammography
    Rofena, Aurora
    Guarrasi, Valerio
    Sarli, Marina
    Piccolo, Claudia Lucia
    Sammarra, Matteo
    Zobel, Bruno Beomonte
    Soda, Paolo
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2024, 116
  • [35] Big Data and Deep Learning
    Wilamowski, B. M.
    Wu, Bo
    Korniak, Janusz
    INES 2016 20TH JUBILEE IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT ENGINEERING SYSTEMS, 2016, : 11 - 16
  • [36] Deep Learning for Big Data
    Correia, Filipe
    Madureira, Ana
    Bernardino, Jorge
    INNOVATIONS IN BIO-INSPIRED COMPUTING AND APPLICATIONS, IBICA 2021, 2022, 419 : 205 - 215
  • [37] Synthetic Data for Deep Learning
    Horvath, Blanka
    QUANTITATIVE FINANCE, 2022, 22 (03) : 423 - 425
  • [38] Deep Learning on Private Data
    Riazi M.S.
    Darvish Rouani B.
    Koushanfar F.
    IEEE Security and Privacy, 2019, 17 (06): : 54 - 63
  • [39] Deep learning for geospatial data
    Wegner, Jan Dirk
    PROCEEDINGS OF THE 34TH INTERNATIONAL TECHNICAL MEETING OF THE SATELLITE DIVISION OF THE INSTITUTE OF NAVIGATION (ION GNSS+ 2021), 2021, : 4045 - 4079
  • [40] Deep Learning on Private Data
    Fasano, Andrew
    Leek, Tim
    Dolan-Gavitt, Brendan
    Bundt, Josh
    IEEE SECURITY & PRIVACY, 2019, 17 (06) : 84 - 88