Deep learning can contrast the minimal pairs of syntactic data

被引:1
|
作者
Park, Kwonsik [1 ]
Park, Myung-Kwan [2 ]
Song, Sanghoun [1 ]
机构
[1] Korea Univ, Dept Linguist, 145 Anam Ro, Seoul 02841, South Korea
[2] Dongguk Univ, Dept English, 30,1 Gil, Seoul 04620, South Korea
基金
新加坡国家研究基金会;
关键词
deep learning; BERT; syntactic judgment; minimal pair; contrast;
D O I
10.17250/khisli.38.2.202106.008
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
The present work aims to assess the feasibility of using deep learning as a useful tool to investigate syntactic phenomena. To this end, the present study concerns three research questions: (i) whether deep learning can detect syntactically inappropriate constructions, (ii) whether deep learning's acceptability judgments are accountable, and (iii) whether deep learning's aspects of acceptability judgments are similar to human judgments. As a proxy for a deep learning language model, this study chooses BERT. The current paper comprises syntactically contrasted pairs of English sentences which come from the three test suites already available. The first one is 196 grammatical -ungrammatical minimal pairs from DeKeyser (2000). The second one is examples in four published syntax textbooks excerpted from Warstadt et al. (2019). The last one is extracted from Sprouse et al. (2013), which collects the examples reported in a theoretical linguistics journal, Linguistic Inquiry. The BERT models, base BERT and large BERT, are assessed by judging acceptability of items in the test suites with an evaluation metric, surprisal, which is used to measure how `surprised' a model is when encountering a word in a sequence of words, i.e., a sentence. The results are analyzed in the two frameworks: directionality and repulsion. The results of directionality reveals that the two versions of BERT are overall competent at distinguishing ungrammatical sentences from grammatical ones. The statistical results of both repulsion and directionality also reveal that the two variants of BERT do not differ significantly. Regarding repulsion, correct judgments and incorrect ones are significantly different. Additionally, the repulsion of the first test suite, which is excerpted from the items for testing learners' grammaticality judgments, is higher than the other test suites, which are excerpted from the syntax textbooks and published literature. This study compares BERT's acceptability judgments with magnitude estimation results reported in Sprouse et al. (2013) in order to examine if deep learning's syntactic knowledge is akin to human knowledge. The error analyses on incorrectly judged items reveal that there are some syntactic constructions that the two BERTs have trouble learning, which indicates that BERT's acceptability judgments are distributed not randomly.
引用
收藏
页码:395 / 424
页数:30
相关论文
共 50 条
  • [41] Enhancing Question Pairs Identification with Ensemble Learning: Integrating Machine Learning and Deep Learning Models
    Tarek, Salsabil
    Noaman, Hatem M.
    Kayed, Mohammed
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (11) : 981 - 992
  • [42] Deep Contrast Learning Approach for Address Semantic Matching
    Chen, Jian
    Chen, Jianpeng
    She, Xiangrong
    Mao, Jian
    Chen, Gang
    APPLIED SCIENCES-BASEL, 2021, 11 (16):
  • [43] A syntactic dependency method for aspect-level sentiment classification by deep learning
    Chen, Siyi
    Du, Xinhao
    Zhao, Ji
    Huang, Huixian
    Chen, Xiaolong
    MEASUREMENT & CONTROL, 2023, 56 (5-6): : 1057 - 1065
  • [44] Respect the surroundings: Effects of phonetic context variability on infants' learning of minimal pairs
    Hoehle, Barbara
    Fritzsche, Tom
    Boll-Avetisyan, Natalie
    Hullebus, Marc
    Gafos, Adamantios
    JASA EXPRESS LETTERS, 2021, 1 (02):
  • [45] Deep hash Based on Asymmetric Learning and Center Contrast
    Xu, Yongjian
    2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 686 - 690
  • [46] From Dose Reduction to Contrast Maximization Can Deep Learning Amplify the Impact of Contrast Media on Brain Magnetic Resonance Image Quality? A Reader Study
    Bone, Alexandre
    Ammari, Samy
    Menu, Yves
    Balleyguier, Corinne
    Moulton, Eric
    Chouzenoux, Emilie
    Volk, Andreas
    Garcia, Gabriel C. T. E.
    Nicolas, Francois
    Robert, Philippe
    Rohe, Marc-Michel
    Lassau, Nathalie
    INVESTIGATIVE RADIOLOGY, 2022, 57 (08) : 527 - 535
  • [47] Learning Data Transformations with Minimal User Effort
    Minh Pham
    Knoblock, Craig A.
    Pujara, Jay
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 657 - 664
  • [48] Can Deep Learning Improve Technical Analysis of Forex Data to Predict Future Price Movements?
    Fisichella, Marco
    Garolla, Filippo
    IEEE Access, 2021, 9 : 153083 - 153101
  • [49] STARTING DRIVING STYLE RECOGNITION OF ELECTRIC CITY BUS BASED ON DEEP LEARNING AND CAN DATA
    Zhao, Dengfeng
    Fu, Zhijun
    Liu, Chaohui
    Hou, Junjian
    Dong, Shesen
    Zhong, Yudong
    Transport, 2024, 39 (03) : 229 - 239
  • [50] Can Deep Learning Improve Technical Analysis of Forex Data to Predict Future Price Movements?
    Fisichella, Marco
    Garolla, Filippo
    IEEE ACCESS, 2021, 9 : 153083 - 153101