Neural Network Acceptability Judgments

被引:287
|
作者
Warstadt, Alex [1 ]
Singh, Amanpreet [1 ,2 ]
Bowman, Samuel R. [1 ]
机构
[1] NYU, New York, NY 10003 USA
[2] Facebook AI Res, Menlo Pk, CA USA
基金
美国国家科学基金会;
关键词
ENGLISH; POVERTY; FAMILY;
D O I
10.1162/tacl_a_00290
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates the ability of artificial neural networks to judge the grammatical acceptability of a sentence, with the goal of testing their linguistic competence. We introduce the Corpus of Linguistic Acceptability (CoLA), a set of 10,657 English sentences labeled as grammatical or ungrammatical from published linguistics literature. As baselines, we train several recurrent neural network models on acceptability classification, and find that our models outperform unsupervised models by Lau et al. (2016) on CoLA. Error-analysis on specific grammatical phenomena reveals that both Lau et al.'s models and ours learn systematic generalizations like subject-verb-object order. However, all models we test perform far below human level on a wide range of grammatical constructions.
引用
收藏
页码:625 / 641
页数:17
相关论文
共 50 条
  • [31] Raising the Bar on Acceptability Judgments Classification: An Experiment on ItaCoLA Using ELECTRA
    Guarasci, Raffaele
    Minutolo, Aniello
    Buonaiuto, Giuseppe
    De Pietro, Giuseppe
    Esposito, Massimo
    [J]. ELECTRONICS, 2024, 13 (13)
  • [32] Schemas and the frequency/acceptability mismatch: Corpus distribution predicts sentence judgments
    Flach, Susanne
    [J]. COGNITIVE LINGUISTICS, 2020, 31 (04) : 609 - 645
  • [33] Naive v. expert intuitions: An empirical study of acceptability judgments
    Dabrowska, Ewa
    [J]. LINGUISTIC REVIEW, 2010, 27 (01): : 1 - 23
  • [34] The source ambiguity problem: Distinguishing the effects of grammar and processing on acceptability judgments
    Hofmeister, Philip
    Jaeger, T. Florian
    Arnon, Inbal
    Sag, Ivan A.
    Snider, Neal
    [J]. LANGUAGE AND COGNITIVE PROCESSES, 2013, 28 (1-2): : 48 - 87
  • [35] Inquiry of a Task Parameter and a Sampling Parameter for Speeded Acceptability Judgments Experiments
    de Souza, Ricardo Augusto
    Fonseca de Oliveira, Candido Samuel
    Soares-Silva, Jesiel
    Araujo Penzin, Alberto Gallo
    Santos, Alexandre Alves
    [J]. REVISTA DE ESTUDOS DA LINGUAGEM, 2015, 23 (01) : 211 - 244
  • [36] Acceptability judgments in bilectal populations Competition, gradience and socio-syntax
    Papadopoulou, Elena
    Leivada, Evelina
    Pavlou, Natalia
    [J]. LINGUISTIC VARIATION, 2014, 14 (01) : 109 - 128
  • [37] FREQUENCY DURATION AND PERCEPTUAL MEASURES IN RELATION TO JUDGMENTS OF ALARYNGEAL SPEECH ACCEPTABILITY
    SHIPP, T
    [J]. JOURNAL OF SPEECH AND HEARING RESEARCH, 1967, 10 (03): : 417 - &
  • [38] A validation of Amazon Mechanical Turk for the collection of acceptability judgments in linguistic theory
    Sprouse, Jon
    [J]. BEHAVIOR RESEARCH METHODS, 2011, 43 (01) : 155 - 167
  • [39] A validation of Amazon Mechanical Turk for the collection of acceptability judgments in linguistic theory
    Jon Sprouse
    [J]. Behavior Research Methods, 2011, 43 : 155 - 167
  • [40] Monolingual and Cross-Lingual Acceptability Judgments with the Italian CoLA corpus
    Trotta, Daniela
    Guarasci, Raffaele
    Leonardelli, Elisa
    Tonelli, Sara
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 2929 - 2940