A Systematic Assessment of Syntactic Generalization in Neural Language Models

被引:0
|
作者
Hu, Jennifer [1 ]
Gauthier, Jon [1 ]
Qian, Peng [1 ]
Wilcox, Ethan [2 ]
Levy, Roger P. [1 ]
机构
[1] MIT, Dept Brain & Cognit Sci, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Harvard Univ, Dept Linguist, Cambridge, MA 02138 USA
关键词
ACCEPTABILITY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While state-of-the-art neural network models continue to achieve lower perplexity scores on language modeling benchmarks, it remains unknown whether optimizing for broad-coverage predictive performance leads to human-like syntactic knowledge. Furthermore, existing work has not provided a clear picture about the model properties required to produce proper syntactic generalizations. We present a systematic evaluation of the syntactic knowledge of neural language models, testing 20 combinations of model types and data sizes on a set of 34 English-language syntactic test suites. We find substantial differences in syntactic generalization performance by model architecture, with sequential models underperforming other architectures. Factorially manipulating model architecture and training dataset size (1M-40M words), we find that variability in syntactic generalization performance is substantially greater by architecture than by dataset size for the corpora tested in our experiments. Our results also reveal a dissociation between perplexity and syntactic generalization performance.
引用
收藏
页码:1725 / 1744
页数:20
相关论文
共 50 条
  • [1] Structural Supervision Improves Few-Shot Learning and Syntactic Generalization in Neural Language Models
    Wilcox, Ethan
    Qian, Peng
    Futrell, Richard
    Kohita, Ryosuke
    Levy, Roger
    Ballesteros, Miguel
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 4640 - 4652
  • [2] Overestimation of Syntactic Representation in Neural Language Models
    Kodner, Jordan
    Gupta, Nitish
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 1757 - 1762
  • [3] Causal Analysis of Syntactic Agreement Mechanisms in Neural Language Models
    Finlayson, Matthew
    Mueller, Aaron
    Gehrmann, Sebastian
    Shieber, Stuart
    Linzen, Tal
    Belinkov, Yonatan
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 1828 - 1843
  • [4] Neural Language Models as Psycholinguistic Subjects: Representations of Syntactic State
    Futrell, Richard
    Wilcox, Ethan
    Morita, Takashi
    Qian, Peng
    Ballesteros, Miguel
    Levy, Roger
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 32 - 42
  • [5] Transductive Learning of Neural Language Models for Syntactic and Semantic Analysis
    Ouchi, Hiroki
    Suzuki, Jun
    Inui, Kentaro
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 3665 - 3671
  • [6] Do Neural Language Models Show Preferences for Syntactic Formalisms?
    Kulmizev, Artur
    Ravishankar, Vinit
    Abdou, Mostafa
    Nivre, Joakim
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 4077 - 4091
  • [7] Generalization of Syntactic Knowledge in Semiartificial Language Learning
    Tao, Yan
    Williams, John N.
    [J]. LANGUAGE LEARNING, 2018, 68 (04) : 1001 - 1031
  • [8] Mind the Gap: Assessing Temporal Generalization in Neural Language Models
    Lazaridou, Angeliki
    Kuncoro, Adhiguna
    Gribovskaya, Elena
    Agrawal, Devang
    Liska, Adam
    Terzi, Tayfun
    Gimenez, Mai
    d'Autume, Cyprien de Masson
    Kocisky, Tomas
    Ruder, Sebastian
    Yogatama, Dani
    Cao, Kris
    Young, Susannah
    Blunsom, Phil
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [9] A neural syntactic language model
    Emami, A
    Jelinek, F
    [J]. MACHINE LEARNING, 2005, 60 (1-3) : 195 - 227
  • [10] A Neural Syntactic Language Model
    Ahmad Emami
    Frederick Jelinek
    [J]. Machine Learning, 2005, 60 : 195 - 227