Analyzing Machine-Learned Representations: A Natural Language Case Study

Cited by: 3
Authors
Dasgupta, Ishita [1, 2]
Guo, Demi [3]
Gershman, Samuel J. [4, 5]
Goodman, Noah D. [6, 7]
Affiliations
[1] Princeton Univ, Dept Psychol, 35 Olden St, Princeton, NJ 08540 USA
[2] Princeton Univ, Dept Comp Sci, 35 Olden St, Princeton, NJ 08540 USA
[3] Harvard Univ, Dept Comp Sci, Cambridge, MA 02138 USA
[4] Harvard Univ, Dept Psychol, Cambridge, MA 02138 USA
[5] Harvard Univ, Ctr Brain Sci, Cambridge, MA 02138 USA
[6] Stanford Univ, Dept Psychol, Stanford, CA 94305 USA
[7] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
Keywords
Representation learning; Natural language inference; Compositionality; Heuristic; Strategies; Sentence embeddings; Generalization; Test datasets; BELIEF BIAS; ACQUISITION; INFORMATION; MODELS; LOGIC;
DOI
10.1111/cogs.12925
Chinese Library Classification number
B84 [Psychology];
Discipline classification code
04; 0402;
Abstract
As modern deep networks become more complex and get closer to human-like capabilities in certain domains, the question arises as to how the representations and decision rules they learn compare to the ones in humans. In this work, we study representations of sentences in one such artificial system for natural language processing. We first present a diagnostic test dataset to examine the degree of abstract composable structure represented. Analyzing performance on these diagnostic tests indicates a lack of systematicity in representations and decision rules, and reveals a set of heuristic strategies. We then investigate the effect of training distribution on learning these heuristic strategies, and we study changes in these representations with various augmentations to the training set. Our results reveal parallels to the analogous representations in people. We find that these systems can learn abstract rules and generalize them to new contexts under certain circumstances, similar to human zero-shot reasoning. However, we also note some shortcomings in this generalization behavior, similar to human judgment errors like belief bias. Studying these parallels suggests new ways to understand psychological phenomena in humans and informs strategies for building artificial intelligence with human-like language understanding.
Pages: 31