Learning How to Generalize

Cited by: 11
Authors
Austerweil, Joseph L. [1]
Sanborn, Sophia [2]
Griffiths, Thomas L. [3]
Affiliations
[1] Univ Wisconsin, Dept Psychol, 1202 W Johnson St, Madison, WI 53706 USA
[2] Univ Calif, Dept Psychol, Berkeley, CA USA
[3] Princeton Univ, Dept Psychol, Princeton, NJ 08544 USA
Keywords
Generalization; Inductive inference; Bayesian modeling; Category learning; SIMILARITY; INDUCTION; MODELS; CATEGORIES; ATTENTION; EYETRACKING; INFORMATION
DOI
10.1111/cogs.12777
Chinese Library Classification (CLC) number
B84 [Psychology]
Discipline classification code
04; 0402
Abstract
Generalization is a fundamental problem solved by every cognitive system in essentially every domain. Although it is known that how people generalize varies in complex ways depending on the context or domain, it is an open question how people learn the appropriate way to generalize for a new context. To understand this capability, we cast the problem of learning how to generalize as a problem of learning the appropriate hypothesis space for generalization. We propose a normative mathematical framework for learning how to generalize by learning inductive biases for which properties are relevant for generalization in a domain from the statistical structure of features and concepts observed in that domain. More formally, the framework predicts that an ideal learner should learn to generalize by either taking the weighted average of the results of generalizing according to each hypothesis space, with weights given by how well each hypothesis space fits the previously observed concepts, or by using the most likely hypothesis space. We compare the predictions of this framework to human generalization behavior with three experiments in one perceptual (rectangles) and two conceptual (animals and numbers) domains. Across all three studies we find support for the framework's predictions, including individual-level support for averaging in the third study.
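The two strategies named in the abstract (averaging over hypothesis spaces versus using the most likely one) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the toy domain (the numbers 1 to 20), the two hypothesis spaces ("multiples" and "intervals"), the uniform priors, and the size-based likelihood.

```python
from math import prod

DOMAIN = range(1, 21)

# Two hypothetical hypothesis spaces over 1..20 (illustrative, not from the paper).
HYPOTHESIS_SPACES = {
    "multiples": [frozenset(n for n in DOMAIN if n % k == 0) for k in range(2, 6)],
    "intervals": [frozenset(range(lo, lo + 5)) for lo in range(1, 17)],
}

def likelihood(examples, hypothesis):
    """Assumed likelihood: examples are drawn uniformly from the hypothesis."""
    if all(x in hypothesis for x in examples):
        return (1.0 / len(hypothesis)) ** len(examples)
    return 0.0

def marginal_likelihood(examples, space):
    """P(examples | space) with a uniform prior over the hypotheses in the space."""
    return sum(likelihood(examples, h) for h in space) / len(space)

def space_posterior(past_concepts):
    """Weight each space by how well it fits previously observed concepts."""
    scores = {
        name: prod(marginal_likelihood(c, space) for c in past_concepts)
        for name, space in HYPOTHESIS_SPACES.items()
    }
    total = sum(scores.values())
    return {name: score / total for name, score in scores.items()}

def generalize_within(examples, query, space):
    """P(query belongs to the concept | examples), using a single space."""
    weights = [likelihood(examples, h) for h in space]
    total = sum(weights)
    if total == 0:
        return 0.0
    return sum(w for w, h in zip(weights, space) if query in h) / total

def generalize_average(examples, query, posterior):
    """Weighted average of the per-space generalization predictions."""
    return sum(
        p * generalize_within(examples, query, HYPOTHESIS_SPACES[name])
        for name, p in posterior.items()
    )

def generalize_map(examples, query, posterior):
    """Generalize using only the most probable hypothesis space."""
    best = max(posterior, key=posterior.get)
    return generalize_within(examples, query, HYPOTHESIS_SPACES[best])

# Previously observed concepts (made up), then a new concept and a query item.
past_concepts = [[4, 8], [6, 8], [10, 12]]
posterior = space_posterior(past_concepts)
avg_pred = generalize_average([12, 16], 20, posterior)
map_pred = generalize_map([12, 16], 20, posterior)
print(posterior, avg_pred, map_pred)
```

In this toy setup the two strategies can disagree: the averaging learner assigns the query an intermediate probability, whereas the learner that commits to the single best-fitting space follows that space alone.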
Pages: 32
Related papers
50 in total
  • [41] Deep learning for blood glucose level prediction: How well do models generalize across different data sets?
    Ghimire, Sarala
    Celik, Turgay
    Gerdes, Martin
    Omlin, Christian W.
    PLOS ONE, 2024, 19 (09):
  • [42] HOW CAN WE SIMPLIFY AND GENERALIZE WIND LOADS - DISCUSSION
    HOLMES, JD
    JOURNAL OF WIND ENGINEERING AND INDUSTRIAL AERODYNAMICS, 1995, 58 (1-2) : 139 - 141
  • [43] HOW FAR SHOULD WE GENERALIZE - THE CASE OF A WORKLOAD MODEL
    WICKER, AW
    AUGUST, RA
    PSYCHOLOGICAL SCIENCE, 1995, 6 (01) : 39 - 44
  • [44] EXPERIMENTALLY INDUCED LEARNED HELPLESSNESS - HOW FAR DOES IT GENERALIZE
    TUFFIN, K
    HESKETH, B
    PODD, J
    SOCIAL BEHAVIOR AND PERSONALITY, 1985, 13 (01): 55 - 62
  • [45] Learning Modular Structures That Generalize Out-of-Distribution
    Ashok, Arjun
    Devaguptapu, Chaitanya
    Balasubramanian, Vineeth N.
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 12905 - 12906
  • [46] LEADS: Learning Dynamical Systems that Generalize Across Environments
    Yin, Yuan
    Ayed, Ibrahim
    de Bezenac, Emmanuel
    Baskiotis, Nicolas
    Gallinari, Patrick
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [47] How not to Renyi-generalize the quantum conditional mutual information
    Erker, Paul
    JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2015, 48 (27)
  • [48] From ten to four and back again: how to generalize the geometry
    Koerber, Paul
    Martucci, Luca
    JOURNAL OF HIGH ENERGY PHYSICS, 2007, (08):
  • [49] How do deep-learning models generalize across populations? Cross-ethnicity generalization of COPD detection
    D. Almeida, Silvia
    Norajitra, Tobias
    Lueth, Carsten T.
    Wald, Tassilo
    Weru, Vivienn
    Nolden, Marco
    Jaeger, Paul F.
    von Stackelberg, Oyunbileg
    Heussel, Claus Peter
    Weinheimer, Oliver
    Biederer, Juergen
    Kauczor, Hans-Ulrich
    Maier-Hein, Klaus
    INSIGHTS INTO IMAGING, 2024, 15 (01):
  • [50] HOW TO GENERALIZE ONE-RELATOR GROUP-THEORY
    HOWIE, J
    ANNALS OF MATHEMATICS STUDIES, 1987, (111): 53 - 78