Scaling laws for language encoding models in fMRI

被引:0
|
作者
Antonello, Richard J. [1 ]
Vaidya, Aditya R. [1 ]
Huth, Alexander G. [1 ,2 ]
机构
[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
[2] Univ Texas Austin, Dept Neurosci, Austin, TX 78712 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Representations from transformer-based unidirectional language models are known to be effective at predicting brain responses to natural language. However, most studies comparing language models to brains have used GPT-2 or similarly sized language models. Here we tested whether larger open-source models such as those from the OPT and LLaMA families are better at predicting brain responses recorded using fMRI. Mirroring scaling results from other contexts, we found that brain prediction performance scales logarithmically with model size from 125M to 30B parameter models, with similar to 15% increased encoding performance as measured by correlation with a held-out test set across 3 subjects. Similar logarithmic behavior was observed when scaling the size of the fMRI training set. We also characterized scaling for acoustic encoding models that use HuBERT, WavLM, and Whisper, and we found comparable improvements with model size. A noise ceiling analysis of these large, high-performance encoding models showed that performance is nearing the theoretical maximum for brain areas such as the precuneus and higher auditory cortex. These results suggest that increasing scale in both models and data will yield incredibly effective models of language processing in the brain, enabling better scientific understanding as well as applications such as decoding.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] SCALING RECURRENT NEURAL NETWORK LANGUAGE MODELS
    Williams, Will
    Prasad, Niranjani
    Mrva, David
    Ash, Tom
    Robinson, Tony
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5391 - 5395
  • [32] Scaling Shrinkage-Based Language Models
    Chen, Stanley F.
    Mangu, Lidia
    Ramabhadran, Bhuvana
    Sarikaya, Ruhi
    Sethy, Abhinav
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 299 - 304
  • [33] SCALING LAWS FOR ISING MODELS NEAR T-c
    Kadanoff, Leo P.
    RESONANCE-JOURNAL OF SCIENCE EDUCATION, 2016, 21 (10): : 952 - 961
  • [34] FLOW MODELS + SCALING LAWS FOR FLOW THROUGH POROUS MEDIA
    GREENKORN, RA
    INDUSTRIAL AND ENGINEERING CHEMISTRY, 1964, 56 (03): : 32 - &
  • [35] Scaling laws for laboratory models of abnormally pressured gas reservoirs
    Weijun, Shen (wjshen763@gmail.com), 1600, E-Journal of Geotechnical Engineering (19):
  • [36] NUMERICAL-SIMULATION OF MODELS OF ORDERING - SCALING AND GROWTH LAWS
    GUNTON, JD
    GAWLINSKI, ET
    CHAKRABARTI, A
    KASKI, K
    JOURNAL OF APPLIED CRYSTALLOGRAPHY, 1988, 21 (06) : 811 - 817
  • [37] Scaling laws and similarity models for the preliminary design of multirotor drones
    Budinger, M.
    Reysset, A.
    Ochotorena, A.
    Delbecq, S.
    AEROSPACE SCIENCE AND TECHNOLOGY, 2020, 98
  • [38] SCALING LAWS IN HIERARCHICAL-CLUSTERING MODELS WITH POISSON SUPERPOSITION
    HEGYI, S
    PHYSICS LETTERS B, 1994, 327 (1-2) : 171 - 178
  • [39] Scaling in the atmosphere: On global laws of persistence and tests of climate models
    Bunde, A
    Havlin, S
    FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY, 2003, 11 : 205 - 216
  • [40] Establishment of structural similitude for elastic models and validation of scaling laws
    Murugan Ramu
    V. Prabhu Raja
    P. R. Thyla
    KSCE Journal of Civil Engineering, 2013, 17 : 139 - 144