Scaling laws for language encoding models in fMRI

被引:0
|
作者
Antonello, Richard J. [1 ]
Vaidya, Aditya R. [1 ]
Huth, Alexander G. [1 ,2 ]
机构
[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
[2] Univ Texas Austin, Dept Neurosci, Austin, TX 78712 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Representations from transformer-based unidirectional language models are known to be effective at predicting brain responses to natural language. However, most studies comparing language models to brains have used GPT-2 or similarly sized language models. Here we tested whether larger open-source models such as those from the OPT and LLaMA families are better at predicting brain responses recorded using fMRI. Mirroring scaling results from other contexts, we found that brain prediction performance scales logarithmically with model size from 125M to 30B parameter models, with similar to 15% increased encoding performance as measured by correlation with a held-out test set across 3 subjects. Similar logarithmic behavior was observed when scaling the size of the fMRI training set. We also characterized scaling for acoustic encoding models that use HuBERT, WavLM, and Whisper, and we found comparable improvements with model size. A noise ceiling analysis of these large, high-performance encoding models showed that performance is nearing the theoretical maximum for brain areas such as the precuneus and higher auditory cortex. These results suggest that increasing scale in both models and data will yield incredibly effective models of language processing in the brain, enabling better scientific understanding as well as applications such as decoding.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Powerful predictions of biodiversity from ecological models and scaling laws
    Locey, Kenneth J.
    Lennon, Jay T.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (35) : E5097 - E5097
  • [42] Establishment of structural similitude for elastic models and validation of scaling laws
    Ramu, Murugan
    Raja, V. Prabhu
    Thyla, P. R.
    KSCE JOURNAL OF CIVIL ENGINEERING, 2013, 17 (01) : 139 - 144
  • [43] Encoding and decoding in fMRI
    Naselaris, Thomas
    Kay, Kendrick N.
    Nishimoto, Shinji
    Gallant, Jack L.
    NEUROIMAGE, 2011, 56 (02) : 400 - 410
  • [44] The "Narratives" fMRI dataset for evaluating models of naturalistic language comprehension
    Nastase, Samuel A.
    Liu, Yun-Fei
    Hillman, Hanna
    Zadbood, Asieh
    Hasenfratz, Liat
    Keshavarzian, Neggin
    Chen, Janice
    Honey, Christopher J.
    Yeshurun, Yaara
    Regev, Mor
    Nguyen, Mai
    Chang, Claire H. C.
    Baldassano, Christopher
    Lositsky, Olga
    Simony, Erez
    Chow, Michael A.
    Leong, Yuan Chang
    Brooks, Paula P.
    Micciche, Emily
    Choe, Gina
    Goldstein, Ariel
    Vanderwal, Tamara
    Halchenko, Yaroslav O.
    Norman, Kenneth A.
    Hasson, Uri
    SCIENTIFIC DATA, 2021, 8 (01)
  • [45] The “Narratives” fMRI dataset for evaluating models of naturalistic language comprehension
    Samuel A. Nastase
    Yun-Fei Liu
    Hanna Hillman
    Asieh Zadbood
    Liat Hasenfratz
    Neggin Keshavarzian
    Janice Chen
    Christopher J. Honey
    Yaara Yeshurun
    Mor Regev
    Mai Nguyen
    Claire H. C. Chang
    Christopher Baldassano
    Olga Lositsky
    Erez Simony
    Michael A. Chow
    Yuan Chang Leong
    Paula P. Brooks
    Emily Micciche
    Gina Choe
    Ariel Goldstein
    Tamara Vanderwal
    Yaroslav O. Halchenko
    Kenneth A. Norman
    Uri Hasson
    Scientific Data, 8
  • [46] Scaling Multilingual Corpora and Language Models to 500 Languages
    Imani, Ayyoob
    Lin, Peiqin
    Kargaran, Amir Hossein
    Severini, Silvia
    Sabet, Masoud Jalili
    Kassner, Nora
    Ma, Chunlan
    Schmid, Helmut
    Martins, Andre F. T.
    Yvon, Francois
    Schuetze, Hinrich
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 1082 - 1117
  • [47] Magnetic field and plasma scaling laws:: Their implications for coronal heating models
    Mandrini, CH
    Démoulin, P
    Klimchuk, JA
    ASTROPHYSICAL JOURNAL, 2000, 530 (02): : 999 - 1015
  • [48] HUMAN RESULTS FROM ANIMAL MODELS: SCALING LAWS FOR BLAST NEUROTRAUMA
    Panzer, Matthew B.
    Bass, Cameron R. Dale
    JOURNAL OF NEUROTRAUMA, 2012, 29 (10) : A151 - A151
  • [49] Comparison of Coarse Graining DEM Models Based on Exact Scaling Laws
    Zhang, Bin
    Huang, Yiming
    Zhao, Tingting
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2021, 127 (03): : 1133 - 1150
  • [50] Electrodynamic magnetic suspension-models, scaling laws, and experimental results
    Thompson, MT
    IEEE TRANSACTIONS ON EDUCATION, 2000, 43 (03) : 336 - 342