Bias in data-driven replicability analysis of univariate brain-wide association studies

Times cited: 0
Authors
Burns, Charles D. G. [1 ]
Fracasso, Alessio [1 ]
Rousselet, Guillaume A. [1 ]
Affiliations
[1] Univ Glasgow, Sch Psychol & Neurosci, Glasgow G12 8QB, Scotland
Source
SCIENTIFIC REPORTS | 2025 / Vol. 15 / Issue 01
Funding
UK Biotechnology and Biological Sciences Research Council (BBSRC);
Keywords
STATISTICAL POWER; FMRI;
DOI
10.1038/s41598-025-89257-w
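Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];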
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Recent studies have used big neuroimaging datasets to answer an important question: how many subjects are required for reproducible brain-wide association studies? These data-driven approaches could be considered a framework for testing the reproducibility of several neuroimaging models and measures. Here we test part of this framework, namely estimates of statistical errors of univariate brain-behaviour associations obtained from resampling large datasets with replacement. We demonstrate that reported estimates of statistical errors are largely a consequence of bias introduced by random effects when sampling with replacement close to the full sample size. We show that future meta-analyses can largely avoid these biases by only resampling up to 10% of the full sample size. We discuss implications that reproducing mass-univariate association studies requires tens-of-thousands of participants, urging researchers to adopt other methodological approaches.
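The resampling bias described in the abstract can be illustrated with a toy simulation. This is a minimal sketch under stated assumptions, not the authors' analysis code: it simulates two independent Gaussian variables (so the true correlation is zero), resamples each simulated dataset with replacement, and counts how often a resample looks "significant". The sample sizes, the number of repetitions, the function names, and the large-sample cutoff |r| > 1.96/sqrt(n) are all illustrative choices made here.

```python
import math
import random


def pearson_r(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def rejection_rate(n_full, n_sub, n_datasets, n_resamples, rng):
    """Share of with-replacement resamples of size n_sub that look significant.

    Every simulated dataset has a true correlation of zero, so any
    "significant" resample is a false positive.
    """
    # Assumption: large-sample normal approximation to the 5% two-sided cutoff.
    r_crit = 1.96 / math.sqrt(n_sub)
    hits = 0
    for _ in range(n_datasets):
        x = [rng.gauss(0, 1) for _ in range(n_full)]
        y = [rng.gauss(0, 1) for _ in range(n_full)]
        for _ in range(n_resamples):
            # Draw subject indices with replacement from the full dataset.
            idx = rng.choices(range(n_full), k=n_sub)
            r = pearson_r([x[i] for i in idx], [y[i] for i in idx])
            hits += abs(r) > r_crit
    return hits / (n_datasets * n_resamples)


rng = random.Random(0)  # fixed seed so the run is repeatable
rate_full = rejection_rate(300, 300, 100, 40, rng)   # resample at the full size
rate_tenth = rejection_rate(300, 30, 100, 40, rng)   # resample at 10% of it
print(f"resample size = full sample: {rate_full:.3f}")
print(f"resample size = 10% of full: {rate_tenth:.3f}")
```

Each resample is centred on the chance correlation of the full dataset it was drawn from, rather than on the true value of zero. When the resample is as large as the full dataset, that chance offset is comparable to the resampling noise, so the apparent false-positive rate should come out well above the nominal 5%. At 10% of the full size the offset is small relative to the noise, and the rate should sit much closer to nominal.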
Pages: 10
Related papers
50 in total
  • [31] Structured and Sparse Canonical Correlation Analysis as a Brain-Wide Multi-Modal Data Fusion Approach
    Mohammadi-Nejad, Ali-Reza
    Hossein-Zadeh, Gholam-Ali
    Soltanian-Zadeh, Hamid
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2017, 36 (07) : 1438 - 1448
  • [32] Novel subgroups of obesity and their association with outcomes: a data-driven cluster analysis
    Takeshita, Saki
    Nishioka, Yuichi
    Tamaki, Yuko
    Kamitani, Fumika
    Mohri, Takako
    Nakajima, Hiroki
    Kurematsu, Yukako
    Okada, Sadanori
    Myojin, Tomoya
    Noda, Tatsuya
    Imamura, Tomoaki
    Takahashi, Yutaka
    BMC PUBLIC HEALTH, 2024, 24 (01)
  • [33] Data-Driven Analysis of Stimulation Treatments Using Association Rule Mining
    Ahmadi R.
    Aminshahidy B.
    Shahrabi J.
    SPE Production and Operations, 2023, 38 (03) : 552 - 564
  • [34] Novel subgroups of obesity and their association with outcomes: a data-driven cluster analysis
    Saki Takeshita
    Yuichi Nishioka
    Yuko Tamaki
    Fumika Kamitani
    Takako Mohri
    Hiroki Nakajima
    Yukako Kurematsu
    Sadanori Okada
    Tomoya Myojin
    Tatsuya Noda
    Tomoaki Imamura
    Yutaka Takahashi
    BMC Public Health, 24
  • [35] A univariate perspective of multivariate genome-wide association analysis
    Guo, Xiaobo
    Zhu, Junxian
    Fan, Qiao
    He, Mingguang
    Wang, Xueqin
    Zhang, Heping
    GENETIC EPIDEMIOLOGY, 2018, 42 (05) : 470 - 479
  • [36] Data-driven resolvent analysis
    Herrmann, Benjamin
    Baddoo, Peter J.
    Semaan, Richard
    Brunton, Steven L.
    McKeon, Beverley J.
    JOURNAL OF FLUID MECHANICS, 2021, 918
  • [37] Data-driven analysis of speech
    Hermansky, H
    TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 10 - 18
  • [38] Improving mass-univariate analysis of neuroimaging data by modelling important unknown covariates: Application to Epigenome-Wide Association Studies
    Guillaume, Bryan
    Wang, Changqing
    Poh, Joann
    Shen, Mo Jun
    Ong, Mei Lyn
    Tan, Pei Fang
    Karnani, Neerja
    Meaney, Michael
    Qiu, Anqi
    NEUROIMAGE, 2018, 173 : 57 - 71
  • [39] The Choice-Wide Behavioral Association Study: Data-Driven Identification of Interpretable Behavioral Components
    Kastner, David
    Williams, Greer
    Holobetz, Cristopher
    Romano, Joseph
    Dayan, Peter
    NEUROPSYCHOPHARMACOLOGY, 2024, 49 : 108 - 108
  • [40] Data-Driven Nonparametric Existence and Association Problems
    Liu, Yixian
    Liang, Yingbin
    Cui, Shuguang
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (24) : 6377 - 6389