Genomic data analysis using a two stage expectation propagation algorithm for analysis of sparse Bayesian high-dimensional instrumental variables regression

被引:0
|
作者
Amini, Morteza [1 ]
机构
[1] Univ Tehran, Coll Sci, Sch Math Stat & Comp Sci, Dept Stat, POB 14155-6455, Tehran, Iran
基金
美国国家科学基金会;
关键词
Causal inference; Expectation propagation; Spike-and-slab prior; Sparse instrumental variables model; GENE-EXPRESSION; SELECTION; INFERENCE; PRIORS; LASSO;
D O I
10.1080/03610918.2022.2075896
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Simultaneous analysis of gene expression data and genetic variants is highly of interest, especially when the number of gene expressions and genetic variants are both greater than the sample size. Association of both causal genes and effective SNPs makes the use of sparse modeling of such genetic data sets, highly important. The high-dimensional sparse instrumental variables models are one of such useful association models, which models the simultaneous relation of the gene expressions and genetic variants with complex traits. From a Bayesian viewpoint, the sparsity can be favored using sparsity-enforcing priors such as spike-and-slab priors. A two-stage modification of the expectation propagation (EP) algorithm is proposed and examined for approximate inference in high-dimensional sparse instrumental variables models with spike-and-slab priors. This method is an adoption of the classical two-stage least squares method, to be used with the Bayes context. A simulation study is performed to examine the performance of the methods. The proposed method is applied to analysis of the mouse obesity data.
引用
收藏
页码:2351 / 2365
页数:15
相关论文
共 50 条
  • [1] Sparse redundancy analysis of high-dimensional genetic and genomic data
    Csala, Attila
    Voorbraak, Frans P. J. M.
    Zwinderman, Aeilko H.
    Hof, Michel H.
    [J]. BIOINFORMATICS, 2017, 33 (20) : 3228 - 3234
  • [2] Integrative Sparse Bayesian Analysis of High-dimensional Multi-platform Genomic Data in Glioblastoma
    Bhadra, Anindya
    Baladandayuthapani, Veerabhadran
    [J]. 2013 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS 2013), 2013, : 1 - 4
  • [3] HYPOTHESIS TESTING IN HIGH-DIMENSIONAL INSTRUMENTAL VARIABLES REGRESSION WITH AN APPLICATION TO GENOMICS DATA
    Lu, Jiarui
    Li, Hongzhe
    [J]. STATISTICA SINICA, 2022, 32 : 613 - 633
  • [4] On sparse linear discriminant analysis algorithm for high-dimensional data classification
    Ng, Michael K.
    Liao, Li-Zhi
    Zhang, Leihong
    [J]. NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2011, 18 (02) : 223 - 235
  • [5] Bayesian variable selection with sparse and correlation priors for high-dimensional data analysis
    Aijun Yang
    Xuejun Jiang
    Lianjie Shu
    Jinguan Lin
    [J]. Computational Statistics, 2017, 32 : 127 - 143
  • [6] Bayesian variable selection with sparse and correlation priors for high-dimensional data analysis
    Yang, Aijun
    Jiang, Xuejun
    Shu, Lianjie
    Lin, Jinguan
    [J]. COMPUTATIONAL STATISTICS, 2017, 32 (01) : 127 - 143
  • [7] Bayesian high-dimensional regression for change point analysis
    Datta, Abhirup
    Zou, Hui
    Banerjee, Sudipto
    [J]. STATISTICS AND ITS INTERFACE, 2019, 12 (02) : 253 - 264
  • [8] NETWORK-REGULARIZED HIGH-DIMENSIONAL COX REGRESSION FOR ANALYSIS OF GENOMIC DATA
    Sun, Hokeun
    Lin, Wei
    Feng, Rui
    Li, Hongzhe
    [J]. STATISTICA SINICA, 2014, 24 (03) : 1433 - 1459
  • [9] Bayesian sparse heritability analysis with high-dimensional neuroimaging phenotypes
    Zhao, Yize
    Li, Tengfei
    Zhu, Hongtu
    [J]. BIOSTATISTICS, 2022, 23 (02) : 467 - 484
  • [10] Sparse Dual of the Density Peaks Algorithm for Cluster Analysis of High-dimensional Data
    Floros, Dimitris
    Liu, Tiancheng
    Pitsianis, Nikos
    Sun, Xiaobai
    [J]. 2018 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2018,