R2GUESS: A Graphics Processing Unit-Based R Package for Bayesian Variable Selection Regression of Multivariate Responses

被引:0
|
作者
Liquet, Benoit [1 ,5 ,6 ]
Bottolo, Leonardo [2 ]
Campanella, Gianluca [2 ]
Richardson, Sylvia [3 ]
Chadeau-Hyam, Marc [4 ]
机构
[1] Univ Pau & Pays Adour, CNRS, UMR 5142, LMAP, Pau, France
[2] Univ London Imperial Coll Sci Technol & Med, London W2 1PG, England
[3] MRC, Biostat Unit, Cambridge CB2 2BW, England
[4] Univ London Imperial Coll Sci Technol & Med, St Marys Hosp, Dept Epidemiol & Biostat, Norfolk Pl, London W2 1PG, England
[5] Univ Pau & Pays Adour, CNRS, UMR 5142, Lab Math & Applicat, Pau, France
[6] Queensland Univ Technol, ARC Ctr Excellence Math & Stat Frontiers, Brisbane, Qld 4001, Australia
来源
JOURNAL OF STATISTICAL SOFTWARE | 2016年 / 69卷 / 02期
基金
英国工程与自然科学研究理事会;
关键词
Bayesian variable selection; OMICs data; C plus; graphics processing unit; multivariate regression; R; STOCHASTIC SEARCH; STATISTICAL-METHODS;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Technological advances in molecular biology over the past decade have given rise to high dimensional and complex datasets offering the possibility to investigate biological associations between a range of genomic features and complex phenotypes. The analysis of this novel type of data generated unprecedented computational challenges which ultimately led to the definition and implementation of computationally efficient statistical models that were able to scale to genome-wide data, including Bayesian variable selection approaches. While extensive methodological work has been carried out in this area, only few methods capable of handling hundreds of thousands of predictors were implemented and distributed. Among these we recently proposed GUESS, a computationally optimised algorithm making use of graphics processing unit capabilities, which can accommodate multiple outcomes. In this paper we propose R2GUESS, an R package wrapping the original C++ source code. In addition to providing a user-friendly interface of the original code automating its parametrisation, and data handling, R2GUESS also incorporates many features to explore the data, to extend statistical inferences from the native algorithm (e.g., effect size estimation, significance assessment), and to visualize outputs from the algorithm. We first detail the model and its parametrisation, and describe in details its optimised implementation. Based on two examples we finally illustrate its statistical performances and flexibility.
引用
收藏
页数:32
相关论文
共 14 条
  • [1] BayesSUR: An R Package for High-Dimensional Multivariate Bayesian Variable and Covariance Selection in Linear Regression
    Zhao, Zhi
    Banterle, Marco
    Bottolo, Leonardo
    Richardson, Sylvia
    Lewin, Alex
    Zucknick, Manuela
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2021, 100 (11): : 1 - 32
  • [2] BNSP: an R Package for Fitting Bayesian Semiparametric Regression Models and Variable Selection
    Papageorgiou, Georgios
    [J]. R JOURNAL, 2018, 10 (02): : 526 - 548
  • [3] FWDselect: An R Package for Variable Selection in Regression Models
    Sestelo, Marta
    Villanueva, Nora M.
    Meira-Machado, Luis
    Roca-Pardinas, Javier
    [J]. R JOURNAL, 2016, 8 (01): : 132 - 148
  • [4] Bayesian Variable Selection Regression of Multivariate Responses for Group Data
    Liquet, B.
    Mengersen, K.
    Pettitt, A. N.
    Sutton, M.
    [J]. BAYESIAN ANALYSIS, 2017, 12 (04): : 1039 - 1067
  • [5] Correction to: MicroBVS: Dirichlet-tree multinomial regression models with Bayesian variable selection - an R package
    Matthew D. Koslovsky
    Marina Vannucci
    [J]. BMC Bioinformatics, 21
  • [6] MicroBVS: Dirichlet-tree multinomial regression models with Bayesian variable selection-an R package
    Koslovsky, Matthew D.
    Vannucci, Marina
    [J]. BMC BIOINFORMATICS, 2020, 21 (01)
  • [7] GALGO: an R package for multivariate variable selection using genetic algorithms
    Trevino, V
    Falciani, F
    [J]. BIOINFORMATICS, 2006, 22 (09) : 1154 - 1156
  • [8] High-Dimensional Bayesian Clustering with Variable Selection: The R Package bclus
    Nia, Vahid Partovi
    Davison, Anthony C.
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2012, 47 (05): : 1 - 22
  • [9] Multitrait Bayesian shrinkage and variable selection models with the BGLR-R package
    Perez-Rodriguez, Paulino
    de los Campos, Gustavo
    [J]. GENETICS, 2022, 222 (01)
  • [10] clustvarsel: A Package Implementing Variable Selection for Gaussian Model-Based Clustering in R
    Scrucca, Luca
    Raftery, Adrian E.
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2018, 84 (01): : 1 - 28