Parallel regressions for variable selection using GPU

被引:0
|
作者
Lauro Cássio Martins de Paula
Anderson S. Soares
Telma W. L. Soares
Arlindo R. G. Filho
Clarimar J. Coelho
Alexandre C. B. Delbem
Wellington S. Martins
机构
[1] Federal University of Goiás,
来源
Computing | 2017年 / 99卷
关键词
Multivariate calibration; Variable selection; GPU; SPA; 68W10;
D O I
暂无
中图分类号
学科分类号
摘要
This paper proposes a parallel regression formulation to reduce the computational time of variable selection algorithms. The proposed strategy can be used for several forward algorithms in order to select uncorrelated variables that contribute for a better predictive capability of the model. Our demonstration of the proposed method include the use of Successive Projections Algorithm (SPA), which is an iterative forward technique that minimizes multicollinearity. SPA is traditionally used for variable selection in the context of multivariate calibration. Nevertheless, due to the need of calculating an inverse matrix for each insertion of a new variable in the model calibration, the computational performance of the algorithm may become impractical as the matrix size increases. Based on such limitation, this paper proposes a new strategy called Parallel Regressions (PR). PR strategy was implemented in the SPA to avoid the matrix inverse calculation of original SPA in order to increase the computational performance of the algorithm. It uses a parallel computing platform called Compute Unified Device Architecture (CUDA) in order to exploit a Graphics Processing Unit, and was called SPA-PR-CUDA. For this purpose, we used a case study involving a large data set of spectral variables. The results obtained with SPA-PR-CUDA presented 37×\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times $$\end{document} times better performance compared to a traditional SPA implementation. Additionally, when compared to traditional algorithms we demonstrated that SPA-PR-CUDA may be a more viable choice for obtaining a model with a reduced prediction error value.
引用
收藏
页码:219 / 234
页数:15
相关论文
共 50 条
  • [41] Bayesian sparse seemingly unrelated regressions model with variable selection and covariance estimation via the horseshoe
    Han, Dongu
    Lim, Daeyoung
    Choi, Taeryon
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2023, 52 (03) : 676 - 714
  • [42] "Parallelized Variable Selection and Modeling based on Prediction" algorithm on GPU for Feature Selection and ADMET Model Generation
    Koneti, Geervani
    Ramamurthi, Narayanan
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 2268 - 2269
  • [43] Intercalibration of DMSP/OLS by Parallel Regressions
    Stathakis, Demetris
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2016, 13 (10) : 1420 - 1424
  • [44] Genomic selection using random regressions on known and latent environmental covariates
    Daniel J. Tolhurst
    R. Chris Gaynor
    Brian Gardunia
    John M. Hickey
    Gregor Gorjanc
    Theoretical and Applied Genetics, 2022, 135 : 3393 - 3415
  • [45] Genomic selection using random regressions on known and latent environmental covariates
    Tolhurst, Daniel J.
    Gaynor, R. Chris
    Gardunia, Brian
    Hickey, John M.
    Gorjanc, Gregor
    THEORETICAL AND APPLIED GENETICS, 2022, 135 (10) : 3393 - 3415
  • [46] RandGA: injecting randomness into parallel genetic algorithm for variable selection
    Zhang, Chun-Xia
    Wang, Guan-Wei
    Liu, Jun-Min
    JOURNAL OF APPLIED STATISTICS, 2015, 42 (03) : 630 - 647
  • [47] Illustrating Instrumental Variable Regressions Using the Career Adaptability - Job Satisfaction Relationship
    Bollmann, Gregoire
    Rouzinov, Serguei
    Berchtold, Andre
    Rossier, Jerome
    FRONTIERS IN PSYCHOLOGY, 2019, 10
  • [48] Instrumental variable and variable addition based inference in predictive regressions
    Breitung, Joerg
    Demetrescu, Matei
    JOURNAL OF ECONOMETRICS, 2015, 187 (01) : 358 - 375
  • [49] A Hybrid Approach for Optimizing Parallel Clustering Throughput using the GPU
    Gowanlock, Michael
    Rude, Cody M.
    Blair, David M.
    Li, Justin D.
    Pankratius, Victor
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (04) : 766 - 777
  • [50] On the Accelerated Convergence of Genetic Algorithm Using GPU Parallel Operations
    Li, Cheng-Chieh
    Liu, Jung-Chun
    Lin, Chu-Hsing
    Lo, Winston
    SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING 2015, 2016, 612 : 1 - 16