Parallel regressions for variable selection using GPU

被引:0
|
作者
Lauro Cássio Martins de Paula
Anderson S. Soares
Telma W. L. Soares
Arlindo R. G. Filho
Clarimar J. Coelho
Alexandre C. B. Delbem
Wellington S. Martins
机构
[1] Federal University of Goiás,
来源
Computing | 2017年 / 99卷
关键词
Multivariate calibration; Variable selection; GPU; SPA; 68W10;
D O I
暂无
中图分类号
学科分类号
摘要
This paper proposes a parallel regression formulation to reduce the computational time of variable selection algorithms. The proposed strategy can be used for several forward algorithms in order to select uncorrelated variables that contribute for a better predictive capability of the model. Our demonstration of the proposed method include the use of Successive Projections Algorithm (SPA), which is an iterative forward technique that minimizes multicollinearity. SPA is traditionally used for variable selection in the context of multivariate calibration. Nevertheless, due to the need of calculating an inverse matrix for each insertion of a new variable in the model calibration, the computational performance of the algorithm may become impractical as the matrix size increases. Based on such limitation, this paper proposes a new strategy called Parallel Regressions (PR). PR strategy was implemented in the SPA to avoid the matrix inverse calculation of original SPA in order to increase the computational performance of the algorithm. It uses a parallel computing platform called Compute Unified Device Architecture (CUDA) in order to exploit a Graphics Processing Unit, and was called SPA-PR-CUDA. For this purpose, we used a case study involving a large data set of spectral variables. The results obtained with SPA-PR-CUDA presented 37×\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times $$\end{document} times better performance compared to a traditional SPA implementation. Additionally, when compared to traditional algorithms we demonstrated that SPA-PR-CUDA may be a more viable choice for obtaining a model with a reduced prediction error value.
引用
收藏
页码:219 / 234
页数:15
相关论文
共 50 条
  • [31] Parallel Skyline Processing Using Space Pruning on GPU
    Li, Chuanwen
    Gu, Yu
    Qi, Jianzhong
    Yu, Ge
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 1074 - 1083
  • [32] A particle packing parallel geometric method using GPU
    Lucas G. O. Lopes
    Diogo T. Cintra
    William W. M. Lira
    Computational Particle Mechanics, 2021, 8 : 931 - 942
  • [33] Massively Parallel Ray Tracing Algorithm Using GPU
    Qin, Yutong
    Lin, Jianbiao
    Huang, Xiang
    2015 SCIENCE AND INFORMATION CONFERENCE (SAI), 2015, : 699 - 703
  • [34] Massively parallel palmprint identification system using GPU
    Syed Ali Tariq
    Shahzaib Iqbal
    Mubeen Ghafoor
    Imtiaz A. Taj
    Noman M. Jafri
    Saad Razzaq
    Tehseen Zia
    Cluster Computing, 2019, 22 : 7201 - 7216
  • [35] Massively parallel palmprint identification system using GPU
    Tariq, Syed Ali
    Iqbal, Shahzaib
    Ghafoor, Mubeen
    Taj, Imtiaz A.
    Jafri, Noman M.
    Razzaq, Saad
    Zia, Tehseen
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 3): : S7201 - S7216
  • [36] Variable selection methods for multiple regressions influence the parsimony of risk prediction models for cardiac surgery
    Karim, Md Nazmul
    Epi, M. Clin
    Reid, Christopher M.
    Tran, Lavinia
    Cochrane, Andrew
    Billah, Baki
    JOURNAL OF THORACIC AND CARDIOVASCULAR SURGERY, 2017, 153 (05): : 1128 - +
  • [37] A particle packing parallel geometric method using GPU
    Lopes, Lucas G. O.
    Cintra, Diogo T.
    Lira, William W. M.
    COMPUTATIONAL PARTICLE MECHANICS, 2021, 8 (04) : 931 - 942
  • [38] Parallel Graph Coloring Algorithms on the GPU Using OpenCL
    Sengupta, Shilpi
    2014 INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2014, : 353 - 357
  • [39] Parallel Nonlinear Dimensionality Reduction Using GPU Acceleration
    Tegegne, Yezihalem
    Qu, Zhonglin
    Qian, Yu
    Quang Vinh Nguyen
    DATA MINING, AUSDM 2021, 2021, 1504 : 3 - 15
  • [40] Bayesian variable selection in distributed lag models: a focus on binary quantile and count data regressions
    Dempsey, Daniel
    Wyse, Jason
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2025,