Parallel regressions for variable selection using GPU

被引:0
|
作者
Lauro Cássio Martins de Paula
Anderson S. Soares
Telma W. L. Soares
Arlindo R. G. Filho
Clarimar J. Coelho
Alexandre C. B. Delbem
Wellington S. Martins
机构
[1] Federal University of Goiás,
来源
Computing | 2017年 / 99卷
关键词
Multivariate calibration; Variable selection; GPU; SPA; 68W10;
D O I
暂无
中图分类号
学科分类号
摘要
This paper proposes a parallel regression formulation to reduce the computational time of variable selection algorithms. The proposed strategy can be used for several forward algorithms in order to select uncorrelated variables that contribute for a better predictive capability of the model. Our demonstration of the proposed method include the use of Successive Projections Algorithm (SPA), which is an iterative forward technique that minimizes multicollinearity. SPA is traditionally used for variable selection in the context of multivariate calibration. Nevertheless, due to the need of calculating an inverse matrix for each insertion of a new variable in the model calibration, the computational performance of the algorithm may become impractical as the matrix size increases. Based on such limitation, this paper proposes a new strategy called Parallel Regressions (PR). PR strategy was implemented in the SPA to avoid the matrix inverse calculation of original SPA in order to increase the computational performance of the algorithm. It uses a parallel computing platform called Compute Unified Device Architecture (CUDA) in order to exploit a Graphics Processing Unit, and was called SPA-PR-CUDA. For this purpose, we used a case study involving a large data set of spectral variables. The results obtained with SPA-PR-CUDA presented 37×\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times $$\end{document} times better performance compared to a traditional SPA implementation. Additionally, when compared to traditional algorithms we demonstrated that SPA-PR-CUDA may be a more viable choice for obtaining a model with a reduced prediction error value.
引用
收藏
页码:219 / 234
页数:15
相关论文
共 50 条
  • [11] Input Variable Selection Using Parallel Processing of RBF Neural Networks
    Awad, Mohammed
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2010, 7 (01) : 6 - 13
  • [12] Parallel Document Inversion using GPU
    Jung, Sungbo
    Chang, Dar-Jen
    Park, Juw Won
    2016 RESEARCH IN ADAPTIVE AND CONVERGENT SYSTEMS, 2016, : 236 - 242
  • [13] Parallel MLEM algorithm using GPU
    Valencia-Perez, T. A.
    2017 14TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, COMPUTING SCIENCE AND AUTOMATIC CONTROL (CCE), 2017,
  • [14] The revisited knockoffs method for variable selection in L 1-penalized regressions
    Gégout-Petit, Anne
    Gueudin-Muller, Aurélie
    Karmann, Clémence
    Communications in Statistics: Simulation and Computation, 2020, : 1 - 14
  • [15] Variable selection and functional form uncertainty in cross-country growth regressions
    Salimans, Tim
    JOURNAL OF ECONOMETRICS, 2012, 171 (02) : 267 - 280
  • [16] The revisited knockoffs method for variable selection inL1-penalized regressions
    Gegout-Petit, Anne
    Gueudin-Muller, Aurelie
    Karmann, Clemence
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022, 51 (10) : 5582 - 5595
  • [17] Improving the Performance of Risk Adjustment Systems Constrained Regressions, Reinsurance, and Variable Selection
    McGuire, Thomas G.
    Zink, Anna L.
    Rose, Sherri
    AMERICAN JOURNAL OF HEALTH ECONOMICS, 2021, 7 (04) : 497 - 521
  • [18] Parallel Variable Selection for Effective Performance Prediction
    Wang, Jonathan
    Yoo, Wucherl
    Sim, Alex
    Nugent, Peter
    Wu, Kesheng
    2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 208 - 217
  • [19] Parallel Botnet Detection System by Using GPU
    Hung, Che-Lun
    Wang, Hsiao-Hsi
    2014 IEEE/ACIS 13TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2014, : 65 - 70
  • [20] Efficient Model Points Selection in Insurance by Parallel Global Optimization Using Multi CPU and Multi GPU
    Ferreiro-Ferreiro, Ana Maria
    Garcia-Rodriguez, Jose Antonio
    Souto, Luis A.
    Vazquez, Carlos
    BUSINESS & INFORMATION SYSTEMS ENGINEERING, 2020, 62 (01) : 5 - 20