SySCD: A System-Aware Parallel Coordinate Descent Algorithm

Cited by: 0
Authors
Ioannou, Nikolas [1]
Mendler-Dünner, Celestine [1, 2]
Parnell, Thomas [1]
Affiliations
[1] IBM Res, Zurich, Switzerland
[2] Univ Calif Berkeley, Berkeley, CA USA
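Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract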
In this paper we propose a novel parallel stochastic coordinate descent (SCD) algorithm with convergence guarantees that exhibits strong scalability. We start by studying a state-of-the-art parallel implementation of SCD and identify its scalability and system-level performance bottlenecks. We then take a principled approach to develop a new SCD variant designed to avoid the identified system bottlenecks, such as limited scaling due to the coherence traffic of model sharing across threads, and inefficient CPU cache accesses. Our proposed system-aware parallel coordinate descent algorithm (SySCD) scales to many cores and across NUMA nodes, and offers a consistent bottom-line speedup in training time of up to 12× compared to an optimized asynchronous parallel SCD algorithm and up to 42× compared to state-of-the-art GLM solvers (scikit-learn, Vowpal Wabbit, and H2O) on a range of datasets and multi-core CPU architectures.
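For readers unfamiliar with the baseline, the following is a minimal sketch of plain sequential SCD, not the SySCD algorithm itself, whose details the abstract does not give. It assumes a ridge regression objective, and the names `scd_ridge`, `columns`, and `lam` are illustrative only. Every coordinate step reads and writes the residual vector `r`, the kind of shared state that a parallel implementation has to keep consistent across threads.

```python
import random


def scd_ridge(columns, y, lam=1.0, epochs=50, seed=0):
    """Sequential stochastic coordinate descent for ridge regression.

    Minimizes 0.5 * ||A w - y||^2 + 0.5 * lam * ||w||^2, where
    columns[j] is the j-th column of A stored as a list of floats.
    This is an illustrative baseline, not the SySCD algorithm.
    """
    rng = random.Random(seed)
    n_features = len(columns)
    w = [0.0] * n_features
    # Shared vector r = A w - y. With w = 0 it starts at -y.
    r = [-yi for yi in y]
    # Squared column norms are constant, so compute them once.
    sq_norms = [sum(a * a for a in col) for col in columns]
    order = list(range(n_features))
    for _ in range(epochs):
        rng.shuffle(order)  # visit coordinates in a random order each epoch
        for j in order:
            col = columns[j]
            # Partial derivative of the objective with respect to w[j].
            grad = sum(a * ri for a, ri in zip(col, r)) + lam * w[j]
            # Exact minimization along coordinate j (closed form for ridge).
            delta = -grad / (sq_norms[j] + lam)
            w[j] += delta
            # Keep the shared vector consistent with the updated model.
            for i, a in enumerate(col):
                r[i] += delta * a
    return w
```

In a multi-threaded variant of this loop, concurrent updates to `r` are one plausible source of the cache-coherence traffic that the abstract names as a bottleneck.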
Pages: 11