SySCD: A System-Aware Parallel Coordinate Descent Algorithm

Cited by: 0
Authors
Ioannou, Nikolas [1]
Mendler-Dünner, Celestine [1, 2]
Parnell, Thomas [1]
Affiliations
[1] IBM Res, Zurich, Switzerland
[2] Univ Calif Berkeley, Berkeley, CA USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper we propose a novel parallel stochastic coordinate descent (SCD) algorithm with convergence guarantees that exhibits strong scalability. We start by studying a state-of-the-art parallel implementation of SCD and identify both scalability and system-level performance bottlenecks in that implementation. We then take a principled approach to develop a new SCD variant designed to avoid the identified system bottlenecks, such as limited scaling due to coherence traffic from sharing the model across threads, and inefficient CPU cache accesses. Our proposed system-aware parallel coordinate descent algorithm (SySCD) scales to many cores and across NUMA nodes, and offers a consistent bottom-line speedup in training time of up to 12× compared to an optimized asynchronous parallel SCD algorithm and up to 42× compared to state-of-the-art GLM solvers (scikit-learn, Vowpal Wabbit, and H2O) on a range of datasets and multi-core CPU architectures.
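For orientation, the following is a minimal sketch of plain sequential SCD, not the SySCD algorithm and not the authors' implementation. It assumes a ridge regression objective, and the names `scd_ridge`, `columns`, and `residual` are hypothetical. Every coordinate update reads and writes the shared `residual` vector; in a multi-threaded variant, a shared vector of this kind is what gives rise to the coherence traffic the abstract mentions.

```python
import random


def scd_ridge(columns, y, lam=0.1, epochs=50, seed=0):
    """Sequential stochastic coordinate descent for ridge regression.

    Minimizes 0.5 * ||X w - y||^2 + 0.5 * lam * ||w||^2, where X is given
    column-wise: columns[j] is the j-th feature column (hypothetical layout).
    """
    rng = random.Random(seed)
    n_features = len(columns)
    w = [0.0] * n_features
    # Shared vector: residual = y - X w. With w = 0 it equals y.
    residual = list(y)
    col_sq_norms = [sum(c * c for c in col) for col in columns]
    order = list(range(n_features))

    for _ in range(epochs):
        rng.shuffle(order)  # visit coordinates in a random order each epoch
        for j in order:
            col = columns[j]
            denom = col_sq_norms[j] + lam
            if denom == 0.0:
                continue  # all-zero column with no regularization: skip
            # Exact minimizer of the objective along coordinate j.
            dot = sum(c * r for c, r in zip(col, residual))
            delta = (dot - lam * w[j]) / denom
            w[j] += delta
            # Keep the shared vector consistent with the updated model.
            for i, c in enumerate(col):
                residual[i] -= delta * c
    return w


if __name__ == "__main__":
    # Toy data (made up for illustration): y = 1 * x0 + 2 * x1.
    columns = [[1.0, 0.0, 1.0], [0.0, 1.0, 1.0]]
    y = [1.0, 2.0, 3.0]
    print(scd_ridge(columns, y, lam=0.0))
```

The sketch updates one coordinate at a time in closed form, which is only possible because the ridge objective is quadratic; other GLM losses would need a different per-coordinate step.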
Pages: 11
Related papers
50 in total
  • [31] An Accelerated Random Coordinate Descent Algorithm For Compressed Sensing
    Chen, Dongfang
    Zhang, Ningtao
    INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2011), 2011, 8285
  • [32] A Distributed Coordinate Descent Algorithm for Learning Factorization Machine
    Zhao, Kankan
    Zhang, Jing
    Zhang, Liangfu
    Li, Cuiping
    Chen, Hong
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT II, 2020, 12085 : 881 - 893
  • [33] Coordinate descent algorithm for generalized group fused Lasso
    Ohishi M.
    Okamura K.
    Itoh Y.
    Wakaki H.
    Yanagihara H.
    Behaviormetrika, 2025, 52 (1) : 105 - 137
  • [34] Fixing the Convergence Problems in Parallel Asynchronous Dual Coordinate Descent
    Zhang, Huan
    Hsieh, Cho-Jui
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 619 - 628
  • [35] Coordinate majorization descent algorithm for nonconvex penalized regression
    Wang, Yanxin
    Zhu, Li
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2021, 91 (13) : 2684 - 2698
  • [36] Coordinate descent iterations in fast affine projection algorithm
    Zakharov, Y
    Albu, F
    IEEE SIGNAL PROCESSING LETTERS, 2005, 12 (05) : 353 - 356
  • [37] Local linear convergence of proximal coordinate descent algorithm
    Quentin Klopfenstein
    Quentin Bertrand
    Alexandre Gramfort
    Joseph Salmon
    Samuel Vaiter
    Optimization Letters, 2024, 18 : 135 - 154
  • [38] Fast Lasso Algorithm via Selective Coordinate Descent
    Fujiwara, Yasuhiro
    Ida, Yasutoshi
    Shiokawa, Hiroaki
    Iwamura, Sotetsu
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1561 - 1567
  • [39] Accelerated Asynchronous Greedy Coordinate Descent Algorithm for SVMs
    Gu, Bin
    Shan, Yingying
    Geng, Xiang
    Zheng, Guansheng
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2170 - 2176
  • [40] Coordinate descent iterations in pseudo affine projection algorithm
    Albu, F
    Kotropoulos, C
    2005 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), VOLS 1-6, CONFERENCE PROCEEDINGS, 2005, : 2283 - 2286