SySCD: A System-Aware Parallel Coordinate Descent Algorithm

被引:0
|
作者
Ioannou, Nikolas [1 ]
Mendler-Dunner, Celestine [1 ,2 ]
Parnell, Thomas [1 ]
机构
[1] IBM Res, Zurich, Switzerland
[2] Univ Calif Berkeley, Berkeley, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a novel parallel stochastic coordinate descent (SCD) algorithm with convergence guarantees that exhibits strong scalability. We start by studying a state-of-the-art parallel implementation of SCD and identify scalability as well as system-level performance bottlenecks of the respective implementation. We then take a principled approach to develop a new SCD variant which is designed to avoid the identified system bottlenecks, such as limited scaling due to coherence traffic of model sharing across threads, and inefficient CPU cache accesses. Our proposed system-aware parallel coordinate descent algorithm (SySCD) scales to many cores and across numa nodes, and offers a consistent bottom line speedup in training time of up to x12 compared to an optimized asynchronous parallel SCD algorithm and up to x42, compared to state-of-the-art GLM solvers (scikit-learn, Vowpal Wabbit, and H2O) on a range of datasets and multi-core CPU architectures.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] A Local Block Coordinate Descent Algorithm for the CSC Model
    Zisselman, Ev
    Sulam, Jeremias
    Elad, Michael
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8200 - 8209
  • [42] Optimal sparsity allows reliable system-aware restoration of fluorescence microscopy images
    Mandracchia, Biagio
    Liu, Wenhao
    Hua, Xuanwen
    Forghani, Parvin
    Lee, Soojung
    Hou, Jessica
    Nie, Shuyi
    Xu, Chunhui
    Jia, Shu
    SCIENCE ADVANCES, 2023, 9 (35)
  • [43] Architectural scoring framework for the creation and evaluation of System-Aware Cyber Security solutions
    Jones R.A.
    Luckett B.A.
    Beling P.A.
    Horowitz B.M.
    Environment Systems and Decisions, 2013, 33 (3) : 341 - 361
  • [44] An efficient GPU-parallel coordinate descent algorithm for sparse precision matrix estimation via scaled lasso
    Lee, Seunghwan
    Kim, Sang Cheol
    Yu, Donghyeon
    COMPUTATIONAL STATISTICS, 2023, 38 (01) : 217 - 242
  • [45] An efficient GPU-parallel coordinate descent algorithm for sparse precision matrix estimation via scaled lasso
    Seunghwan Lee
    Sang Cheol Kim
    Donghyeon Yu
    Computational Statistics, 2023, 38 : 217 - 242
  • [46] Cluster-aware arrangement of the parallel coordinate plots
    Zhou, Zhiguang
    Ye, Zhifei
    Yu, Jiajun
    Chen, Weifeng
    JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2018, 46 : 43 - 52
  • [47] MaPart: An Efficient Multi-FPGA System-Aware Hypergraph Partitioning Framework
    Li, Benzheng
    Bi, Shunyang
    You, Hailong
    Qi, Zhongdong
    Guo, Guangxin
    Sun, Richard
    Zhang, Yuming
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (10) : 3212 - 3225
  • [48] PARALLEL STOCHASTIC ASYNCHRONOUS COORDINATE DESCENT: TIGHT BOUNDS ON THE POSSIBLE PARALLELISM
    Cheung, Yun Kuen
    Cole, Richard J.
    Tao, Yixin
    SIAM JOURNAL ON OPTIMIZATION, 2021, 31 (01) : 448 - 460
  • [49] Optimization in High Dimensions via Accelerated, Parallel, and Proximal Coordinate Descent
    Fercoq, Olivier
    Richtarik, Peter
    SIAM REVIEW, 2016, 58 (04) : 739 - 771
  • [50] Sparse Representation and Dictionary Learning Based on Alternating Parallel Coordinate Descent
    Tang, Zunyi
    Tamura, Toshiyo
    Ding, Shuxue
    Li, Zhenni
    2013 INTERNATIONAL JOINT CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY & UBI-MEDIA COMPUTING (ICAST-UMEDIA), 2013, : 491 - +