Large-scale L-BFGS using MapReduce

被引:0
|
作者
Chen, Weizhu [1 ]
Wang, Zhenghao [1 ]
Zhou, Jingren [1 ]
机构
[1] Microsoft, Albuquerque, NM 87107 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
L-BFGS has been applied as an effective parameter estimation method for various machine learning algorithms since 1980s. With an increasing demand to deal with massive instances and variables, it is important to scale up and parallelize L-BFGS effectively in a distributed system. In this paper, we study the problem of parallelizing the L-BFGS algorithm in large clusters of tens of thousands of shared-nothing commodity machines. First, we show that a naive implementation of L-BFGS using Map-Reduce requires either a significant amount of memory or a large number of map-reduce steps with negative performance impact. Second, we propose a new L-BFGS algorithm, called Vector-free L-BFGS, which avoids the expensive dot product operations in the two loop recursion and greatly improves computation efficiency with a great degree of parallelism. The algorithm scales very well and enables a variety of machine learning algorithms to handle a massive number of variables over large datasets. We prove the mathematical equivalence of the new Vector-free L-BFGS and demonstrate its excellent performance and scalability using real-world machine learning problems with billions of variables in production clusters.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Medical image fusion using Transfer Learning and L-BFGS optimization algorithm
    Jiang, Jionghui
    Feng, Xi'an
    Hu, Zhiwen
    Hu, Xiaodong
    Liu, Fen
    Huang, Hui
    [J]. INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2021, 31 (04) : 2003 - 2013
  • [22] A Linearly-Convergent Stochastic L-BFGS Algorithm
    Moritz, Philipp
    Nishihara, Robert
    Jordan, Michael, I
    [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 249 - 258
  • [23] Fast B-spline curve fitting by L-BFGS
    Zheng, Wenni
    Bo, Pengbo
    Liu, Yang
    Wang, Wenping
    [J]. COMPUTER AIDED GEOMETRIC DESIGN, 2012, 29 (07) : 448 - 462
  • [24] ON THE LIMITED MEMORY BFGS METHOD FOR LARGE-SCALE OPTIMIZATION
    LIU, DC
    NOCEDAL, J
    [J]. MATHEMATICAL PROGRAMMING, 1989, 45 (03) : 503 - 528
  • [25] BFGS-ADMM for Large-Scale Distributed Optimization
    Li, Yichuan
    Gong, Yonghai
    Freris, Nikolaos M.
    Voulgaris, Petros
    Stipanovic, Dusan
    [J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 1689 - 1694
  • [26] MapReduce for Large-scale Monitor Data Analyses
    Ding, Jianwei
    Liu, Yingbo
    Zhang, Li
    Wang, Jianmin
    [J]. 2014 IEEE 13TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM), 2014, : 747 - 754
  • [27] Large-Scale Deep Belief Nets With MapReduce
    Zhang, Kunlei
    Chen, Xue-Wen
    [J]. IEEE ACCESS, 2014, 2 : 395 - 403
  • [28] MapReduce in MPI for Large-scale graph algorithms
    Plimpton, Steven J.
    Devine, Karen D.
    [J]. PARALLEL COMPUTING, 2011, 37 (09) : 610 - 632
  • [29] Large-Scale Frequent Subgraph Mining in MapReduce
    Lin, Wenqing
    Xiao, Xiaokui
    Ghinita, Gabriel
    [J]. 2014 IEEE 30TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2014, : 844 - 855
  • [30] Large-scale Neural Modeling in MapReduce and Giraph
    Yang, Shuo
    Spielman, Nicholas D.
    Jackson, Jadin C.
    Rubin, Brad S.
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT), 2014, : 556 - 561