On Efficient Training of Large-Scale Deep Learning Models

被引:0
|
作者
Shen, Li [1 ]
Sun, Yan [2 ]
Yu, Zhiyuan [3 ]
Ding, Liang [4 ]
Tian, Xinmei [3 ]
Tao, Dacheng [5 ]
机构
[1] Shenzhen Campus Of Sun Yat-sen University, Shenzhen, China
[2] The University Of Sydney, Sydney, Australia
[3] University Of Science And Technology Of China, Hefei, China
[4] JD.com Inc, Beijing, China
[5] Nanyang Technological University, Singapore, Singapore
来源
ACM Computing Surveys | / 57卷 / 03期
关键词
Budget control;
D O I
10.1145/3700439
中图分类号
学科分类号
摘要
引用
收藏
相关论文
共 50 条
  • [1] Efficient Use of GPU Memory for Large-Scale Deep Learning Model Training
    Choi, Hyeonseong
    Lee, Jaehwan
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (21):
  • [2] Enabling Efficient Large-Scale Deep Learning Training with Cache Coherent Disaggregated Memory Systems
    Wang, Zixuan
    Sim, Joonseop
    Lim, Euicheol
    Zhao, Jishen
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2022), 2022, : 126 - 140
  • [3] MixPipe: Efficient Bidirectional Pipeline Parallelism for Training Large-Scale Models
    Zhang, Weigang
    Zhou, Biyu
    Tang, Xuehai
    Wang, Zhaoxing
    Hu, Songlin
    [J]. 2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [4] A Survey on Auto-Parallelism of Large-Scale Deep Learning Training
    Liang, Peng
    Tang, Yu
    Zhang, Xiaoda
    Bai, Youhui
    Su, Teng
    Lai, Zhiquan
    Qiao, Linbo
    Li, Dongsheng
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (08) : 2377 - 2390
  • [5] A Comparison of Svm With Deep Learning Models for Large-Scale Intents Analysis
    Islamic, Toqeer Ali
    Jan, Salman
    Faizullah, Safiullah
    Musa, Shahrulniza
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2018, 18 (07): : 38 - 46
  • [6] Large-scale Deep Learning at Baidu
    Yu, Kai
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 2211 - 2211
  • [7] Efficient Large-Scale Structured Learning
    Branson, Steve
    Beijbom, Oscar
    Belongie, Serge
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 1806 - 1813
  • [8] Distributed Training Large-Scale Deep Architectures
    Zou, Shang-Xuan
    Chen, Chun-Yen
    Wu, Jui-Lin
    Chou, Chun-Nan
    Tsao, Chia-Chin
    Tung, Kuan-Chieh
    Lin, Ting-Wei
    Sung, Cheng-Lung
    Chang, Edward Y.
    [J]. ADVANCED DATA MINING AND APPLICATIONS, ADMA 2017, 2017, 10604 : 18 - 32
  • [9] Toward Optimally Efficient Search With Deep Learning for Large-Scale MIMO Systems
    He, Le
    He, Ke
    Fan, Lisheng
    Lei, Xianfu
    Nallanathan, Arumugam
    Karagiannidis, George K.
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2022, 70 (05) : 3157 - 3168
  • [10] Comparing models of learning and relearning in large-scale cognitive training data sets
    Aakriti Kumar
    Aaron S. Benjamin
    Andrew Heathcote
    Mark Steyvers
    [J]. npj Science of Learning, 7