ScaDL 2022: Fourth IPDPS Workshop on Scalable Deep Learning over Parallel and Distributed Infrastructure

被引:0
|
作者
Ardagna, Danilo [1 ]
Patterson, Stacy [2 ]
机构
[1] Politecnico di Milano, Italy
[2] Rensselaer Polytechnic Institute (RPI), United States
关键词
D O I
10.1109/IPDPSW55747.2022.00165
中图分类号
学科分类号
摘要
引用
收藏
相关论文
共 50 条
  • [41] Distributed learning of deep neural network over multiple agents
    Gupta, Otkrist
    Raskar, Ramesh
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2018, 116 : 1 - 8
  • [42] SUARA: A scalable universal allreduce communication algorithm for acceleration of parallel deep learning applications
    Nuriyev, Emin
    Manumachu, Ravi Reddy
    Aseeri, Samar
    Verma, Mahendra K.
    Lastovetsky, Alexey L.
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2024, 183
  • [43] GeePS: Scalable deep learning on distributed GPUs with a GPU-specialized parameter server
    Cui, Henggang
    Zhang, Hao
    Ganger, Gregory R.
    Gibbons, Phillip B.
    Xing, Eric P.
    PROCEEDINGS OF THE ELEVENTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS, (EUROSYS 2016), 2016,
  • [44] Preserving Near-Optimal Gradient Sparsification Cost for Scalable Distributed Deep Learning
    Yoon, Daegun
    Oh, Sangyoon
    arXiv,
  • [45] Distributed and Scalable Cooperative Formation of Unmanned Ground Vehicles Using Deep Reinforcement Learning
    Huang, Shichun
    Wang, Tao
    Tang, Yong
    Hu, Yiwen
    Xin, Gu
    Zhou, Dianle
    AEROSPACE, 2023, 10 (02)
  • [46] Scalable Blockchain-empowered Distributed Computation Offloading: A Deep Reinforcement Learning Approach
    Xu, Feng
    Zhao, Zitong
    Liu, Lei
    Yuan, Xiaoming
    Pei, Qingqi
    IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS, INFOCOM WKSHPS 2024, 2024,
  • [47] Preserving Near-Optimal Gradient Sparsification Cost for Scalable Distributed Deep Learning
    Yoon, Daegun
    Oh, Sangyoon
    2024 IEEE 24TH INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING, CCGRID 2024, 2024, : 320 - 329
  • [48] DOPpler: Parallel Measurement Infrastructure for Auto-Tuning Deep Learning Tensor Programs
    Borowiec, Damian
    Yeung, Gingfung
    Friday, Adrian
    Harper, Richard
    Garraghan, Peter
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (07) : 2208 - 2220
  • [49] A scalable modified deep reinforcement learning algorithm for serverless IoT microservice composition infrastructure in fog layer
    Khansari, Mina Emami
    Sharifian, Saeed
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 153 : 206 - 221
  • [50] A characterization of soft-error sensitivity in data-parallel and model-parallel distributed deep learning
    Rojas, Elvis
    Perez, Diego
    Meneses, Esteban
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2024, 190