The Case for Domain-Specific Networks

被引:0
|
作者
Abts, Dennis [1 ]
Kim, John [2 ]
机构
[1] NVIDIA, Santa Clara, CA 95050 USA
[2] Korea Adv Inst Sci & Technol, Daejeon, South Korea
关键词
D O I
10.1109/HOTI59126.2023.00021
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Modern parallel computers are dichotomized into capacity or capability systems. Capacity systems cater to a wide range of weak scaling workloads, using distributed parallel systems with message passing while capability systems focus on strong scaling workloads across a significant fraction of the machine's processing units. The interconnection network differs under these regimes, with commodity Ethernet or Infiniband solutions typically deployed for capacity systems, while capabilityclass systems often necessitate tightly-coupled, fine-grained communication. Systems built for AI training and inference embody traits from both classes: tight coupling and strong scaling for model parallelism, and weak scaling for data parallelism in a distributed system. Handling 100-billion-parameter large-language models and trillion-token data sets presents computational challenges for current supercomputing infrastructure. This paper discusses the crucial role of the interconnection network in these large-scale systems, advocating for flexible, low-latency interconnects that can deliver high throughput at large scales with tens of thousands of endpoints. This work also emphasizes the importance of reliability and resilience in enduring long-running training workloads and demanding inference requirements of domain-specific workloads.
引用
下载
收藏
页码:49 / 52
页数:4
相关论文
共 50 条
  • [41] Democratizing Domain-Specific Computing
    Chi, Yuze
    Qiao, Weikang
    Sohrabizadeh, Atefeh
    Wang, Jie
    Cong, Jason
    COMMUNICATIONS OF THE ACM, 2023, 66 (01) : 74 - 85
  • [42] A domain-specific modeling milestone
    Jeff Gray
    Bernhard Rumpe
    Juha-Pekka Tolvanen
    Software and Systems Modeling, 2021, 20 : 917 - 918
  • [43] Domain-specific Event Abstraction
    Klessascheck, Finn
    Lichtenstein, Tom
    Meier, Martin
    Remy, Simon
    Sachs, Jan Philipp
    Pufahl, Luise
    Miotto, Riccardo
    Boettinger, Erwin
    Weske, Mathias
    24TH INTERNATIONAL CONFERENCE ON BUSINESS INFORMATION SYSTEMS (BIS): ENTERPRISE KNOWLEDGE AND DATA SPACES, 2021, : 117 - 126
  • [44] Are there domain-specific thinking skills?
    Smith, G
    JOURNAL OF PHILOSOPHY OF EDUCATION, 2002, 36 (02) : 207 - 227
  • [45] Domain-Specific Paraphrase Extraction
    Pavlick, Ellie
    Ganitkevitch, Juri
    Chan, Tsz Ping
    Yao, Xuchen
    Van Durme, Benjamin
    Callison-Burch, Chris
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 57 - 62
  • [46] Designing domain-specific processors
    Arnold, M
    Corporaal, H
    PROCEEDINGS OF THE NINTH INTERNATIONAL SYMPOSIUM ON HARDWARE/SOFTWARE CODESIGN, 2001, : 61 - 66
  • [47] Tutorials in domain-specific acquisition
    BastienToniazzo, M
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 1997, 32 (03) : 129 - 138
  • [48] Unembedding Domain-Specific Languages
    Atkey, Robert
    Lindley, Sam
    Yallop, Jeremy
    HASKELL'09: PROCEEDINGS OF THE 2009 ACM SIGPLAN HASKELL SYMPOSIUM, 2009, : 37 - 48
  • [49] A domain-specific software architecture
    Geng, GY
    Zhong, CH
    Chen, W
    1997 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT PROCESSING SYSTEMS, VOLS 1 & 2, 1997, : 1833 - 1837
  • [50] Exploring Domain-Specific Perfectionism
    McArdle, Siobhain
    JOURNAL OF PERSONALITY, 2010, 78 (02) : 493 - 508