The Case for Domain-Specific Networks

Cited by: 0
Authors
Abts, Dennis [1]
Kim, John [2]
Affiliations
[1] NVIDIA, Santa Clara, CA 95050 USA
[2] Korea Advanced Institute of Science and Technology (KAIST), Daejeon, South Korea
DOI
10.1109/HOTI59126.2023.00021
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Modern parallel computers are dichotomized into capacity and capability systems. Capacity systems cater to a wide range of weak-scaling workloads, using distributed parallel systems with message passing, while capability systems focus on strong-scaling workloads across a significant fraction of the machine's processing units. The interconnection network differs under these regimes, with commodity Ethernet or InfiniBand solutions typically deployed for capacity systems, while capability-class systems often necessitate tightly coupled, fine-grained communication. Systems built for AI training and inference embody traits from both classes: tight coupling and strong scaling for model parallelism, and weak scaling for data parallelism in a distributed system. Handling 100-billion-parameter large language models and trillion-token data sets presents computational challenges for current supercomputing infrastructure. This paper discusses the crucial role of the interconnection network in these large-scale systems, advocating for flexible, low-latency interconnects that can deliver high throughput at large scales with tens of thousands of endpoints. This work also emphasizes the importance of reliability and resilience in enduring long-running training workloads and demanding inference requirements of domain-specific workloads.
Pages: 49-52
Page count: 4