Large switches or blocking multi-stage networks? An evaluation of routing strategies for datacenter fabrics

被引:4
|
作者
Chrysos, N. [1 ]
Neeser, F. [1 ]
Gusat, M. [1 ]
Minkenberg, C. [1 ]
Denzel, W. [1 ]
Basso, C. [2 ]
Rudquist, M. [2 ]
Valk, K. [2 ]
Vanderpool, B. [2 ]
机构
[1] IBM Res Zurich Syst, CH-8803 Zurich, Switzerland
[2] IBM Corp, Syst & Technol Grp, Rochester, MN 55901 USA
关键词
Data-center networks; Routing; TCP; Switches;
D O I
10.1016/j.comnet.2015.08.029
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Cloud computing clusters require efficient interconnects to deal with the increasing volume of inter-server (east west) traffic. To cope with these new traffic patterns, datacenter networks are abandoning the oversubscribed topologies of the past, and adopt fat-tree fabrics with high bisection bandwidth. However, these fabrics typically employ either single-path or coarse-grained (flow-level) multipath routing. In this paper, we characterize the waste of bandwidth due to routing inefficiencies. Our analysis, confirmed by computer simulations, demonstrates that under a randomly selected permutation the expected throughputs of d-mod-k routing and of Equal-Cost-Multi-Pathing (ECMP) (or flow-level multipath routing) (Thaler and Hopps, 2000) [1] are close to 63% and 47%, respectively. Furthermore, nearly 30% of the flows are expected to undergo an unnecessary 3-fold slowdown. In contrast, packet-level multipath routing consistently delivers full throughput to all flows, thus serving the growing demands of inter-server (east west) traffic better. Using unmodified TCP stacks, we also demonstrate that under typical traffic conditions and system configurations flow-level multi-path routing can abruptly increase the completion time of latency-critical flows by more than one order of magnitude. In contrast, packet-level multipath routing, proactively avoids in-fabric backlogs, and minimizes the flow completion time across the full range of configurations that we examine. Finally, we present the design of a cost-efficient switch node performing adaptive packet-level spraying. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:316 / 328
页数:13
相关论文
共 50 条
  • [1] BLOCKING PERFORMANCE OF MULTI-STAGE ROUTING OPTICAL PATH NETWORKS
    Le, Hai-Chau
    Hasegawa, Hiroshi
    Sato, Ken-ichi
    2014 OPTOELECTRONICS AND COMMUNICATIONS CONFERENCE AND AUSTRALIAN CONFERENCE ON OPTICAL FIBRE TECHNOLOGY (OECC/ACOFT 2014), 2014, : 362 - 364
  • [2] Unblocking Placement and Routing in Rearrangeable Multi-Stage Networks
    Morais, Caio Von Rondow
    Penha, Jeronimo
    Nacif, Jose Augusto M.
    Ferreira, Ricardo S.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2025, 37 (9-11):
  • [3] Dynamic Provisioning in Multi-stage Routing Optical Path Networks
    Cao, Son H.
    Hai-Chau Le
    Nguyen, Nhan D.
    Dang, Ngoc T.
    2015 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, MANAGEMENT AND TELECOMMUNICATIONS (COMMANTEL), 2015, : 214 - 218
  • [4] Performance evaluation of generic multi-stage interconnection networks with blocking and back-pressure mechanism
    Mohammad Amiri-Zarandi
    Farshad Safaei
    Milad Roozikhar
    The Journal of Supercomputing, 2015, 71 : 1038 - 1066
  • [5] Performance evaluation of generic multi-stage interconnection networks with blocking and back-pressure mechanism
    Amiri-Zarandi, Mohammad
    Safaei, Farshad
    Roozikhar, Milad
    JOURNAL OF SUPERCOMPUTING, 2015, 71 (03): : 1038 - 1066
  • [6] On non-blocking multi-stage switching networks with rearrangement or repacking
    Schehrer, Rudolf G.
    AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2007, 61 (07) : 423 - 432
  • [7] Large capacity optical networks applying multi-stage hetero-granular optical path routing
    Le, Hai-Chau
    Hasegawa, Hiroshi
    Sato, Ken-ichi
    OPTICAL SWITCHING AND NETWORKING, 2014, 11 : 105 - 112
  • [8] Packet Routing Method for Multi-Stage Networks Based on Reinforcement Learning
    Gao Y.
    Luo L.
    Sun G.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2022, 51 (02): : 200 - 206
  • [9] Comparative evaluation of hypermesh and multi-stage interconnection networks
    OuldKhaoua, M
    Mackenzie, LM
    Sotudeh, R
    COMPUTER JOURNAL, 1996, 39 (03): : 232 - 240
  • [10] Large multi-stage OXC
    Lelic, I
    ECOC'01: 27TH EUROPEAN CONFERENCE ON OPTICAL COMMUNICATION, VOLS 1-6, 2001, : 540 - 541