Receiver-Driven Congestion Control for InfiniBand

Cited by: 2
Authors
Zhang, Yiran [1]
Qian, Kun [2]
Ren, Fengyuan [1]
Affiliations
[1] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol BNRis, Beijing, Peoples R China
[2] Alibaba Inc, Hangzhou, Peoples R China
Keywords
InfiniBand; congestion control; MANAGEMENT;
DOI
10.1145/3472456.3472466
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
InfiniBand (IB) has become one of the most popular high-speed interconnects in High Performance Computing (HPC). The backpressure effect of credit-based link-layer flow control in IB causes congestion spreading, which increases queueing delay and hurts application completion time. IB congestion control (IB CC) is defined in the IB specification to address the congestion spreading problem. Nowadays, HPC clusters are increasingly used to run diverse workloads over a shared network infrastructure. The coexistence of message transfers from different applications poses great challenges to IB CC. In this paper, we re-examine IB CC through fine-grained experimental observations and reveal several fundamental problems. Inspired by our understanding and insights, we present a new receiver-driven congestion control for InfiniBand (RR CC). RR CC includes two key mechanisms, receiver-driven congestion identification and receiver-driven rate regulation, which together eliminate both in-network congestion and endpoint congestion in one control loop. RR CC has far fewer parameters and requires no modifications to InfiniBand switches. Evaluations show that RR CC achieves better average and tail message latency and better link utilization than IB CC under various scenarios.
Pages: 10
Related papers
50 in total
  • [21] Receiver-driven handover between Independent Networks
    Tallon, Justin
    Kibilda, Jacek
    Forde, Tim K.
    DaSilva, Luiz A.
    Doyle, Linda
    2012 IEEE INTERNATIONAL SYMPOSIUM ON DYNAMIC SPECTRUM ACCESS NETWORKS, 2012, : 276 - 277
  • [22] Improvements to the InfiniBand Congestion Control Mechanism
    Liu, Qian
    Russell, Robert D.
    Gran, Ernst Gunnar
    2016 IEEE 24TH ANNUAL SYMPOSIUM ON HIGH-PERFORMANCE INTERCONNECTS (HOTI), 2016, : 27 - 36
  • [23] Receiver-driven Flow Scheduling for Commodity Datacenters
    Khan, Aadil Zia
    Qazi, Ihsan Ayyub
    2017 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2017,
  • [24] Multilayer Joining for Receiver Driven Multicast Congestion Control
    Singh, Karan
    Yadav, Rama Shankar
    2ND INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION, CONTROL AND INFORMATION TECHNOLOGY (C3IT-2012), 2012, 4 : 151 - 157
  • [25] Low complexity adaptive error control for receiver-driven layered video multicast
    Ou, Chien-Min
    Hwang, Wen-Jyi
    Lo, Tsung-Yen
    Wei, Hui-Hsien
    JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2006, 29 (07) : 1215 - 1226
  • [26] Handling high-bandwidth traffic aggregates by receiver-driven feedback control
    Tan, CW
    Chiu, DM
    Lui, JCS
    Yau, DKY
    PROCEEDINGS OF THE 29TH ANNUAL INTERNATIONAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE, WORKSHOPS AND FAST ABSTRACTS, 2005, : 143 - 145
  • [27] Adaptive receiver-driven streaming from multiple senders
    Nazanin Magharei
    Reza Rejaie
    Multimedia Systems, 2006, 11 : 550 - 567
  • [28] ARTHost: Age-optimized Receiver-driven Transport Control Scheme in Datacenter Networks
    Zhao, Wei
    Wang, Li
    Bai, Bo
    Song, Mei
    2019 IEEE 90TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2019-FALL), 2019,
  • [29] End-to-end congestion control for InfiniBand
    Santos, JR
    Turner, Y
    Janakiraman, G
    IEEE INFOCOM 2003: THE CONFERENCE ON COMPUTER COMMUNICATIONS, VOLS 1-3, PROCEEDINGS, 2003, : 1123 - 1133
  • [30] RGBCC: A New Congestion Control Mechanism for InfiniBand
    Liu, Qian
    Russell, Robert D.
    2016 24TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP), 2016, : 91 - 100