Receiver-Driven Congestion Control for InfiniBand

被引:2
|
作者
Zhang, Yiran [1 ]
Qian, Kun [2 ]
Ren, Fengyuan [1 ]
机构
[1] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol BNRis, Beijing, Peoples R China
[2] Alibaba Inc, Hangzhou, Peoples R China
关键词
InfiniBand; congestion control; MANAGEMENT;
D O I
10.1145/3472456.3472466
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
InfiniBand (IB) has become one of the most popular high-speed interconnects in High Performance Computing (HPC). The backpressure effect of credit-based link-layer flow control in IB introduces congestion spreading, which increases queueing delay and hurts application completion time. IB congestion control (IB CC) has been defined in IB specification to address the congestion spreading problem. Nowadays, HPC clusters are increasingly being used to run diverse workloads with a shared network infrastructure. The coexistence of messages transfers of different applications imposes great challenges to IB CC. In this paper, we re-exam IB CC through fine-grained experimental observations and reveal several fundamental problems. Inspired by our understanding and insights, we present a new receiver-driven congestion control for InfiniBand (RR CC). RR CC includes two key mechanisms: receiver-driven congestion identification and receiver-driven rate regulation, which empower eliminating both in-network congestion and endpoint congestion in one control loop. RR CC has much fewer parameters and requires no modifications to InfiniBand switches. Evaluations show that RR CC achieves better average/tail message latency and link utilization than IB CC under various scenarios.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Receiver-driven congestion control for streaming video application
    Yang Xiaogang
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 1271 - 1275
  • [2] On receiver-driven congestion control for multicast streaming delivery
    Trossen, D
    PROCEEDINGS OF THE 40TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2001, : 1905 - 1910
  • [3] Receiver-Driven RDMA Congestion Control by Differentiating Congestion Types in Datacenter Networks
    Zhang, Jiao
    Shi, Jiaming
    Zhong, Xiaolong
    Wan, Zirui
    Tian, Yu
    Pan, Tian
    Huang, Tao
    2021 IEEE 29TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (ICNP 2021), 2021,
  • [4] Quality incentive assisted congestion control or receiver-driven multicast
    Johansen, Stian
    Kim, Anna N.
    Perkis, Andrew
    2007 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-14, 2007, : 1642 - 1647
  • [5] Receiver-driven Congestion Control for Content Oriented Application with Multiple Sources
    Hayamizu, Yusaku
    Yamamoto, Miki
    2015 IEEE INTERNATIONAL WORKSHOP TECHNICAL COMMITTEE ON COMMUNICATIONS QUALITY AND RELIABILITY (CQR), 2015,
  • [6] CCTCP: A Scalable Receiver-driven Congestion Control Protocol for Content Centric Networking
    Saino, Lorenzo
    Cocora, Cosmin
    Pavlou, George
    2013 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2013, : 3775 - 3780
  • [7] REN: Receiver-Driven Congestion Control Using Explicit Notification for Data Center
    Li, Zhaoyi
    Huang, Jiawei
    Hu, Jinbin
    Li, Weihe
    Zhang, Tao
    Liu, Jingling
    Wang, Jianxin
    He, Tian
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2023, 11 (02) : 1381 - 1394
  • [8] Receiver-driven fair congestion control for TCP outcast in data center networks
    Huang, Jiawei
    Li, Shuping
    Han, Rui
    Wang, Jianxin
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2019, 131 : 75 - 88
  • [9] RCC: Enabling Receiver-Driven RDMA Congestion Control With Congestion Divide-and-Conquer in Datacenter Networks
    Zhang, Jiao
    Zhong, Xiaolong
    Wan, Zirui
    Tian, Yu
    Pan, Tian
    Huang, Tao
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2023, 31 (01) : 103 - 117
  • [10] Polo: Receiver-Driven Congestion Control for Low Latency over Commodity Network Fabric
    Ruan, Chang
    Wang, Jianxin
    Jiang, Wanchun
    Zhang, Tao
    PROCEEDINGS OF THE 49TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2020, 2020,