Lightweight Automatic ECN Tuning Based on Deep Reinforcement Learning With Ultra-Low Overhead in Datacenter Networks

被引:1
|
作者
Hu, Jinbin [1 ,2 ]
Zhou, Zikai [1 ]
Zhang, Jin [1 ]
机构
[1] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Key Lab Broadband Wireless Commun & Sensor Network, Minist Educ, Nanjing 210049, Peoples R China
基金
中国国家自然科学基金;
关键词
Tuning; Degradation; Throughput; Heuristic algorithms; Topology; Servers; Network topology; Datacenter network; ECN; congestion control; deep reinforcement learning; SCHEME;
D O I
10.1109/TNSM.2024.3450596
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In modern datacenter networks (DCNs), mainstream congestion control (CC) mechanisms essentially rely on Explicit Congestion Notification (ECN) to reflect congestion. The traditional static ECN threshold performs poorly under dynamic scenarios, and setting a proper ECN threshold under various traffic patterns is challenging and time-consuming. The recently proposed reinforcement learning (RL) based ECN Tuning algorithm (ACC) consumes a large number of computational resources, making it difficult to deploy on switches. In this paper, we present a lightweight and hierarchical automated ECN tuning algorithm called LAECN, which can fully exploit the performance benefits of deep reinforcement learning with ultra-low overhead. The simulation results show that LAECN improves performance significantly by reducing latency and increasing throughput in stable network conditions, and also shows consistent high performance in small flows network environments. For example, LAECN effectively improves throughput by up to 47%, 34%, 32% and 24% over DCQCN, TIMELY, HPCC and ACC, respectively.
引用
收藏
页码:6398 / 6408
页数:11
相关论文
共 50 条
  • [1] HAECN: Hierarchical Automatic ECN Tuning with Ultra-Low Overhead in Datacenter Networks
    Hu, Jinbin
    Wang, Youyang
    Zhou, Zikai
    Rao, Shuying
    Xin, Rundong
    Wang, Jing
    He, Shiming
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT III, 2024, 14489 : 324 - 343
  • [2] ACC: Automatic ECN Tuning for High-Speed Datacenter Networks
    Yan, Siyu
    Wang, Xiaoliang
    Zheng, Xiaolong
    Xia, Yinben
    Liu, Derui
    Deng, Weishan
    SIGCOMM '21: PROCEEDINGS OF THE 2021 ACM SIGCOMM 2021 CONFERENCE, 2021, : 384 - 397
  • [3] Deep Reinforcement Learning Based Load Balancing for Heterogeneous Traffic in Datacenter Networks
    Hu, Jinbin
    Luo, Wangqing
    He, Yi
    Wang, Jing
    Zhang, Dengyong
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT III, 2024, 14489 : 270 - 289
  • [4] EagerCC: An ultra-low latency congestion control mechanism in datacenter networks
    Lu, Yuan
    Yuan, Guoyuan
    Bai, Yang
    Dong, Dezun
    Zhou, Renjie
    COMPUTER NETWORKS, 2023, 236
  • [5] A Reinforcement Learning Based Placement Strategy in Datacenter Networks
    Yang, Weihong
    Qin, Yang
    Yang, ZhaoZheng
    QUALITY, RELIABILITY, SECURITY AND ROBUSTNESS IN HETEROGENEOUS SYSTEMS, 2020, 300 : 87 - 101
  • [6] Automatic Hyperparameter Tuning in Deep Convolutional Neural Networks Using Asynchronous Reinforcement Learning
    Neary, Patrick L.
    2018 IEEE INTERNATIONAL CONFERENCE ON COGNITIVE COMPUTING (ICCC), 2018, : 73 - 77
  • [7] EMPRN: Reinforcement Learning-based ECN Tuning Using Message Passing Graph Recurrent Networks for Datacenters
    Zaeri, Kasra
    Abdelmoniem, Ahmed M.
    ICC 2024 - IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2024, : 4668 - 4673
  • [8] An Ultra-low Overhead LUT-based PUF for FPGA
    Wang, Jiadong
    Cui, Aijiao
    Li, Mengyang
    Qu, Gang
    Li, Huawei
    PROCEEDINGS OF THE 2016 IEEE ASIAN HARDWARE ORIENTED SECURITY AND TRUST SYMPOSIUM (ASIANHOST 2016), 2016,
  • [9] AuTO: Scaling Deep Reinforcement Learning for Datacenter-Scale Automatic Traffic Optimization
    Chen, Li
    Lingys, Justinas
    Chen, Kai
    Liu, Feng
    PROCEEDINGS OF THE 2018 CONFERENCE OF THE ACM SPECIAL INTEREST GROUP ON DATA COMMUNICATION (SIGCOMM '18), 2018, : 191 - 205
  • [10] RILNET: A Reinforcement Learning Based Load Balancing Approach for Datacenter Networks
    Lin, Qinliang
    Gong, Zhibo
    Wang, Qiaoling
    Li, Jinlong
    MACHINE LEARNING FOR NETWORKING, 2019, 11407 : 44 - 55