RAN Information-Assisted TCP Congestion Control Using Deep Reinforcement Learning With Reward Redistribution

Cited by: 4
Authors
Chen, Minghao [1]
Li, Rongpeng [1]
Crowcroft, Jon [2]
Wu, Jianjun [3]
Zhao, Zhifeng [4]
Zhang, Honggang [1]
Affiliations
[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
[2] Univ Cambridge, Dept Comp Sci, Cambridge CB2 1TN, England
[3] Huawei Technol Co Ltd, Shanghai 201206, Peoples R China
[4] Zhejiang Lab, Hangzhou 311121, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
Reinforcement learning; Servers; Internet; Throughput; Radio access networks; Bandwidth; 5G mobile communication; Deep reinforcement learning; congestion control; radio access network; reward redistribution; delayed feedback;
DOI
10.1109/TCOMM.2021.3123130
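Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809
Abstract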
In this paper, we propose a novel transmission control protocol (TCP) congestion control method from a cross-layer perspective and present a deep reinforcement learning (DRL)-driven method called DRL-3R (DRL for congestion control with Radio access network information and Reward Redistribution) to learn the TCP congestion control policy more effectively. In particular, we incorporate radio access network (RAN) information to capture RAN dynamics in a timely manner, and enable DRL to learn from delayed RAN information feedback that may be induced by several consecutive actions. Meanwhile, we relax the implicit assumption in previous research (that the feedback to a specific action returns one round-trip time (RTT) after the action is applied) by redistributing the rewards and thereby evaluating the merits of actions more accurately. Experimental results show that, while maintaining reasonable fairness, DRL-3R significantly outperforms classical congestion control methods (e.g., TCP Reno, Westwood, Cubic, BBR, and DRL-CC) in network utility by achieving higher throughput while reducing delay in various network environments.
Pages: 215-230
Number of pages: 16
Related papers
50 in total
  • [1] RAN Information-assisted TCP Congestion Control via DRL with Reward Redistribution
    Chen, Minghao
    Li, Rongpeng
    Zhao, Zhifeng
    Zhang, Honggang
    2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2021,
  • [2] Rainbow deep reinforcement learning for TCP congestion control
    Martins, Jean P.
    Souza, Ricardo S.
    Almeida, Igor
    Lins, Silvia
    2021 IEEE LATIN-AMERICAN CONFERENCE ON COMMUNICATIONS (LATINCOM 2021), 2021,
  • [3] TCP-Drinc: Smart Congestion Control Based on Deep Reinforcement Learning
    Xiao, Kefan
    Mao, Shiwen
    Tugnait, Jitendra K.
    IEEE ACCESS, 2019, 7 : 11892 - 11904
  • [4] An Information-Assisted Deep Reinforcement Learning Path Planning Scheme for Dynamic and Unknown Underwater Environment
    Xi, Meng
    Yang, Jiachen
    Wen, Jiabao
    Li, Zhengjian
    Lu, Wen
    Gao, Xinbo
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 842 - 853
  • [5] An Information-Assisted Deep Reinforcement Learning Path Planning Scheme for Dynamic and Unknown Underwater Environment
    Xi, Meng
    Yang, Jiachen
    Wen, Jiabao
    Li, Zhengjian
    Lu, Wen
    Gao, Xinbo
    IEEE Transactions on Neural Networks and Learning Systems, 2023, : 1 - 12
  • [6] TCP Congestion Control with Multiagent Reinforcement and Transfer Learning
    Kasi, Shahrukh Khan
    Das, Saptarshi
    Biswas, Subir
    2021 IEEE 11TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2021, : 1507 - 1513
  • [7] Deep Reinforcement Learning for RAN Optimization and Control
    Chen, Yu
    Chen, Jie
    Krishnamurthi, Ganesh
    Yang, Huijing
    Wang, Huahui
    Zhao, Wenjie
    2021 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2021,
  • [8] Rax: Deep Reinforcement Learning for Congestion Control
    Bachl, Maximilian
    Zseby, Tanja
    Fabini, Joachim
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [9] TCP Congestion Avoidance in Data Centres using Reinforcement Learning
    Hassan, Ali
    Heydari, Shahram Shah
    2021 23RD INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT 2021): ON-LINE SECURITY IN PANDEMIC ERA, 2021, : 306 - 311
  • [10] TCP Congestion Avoidance in Data Centres using Reinforcement Learning
    Hassan, Ali
    Heydari, Shahram Shah
    2022 24TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): ARTIFICIAL INTELLIGENCE TECHNOLOGIES TOWARD CYBERSECURITY, 2022, : 306 - +