RAN Information-Assisted TCP Congestion Control Using Deep Reinforcement Learning With Reward Redistribution

被引：4

作者：

Chen, Minghao ^{[1
]}

Li, Rongpeng ^{[1
]}

Crowcroft, Jon ^{[2
]}

Wu, Jianjun ^{[3
]}

Zhao, Zhifeng ^{[4
]}

Zhang, Honggang ^{[1
]}

机构：

[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China

[2] Univ Cambridge, Dept Comp Sci, Cambridge CB2 1TN, England

[3] Huawei Technol Co Ltd, Shanghai 201206, Peoples R China

[4] Zhejiang Lab, Hangzhou 311121, Peoples R China

来源：

IEEE TRANSACTIONS ON COMMUNICATIONS | 2022年 / 70卷 / 01期

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

Reinforcement learning; Servers; Internet; Throughput; Radio access networks; Bandwidth; 5G mobile communication; Deep reinforcement learning; congestion control; radio access network; reward redistribution; delayed feedback;

D O I：

10.1109/TCOMM.2021.3123130

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, we aim to propose a novel transmission control protocol (TCP) congestion control method from a cross-layer-based perspective and present a deep reinforcement learning (DRL)-driven method called DRL-3R (DRL for congestion control with Radio access network information and Reward Redistribution) so as to learn the TCP congestion control policy in a superior manner. In particular, we incorporate the RAN information to timely grasp the dynamics of RAN, and empower DRL to learn from the delayed RAN information feedback potentially induced by several consecutive actions. Meanwhile, we relax the implicit assumption (that the feedback to one specific action returns at a round-trip-time (RTT) after the action is applied) in previous researches, by redistributing the rewards and evaluating the merits of actions more accurately. Experiment results show that besides maintaining a reasonable fairness, DRL-3R significantly outperforms classical congestion control methods (e.g., TCP Reno, Westwood, Cubic, BBR and DRL-CC) on network utility by achieving a higher throughput while reducing delay in various network environments.

引用

页码：215 / 230

页数：16

共 50 条

[1] RAN Information-assisted TCP Congestion Control via DRL with Reward Redistribution
Chen, Minghao
Li, Rongpeng
Zhao, Zhifeng
Zhang, Honggang
2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2021,
[2] Rainbow deep reinforcement learning for TCP congestion control
Martins, Jean P.
Souza, Ricardo S.
Almeida, Igor
Lins, Silvia
2021 IEEE LATIN-AMERICAN CONFERENCE ON COMMUNICATIONS (LATINCOM 2021), 2021,
[3] TCP-Drinc: Smart Congestion Control Based on Deep Reinforcement Learning
Xiao, Kefan
Mao, Shiwen
Tugnait, Jitendra K.
IEEE ACCESS, 2019, 7 : 11892 - 11904
[4] An Information-Assisted Deep Reinforcement Learning Path Planning Scheme for Dynamic and Unknown Underwater Environment
Xi, Meng
Yang, Jiachen
Wen, Jiabao
Li, Zhengjian
Lu, Wen
Gao, Xinbo
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 842 - 853
[5] An Information-Assisted Deep Reinforcement Learning Path Planning Scheme for Dynamic and Unknown Underwater Environment
Xi, Meng
Yang, Jiachen
Wen, Jiabao
Li, Zhengjian
Lu, Wen
Gao, Xinbo
IEEE Transactions on Neural Networks and Learning Systems, 2023, : 1 - 12
[6] TCP Congestion Control with Multiagent Reinforcement and Transfer Learning
Kasi, Shahrukh Khan
Das, Saptarshi
Biswas, Subir
2021 IEEE 11TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2021, : 1507 - 1513
[7] Deep Reinforcement Learning for RAN Optimization and Control
Chen, Yu
Chen, Jie
Krishnamurthi, Ganesh
Yang, Huijing
Wang, Huahui
Zhao, Wenjie
2021 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2021,
[8] Rax: Deep Reinforcement Learning for Congestion Control
Bachl, Maximilian
Zseby, Tanja
Fabini, Joachim
ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
[9] TCP Congestion Avoidance in Data Centres using Reinforcement Learning
Hassan, Ali
Heydari, Shahram Shah
2021 23RD INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT 2021): ON-LINE SECURITY IN PANDEMIC ERA, 2021, : 306 - 311
[10] TCP Congestion Avoidance in Data Centres using Reinforcement Learning
Hassan, Ali
Heydari, Shahram Shah
2022 24TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): ARITIFLCIAL INTELLIGENCE TECHNOLOGIES TOWARD CYBERSECURITY, 2022, : 306 - +

← 1 2 3 4 5 →