Leveraging Domain Knowledge for Robust Deep Reinforcement Learning in Networking

Cited by: 2
Authors
Zheng, Ying [1]
Chen, Haoyu [2]
Duan, Qingyang [2]
Lin, Lixiang [2]
Shao, Yiyang [3]
Wang, Wei [3]
Wang, Xin [1]
Xu, Yuedong [2]
Affiliations
[1] Fudan Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China
[2] Fudan Univ, Sch Informat Sci & Engn, Shanghai, Peoples R China
[3] Huawei Technol Co Ltd, Beijing, Peoples R China
DOI
10.1109/INFOCOM42981.2021.9488863
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
The past few years have witnessed a surge of interest in deep reinforcement learning (Deep RL) in computer networks. With its extraordinary ability for feature extraction, Deep RL has the potential to re-engineer the fundamental resource allocation problems in networking without relying on pre-programmed models or assumptions about dynamic environments. However, such black-box systems suffer from poor robustness, showing high performance variance and poor tail performance. In this work, we propose a unified Teacher-Student learning framework that harnesses rich domain knowledge to improve robustness. The domain-specific algorithms, less performant but more trustworthy than Deep RL, play the role of teachers that provide advice at critical states; the student neural network is steered to maximize the expected reward as usual while also mimicking the teacher's advice. The Teacher-Student method comprises three modules: the confidence check module locates wrong and risky decisions, the reward shaping module designs a new updating function to incentivize the learning of the student network, and the prioritized experience replay module makes effective use of the advised actions. We further implement our Teacher-Student framework in existing systems for video streaming (Pensieve), load balancing (DeepLB), and TCP congestion control (Aurora). Experimental results show that the proposed approach reduces the performance standard deviation of DeepLB by 37%; it improves the 90th, 95th, and 99th percentile tail performance of Pensieve by 7.6%, 8.8%, and 10.7%, respectively; and it accelerates the rate of growth of Aurora by 2x at the initial stage and achieves more stable performance in dynamic environments.
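As a rough illustration of how the three modules named in the abstract could fit together, the following Python sketch may help. It is not the authors' implementation: the abstract does not specify the confidence criterion, the shaped reward, or the replay priority, so the entropy threshold, the `bonus` term, the `advice_boost` factor, and all function names below are hypothetical choices made only for this example.

```python
import math
from typing import Optional, Sequence


def normalized_entropy(probs: Sequence[float]) -> float:
    """Shannon entropy of a policy distribution, scaled to [0, 1].

    Assumes at least two actions, so that log(len(probs)) is non-zero.
    """
    h = -sum(p * math.log(p) for p in probs if p > 0.0)
    return h / math.log(len(probs))


def confidence_check(
    student_probs: Sequence[float],
    teacher_action: int,
    threshold: float = 0.8,  # hypothetical cut-off, not taken from the paper
) -> Optional[int]:
    """Return the teacher's action if the student looks risky, else None.

    Assumption: a state is treated as "critical" when the student's policy
    is close to uniform AND its greedy action disagrees with the teacher.
    """
    student_action = max(range(len(student_probs)), key=student_probs.__getitem__)
    uncertain = normalized_entropy(student_probs) > threshold
    if uncertain and student_action != teacher_action:
        return teacher_action
    return None


def shaped_reward(
    env_reward: float,
    student_action: int,
    advised_action: Optional[int],
    bonus: float = 1.0,  # hypothetical shaping weight
) -> float:
    """Add a bonus for following advice and a penalty for ignoring it."""
    if advised_action is None:
        return env_reward
    if student_action == advised_action:
        return env_reward + bonus
    return env_reward - bonus


def replay_priority(
    td_error: float,
    advised: bool,
    advice_boost: float = 2.0,  # hypothetical multiplier for advised samples
    eps: float = 1e-3,
) -> float:
    """Priority for a replay buffer; advised transitions are sampled more often."""
    priority = abs(td_error) + eps
    return priority * advice_boost if advised else priority
```

In this sketch the teacher intervenes only when the student is both uncertain and in disagreement, which keeps the student free to exploit its own policy in states where it is confident. Other criteria (for example, a value-based risk estimate) would slot into `confidence_check` without changing the other two functions.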
Pages: 10
Related papers
50 in total
  • [1] Leveraging Domain Knowledge for Reinforcement Learning Using MMC Architectures
    Ramamurthy, Rajkumar
    Bauckhage, Christian
    Sifa, Rafet
    Schuecker, Jannis
    Wrobel, Stefan
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: DEEP LEARNING, PT II, 2019, 11728 : 595 - 607
  • [2] Routing Optimization With Deep Reinforcement Learning in Knowledge Defined Networking
    He, Qiang
    Wang, Yu
    Wang, Xingwei
    Xu, Weiqiang
    Li, Fuliang
    Yang, Kaiqi
    Ma, Lianbo
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (02) : 1444 - 1455
  • [3] Deep Reinforcement Learning Task Assignment Based on Domain Knowledge
    Liu, Jiayi
    Wang, Gang
    Guo, Xiangke
    Wang, Siyuan
    Fu, Qiang
    [J]. IEEE Access, 2022, 10 : 114402 - 114413
  • [5] Leveraging Deep Reinforcement Learning for Reaching Robotic Tasks
    Katyal, Kapil
    Wang, I-Jeng
    Burlina, Philippe
    [J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017 : 490 - 491
  • [6] Leveraging Human Guidance for Deep Reinforcement Learning Tasks
    Zhang, Ruohan
    Torabi, Faraz
    Guan, Lin
    Ballard, Dana H.
    Stone, Peter
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019 : 6339 - 6346
  • [7] Leveraging Deep Reinforcement Learning for Traffic Engineering: A Survey
    Xiao, Yang
    Liu, Jun
    Wu, Jiawei
    Ansari, Nirwan
    [J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2021, 23 (04) : 2064 - 2097
  • [8] Efficiently Mastering the Game of NoGo with Deep Reinforcement Learning Supported by Domain Knowledge
    Gao, Yifan
    Wu, Lezhou
    [J]. ELECTRONICS, 2021, 10 (13)
  • [9] Applications of Deep Reinforcement Learning in Communications and Networking: A Survey
    Luong, Nguyen Cong
    Hoang, Dinh Thai
    Gong, Shimin
    Niyato, Dusit
    Wang, Ping
    Liang, Ying-Chang
    Kim, Dong In
    [J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2019, 21 (04) : 3133 - 3174
  • [10] Leveraging Domain-expert Knowledge, Boosting and Deep Learning for Identification of Rare and Complex States
    Miao, Rebecca
    Yang, Zhenyi
    Gavrishchaka, Valeriy
    [J]. 2019 3RD INTERNATIONAL CONFERENCE ON CONTROL ENGINEERING AND ARTIFICIAL INTELLIGENCE (CCEAI 2019), 2019, 1207