Deep Reinforcement Learning for User Association and Resource Allocation in Heterogeneous Cellular Networks

Cited by: 293
Authors
Zhao, Nan [1, 2]
Liang, Ying-Chang [2 ]
Niyato, Dusit [3 ]
Pei, Yiyang [4 ]
Wu, Minghu [5 ]
Jiang, Yunhao [5 ]
Affiliations
[1] Hubei Univ Technol, Hubei Collaborat Innovat Ctr High Efficiency Util, Wuhan 430068, Hubei, Peoples R China
[2] Univ Elect Sci & Technol China, CINC, Chengdu 611731, Sichuan, Peoples R China
[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[4] Singapore Inst Technol, Infocomm Technol Cluster, Singapore, Singapore
[5] Hubei Univ Technol, Hubei Key Lab High Efficiency Utilizat Solar Ener, Wuhan 430068, Hubei, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Heterogeneous cellular networks; user association; resource allocation; multi-agent deep reinforcement learning; ACCESS; MANAGEMENT; SELECTION; HETNETS;
DOI
10.1109/TWC.2019.2933417
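Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract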
Heterogeneous cellular networks can offload mobile traffic and reduce deployment costs, and are considered a promising technique for next-generation wireless networks. Because the joint user association and resource allocation problem is non-convex and combinatorial, obtaining an optimal strategy is challenging. In this paper, a reinforcement learning (RL) approach is proposed to maximize the long-term overall network utility while guaranteeing the quality-of-service requirements of user equipments (UEs) in the downlink of heterogeneous cellular networks. A distributed optimization method based on multi-agent RL is developed. Moreover, to handle the computational cost of the large action space, a multi-agent deep RL method is proposed. Specifically, the state, action, and reward function are defined for the UEs, and a dueling double deep Q-network (D3QN) strategy is introduced to obtain a nearly optimal policy. Through message passing, the distributed UEs can obtain the global state space with a small communication overhead. With the double-Q strategy and dueling architecture, D3QN can rapidly converge to a subgame perfect Nash equilibrium. Simulation results demonstrate that D3QN achieves better performance than other RL approaches in solving large-scale learning problems.
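The abstract names two ingredients of D3QN, the dueling architecture and the double-Q strategy, without stating their formulas. The sketch below shows the commonly used form of each, as an illustration only: the mean-advantage aggregation, the function names, and the example numbers are assumptions and are not taken from the paper.

```python
from typing import List


def dueling_q_values(state_value: float, advantages: List[float]) -> List[float]:
    """Combine a state value V(s) and per-action advantages A(s, a) into Q(s, a).

    Assumes the common mean-subtraction form:
    Q(s, a) = V(s) + A(s, a) - mean over a' of A(s, a').
    """
    mean_advantage = sum(advantages) / len(advantages)
    return [state_value + a - mean_advantage for a in advantages]


def double_q_target(
    reward: float,
    gamma: float,
    online_next_q: List[float],
    target_next_q: List[float],
    done: bool = False,
) -> float:
    """Double-Q learning target for one transition.

    The online network's Q-values select the next action; the target
    network's Q-values evaluate it.
    """
    if done:
        return reward
    best_action = max(range(len(online_next_q)), key=online_next_q.__getitem__)
    return reward + gamma * target_next_q[best_action]


if __name__ == "__main__":
    # Hypothetical numbers, chosen only to make the arithmetic easy to follow.
    print(dueling_q_values(1.0, [0.0, 2.0, 4.0]))
    print(double_q_target(1.0, 0.5, [0.1, 0.9], [4.0, 2.0]))
```

In the second call the online values pick action 1, but the target uses the target network's estimate for that action rather than the target network's own maximum. Decoupling selection from evaluation is what the double-Q strategy is generally meant to achieve.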
Pages: 5141 - 5152
Page count: 12
Related papers
50 in total
  • [31] Distributed User Association with Resource Partitioning in Heterogeneous Cellular Networks
    Tian-Qing Zhou
    Yong-Ming Huang
    Yuan Sun
    Lu-Xi Yang
    [J]. Wireless Personal Communications, 2017, 95 : 4131 - 4148
  • [32] Distributed User Association with Resource Partitioning in Heterogeneous Cellular Networks
    Zhou, Tian-Qing
    Huang, Yong-Ming
    Sun, Yuan
    Yang, Lu-Xi
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2017, 95 (04) : 4131 - 4148
  • [33] Resource Allocation for Heterogeneous Service in Green Mobile Edge Networks Using Deep Reinforcement Learning
    Sun, Si-yuan
    Zheng, Ying
    Zhou, Jun-hua
    Weng, Jiu-xing
    Wei, Yi-fei
    Wang, Xiao-jun
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2021, 15 (07) : 2496 - 2512
  • [34] Deep Reinforcement Learning Based Caching Placement and User Association for Dynamic Cellular Networks
    Wang, Yue
    Feng, Chunyan
    Zhang, Tiankui
    [J]. 2021 IEEE 32ND ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2021,
  • [35] BiLSTM Based Reinforcement Learning for Resource Allocation and User Association in LTE-U Networks
    Luo, Zhikun
    Yu, Guanding
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2020, 114 (03) : 2629 - 2641
  • [36] BiLSTM Based Reinforcement Learning for Resource Allocation and User Association in LTE-U Networks
    Zhikun Luo
    Guanding Yu
    [J]. Wireless Personal Communications, 2020, 114 : 2629 - 2641
  • [37] Combined Learning for Resource Allocation in Autonomous Heterogeneous Cellular Networks
    Chen, Xianfu
    Zhang, Honggang
    Chen, Tao
    Palicot, Jacques
    [J]. 2013 IEEE 24TH INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR, AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2013, : 1061 - 1065
  • [38] Multi-agent deep reinforcement learning for user association and resource allocation in integrated terrestrial and non-terrestrial networks
    Birabwa, Denise Joanitah
    Ramotsoela, Daniel
    Ventura, Neco
    [J]. COMPUTER NETWORKS, 2023, 231
  • [39] Deep Reinforcement Learning Based Resource Allocation with Heterogeneous QoS for Cellular V2X
    Tian, Jin
    Shi, Yan
    Tong, Xiaolu
    Chen, Shanzhi
    Zhao, Rui
    [J]. 2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
  • [40] Joint user scheduling, user association, and resource partition in heterogeneous cellular networks
    Zhou, Hao
    Ji, Yusheng
    Wang, Xiaoyan
    Zhao, Baohua
    [J]. 2014 IEEE 11TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SENSOR SYSTEMS (MASS), 2014, : 46 - 54