Balancing Performance and Cost for Two-Hop Cooperative Communications: Stackelberg Game and Distributed Multi-Agent Reinforcement Learning

被引:0
|
作者
Geng, Yuanzhe [1 ]
Liu, Erwu [1 ]
Ni, Wei [2 ]
Wang, Rui [3 ]
Liu, Yan [1 ]
Xu, Hao [1 ]
Cai, Chen [4 ]
Jamalipour, Abbas [5 ]
机构
[1] Tongji Univ, Coll Elect & Informat Engn, Shanghai 201804, Peoples R China
[2] Commonwealth Sci & Ind Res Org, Data61, Marsfield, NSW 2122, Australia
[3] Tongji Univ, Coll Elect & Informat Engn, Shanghai Inst Intelligent Sci & Technol, Shanghai 201804, Peoples R China
[4] Tongji Univ, Inst Carbon Neutral, Coll Environm Sci & Engn, Shanghai 200092, Peoples R China
[5] Univ Sydney, Sch Elect & Informat Engn, Fac Engn, Sydney, NSW 2006, Australia
基金
美国国家科学基金会;
关键词
Relays; Games; Optimization; Cooperative communication; Costs; Channel capacity; Signal to noise ratio; power control; multi-agent reinforcement learning; Stackelberg game; DETERMINISTIC POLICY GRADIENT; RELAY SELECTION; ALLOCATION; POWER; OPTIMIZATION;
D O I
10.1109/TCCN.2024.3400516
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
This paper aims to balance performance and cost in a two-hop wireless cooperative communication network where the source and relays have contradictory optimization goals and make decisions in a distributed manner. This differs from most existing works that have typically assumed that source and relay nodes follow a schedule created implicitly by a central controller. We propose that the relays form an alliance in an attempt to maximize the benefit of relaying while the source aims to increase the channel capacity cost-effectively. To this end, we establish the trade problem as a Stackelberg game, and prove the existence of its equilibrium. Another important aspect is that we use multi-agent reinforcement learning (MARL) to approach the equilibrium in a situation where the instantaneous channel state information (CSI) is unavailable, and the source and relays do not have knowledge of each other's goal. A multi-agent deep deterministic policy gradient-based framework is designed, where the relay alliance and the source act as agents. Experiments demonstrate that the proposed method can obtain an acceptable performance that is close to the game-theoretic equilibrium for all players under time-invariant environments, which considerably outperforms its potential alternatives and is only about 2.9% away from the optimal solution.
引用
收藏
页码:2193 / 2208
页数:16
相关论文
共 50 条
  • [41] Reinforcement learning of coordination in cooperative multi-agent systems
    Kapetanakis, S
    Kudenko, D
    EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 326 - 331
  • [42] A reinforcement learning scheme for a multi-agent card game
    Fujita, H
    Matsuno, Y
    Ishii, S
    2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 4071 - 4078
  • [43] Evolutionary game theory and multi-agent reinforcement learning
    Tuyls, K
    Nowé, A
    KNOWLEDGE ENGINEERING REVIEW, 2005, 20 (01): : 63 - 90
  • [44] Sharing of Energy Among Cooperative Households Using Distributed Multi-Agent Reinforcement Learning
    Ebell, Niklas
    Guetlein, Moritz
    Pruckner, Marco
    PROCEEDINGS OF 2019 IEEE PES INNOVATIVE SMART GRID TECHNOLOGIES EUROPE (ISGT-EUROPE), 2019,
  • [45] Multi-Agent Reinforcement Learning for a Random Access Game
    Lee, Dongwoo
    Zhao, Yu
    Seo, Jun-Bae
    Lee, Joohyun
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (08) : 9119 - 9124
  • [46] Multi-Agent Reinforcement Learning for Cooperative Task Offloading in Distributed Edge Cloud Computing
    Ding, Shiyao
    Lin, Donghui
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (05) : 936 - 945
  • [47] Cooperative Internet of UAVs: Distributed Trajectory Design by Multi-Agent Deep Reinforcement Learning
    Hu, Jingzhi
    Zhang, Hongliang
    Song, Lingyang
    Schober, Robert
    Poor, H. Vincent
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (11) : 6807 - 6821
  • [48] Distributed Cooperative Spectrum Sharing in UAV Networks Using Multi-Agent Reinforcement Learning
    Shamsoshoara, Alireza
    Khaledi, Mehrdad
    Afghah, Fatemeh
    Razi, Abolfazl
    Ashdown, Jonathan
    2019 16TH IEEE ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC), 2019,
  • [49] Stackelberg Game Theoretical Learning for Multi-Agent Formation Tracking Control
    Ye, Haichuan
    Zhou, Haoyu
    Ma, Bei
    Wu, Yongbao
    Xue, Lei
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 2205 - 2210
  • [50] Negotiation agent based on Deep reinforcement learning for multi-agent cooperative distributed predictive control.
    Aponte-Rengifo, O.
    Francisco, M.
    Vega, P.
    IFAC PAPERSONLINE, 2023, 56 (02): : 1496 - 1501