Balancing Performance and Cost for Two-Hop Cooperative Communications: Stackelberg Game and Distributed Multi-Agent Reinforcement Learning

被引：0

作者：

Geng, Yuanzhe ^{[1
]}

Liu, Erwu ^{[1
]}

Ni, Wei ^{[2
]}

Wang, Rui ^{[3
]}

Liu, Yan ^{[1
]}

Xu, Hao ^{[1
]}

Cai, Chen ^{[4
]}

Jamalipour, Abbas ^{[5
]}

机构：

[1] Tongji Univ, Coll Elect & Informat Engn, Shanghai 201804, Peoples R China

[2] Commonwealth Sci & Ind Res Org, Data61, Marsfield, NSW 2122, Australia

[3] Tongji Univ, Coll Elect & Informat Engn, Shanghai Inst Intelligent Sci & Technol, Shanghai 201804, Peoples R China

[4] Tongji Univ, Inst Carbon Neutral, Coll Environm Sci & Engn, Shanghai 200092, Peoples R China

[5] Univ Sydney, Sch Elect & Informat Engn, Fac Engn, Sydney, NSW 2006, Australia

来源：

IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING | 2024年 / 10卷 / 06期

基金：

美国国家科学基金会;

关键词：

Relays; Games; Optimization; Cooperative communication; Costs; Channel capacity; Signal to noise ratio; power control; multi-agent reinforcement learning; Stackelberg game; DETERMINISTIC POLICY GRADIENT; RELAY SELECTION; ALLOCATION; POWER; OPTIMIZATION;

D O I：

10.1109/TCCN.2024.3400516

中图分类号：

TN [电子技术、通信技术];

学科分类号：

0809 ;

摘要：

This paper aims to balance performance and cost in a two-hop wireless cooperative communication network where the source and relays have contradictory optimization goals and make decisions in a distributed manner. This differs from most existing works that have typically assumed that source and relay nodes follow a schedule created implicitly by a central controller. We propose that the relays form an alliance in an attempt to maximize the benefit of relaying while the source aims to increase the channel capacity cost-effectively. To this end, we establish the trade problem as a Stackelberg game, and prove the existence of its equilibrium. Another important aspect is that we use multi-agent reinforcement learning (MARL) to approach the equilibrium in a situation where the instantaneous channel state information (CSI) is unavailable, and the source and relays do not have knowledge of each other's goal. A multi-agent deep deterministic policy gradient-based framework is designed, where the relay alliance and the source act as agents. Experiments demonstrate that the proposed method can obtain an acceptable performance that is close to the game-theoretic equilibrium for all players under time-invariant environments, which considerably outperforms its potential alternatives and is only about 2.9% away from the optimal solution.

引用

页码：2193 / 2208

页数：16

共 50 条

[41] Reinforcement learning of coordination in cooperative multi-agent systems
Kapetanakis, S
Kudenko, D
EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 326 - 331
[42] A reinforcement learning scheme for a multi-agent card game
Fujita, H
Matsuno, Y
Ishii, S
2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 4071 - 4078
[43] Evolutionary game theory and multi-agent reinforcement learning
Tuyls, K
Nowé, A
KNOWLEDGE ENGINEERING REVIEW, 2005, 20 (01): : 63 - 90
[44] Sharing of Energy Among Cooperative Households Using Distributed Multi-Agent Reinforcement Learning
Ebell, Niklas
Guetlein, Moritz
Pruckner, Marco
PROCEEDINGS OF 2019 IEEE PES INNOVATIVE SMART GRID TECHNOLOGIES EUROPE (ISGT-EUROPE), 2019,
[45] Multi-Agent Reinforcement Learning for a Random Access Game
Lee, Dongwoo
Zhao, Yu
Seo, Jun-Bae
Lee, Joohyun
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (08) : 9119 - 9124
[46] Multi-Agent Reinforcement Learning for Cooperative Task Offloading in Distributed Edge Cloud Computing
Ding, Shiyao
Lin, Donghui
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (05) : 936 - 945
[47] Cooperative Internet of UAVs: Distributed Trajectory Design by Multi-Agent Deep Reinforcement Learning
Hu, Jingzhi
Zhang, Hongliang
Song, Lingyang
Schober, Robert
Poor, H. Vincent
IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (11) : 6807 - 6821
[48] Distributed Cooperative Spectrum Sharing in UAV Networks Using Multi-Agent Reinforcement Learning
Shamsoshoara, Alireza
Khaledi, Mehrdad
Afghah, Fatemeh
Razi, Abolfazl
Ashdown, Jonathan
2019 16TH IEEE ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC), 2019,
[49] Stackelberg Game Theoretical Learning for Multi-Agent Formation Tracking Control
Ye, Haichuan
Zhou, Haoyu
Ma, Bei
Wu, Yongbao
Xue, Lei
39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 2205 - 2210
[50] Negotiation agent based on Deep reinforcement learning for multi-agent cooperative distributed predictive control.
Aponte-Rengifo, O.
Francisco, M.
Vega, P.
IFAC PAPERSONLINE, 2023, 56 (02): : 1496 - 1501

← 1 2 3 4 5 →