Multi-Agent Reinforcement Learning for Energy Harvesting Two-Hop Communications With a Partially Observable System State

被引:11
|
作者
Ortiz, Andrea [1 ]
Weber, Tobias [2 ]
Klein, Anja [1 ]
机构
[1] Tech Univ Darmstadt, Commun Engn Lab, D-64283 Darmstadt, Germany
[2] Univ Rostock, Inst Commun Engn, D-18119 Rostock, Germany
关键词
Two-hop communications; energy harvesting; decode and forward; multi-agent reinforcement learning; linear function approximation; DISTRIBUTED POWER-CONTROL; RELAY; TRANSMISSION; INFORMATION; NETWORKS;
D O I
10.1109/TGCN.2020.3026453
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
We consider an energy harvesting (EH) transmitter communicating with a receiver through an EH relay. The harvested energy is used for data transmission, including the circuit energy consumption. As in practical scenarios, the system's state, comprised by the harvested energy, battery levels, data buffer levels, and channel gains, is only partially observable by the EH nodes. Moreover, the EH nodes have only outdated knowledge regarding the channel gains for their own transmit channels. Our goal is to find distributed transmission policies aiming at maximizing the throughput. A channel predictor based on a Kalman filter is implemented in each EH node to estimate the current channel gain for its own channel. Furthermore, to overcome the partial observability of the system's state, the EH nodes cooperate with each other to obtain information about their parameters during a signaling phase. We model the problem as a Markov game and propose a multi-agent reinforcement learning algorithm to find the transmission policies. We show the trade-off between the achievable throughput and the signaling required, and provide convergence guarantees for the proposed algorithm. Results show that even when the signaling overhead is taken into account, the proposed algorithm outperforms other approaches that do not consider cooperation.
引用
收藏
页码:442 / 456
页数:15
相关论文
共 50 条
  • [1] Reinforcement Learning for Energy Harvesting Decode-and-Forward Two-Hop Communications
    Ortiz, Andrea
    Al-Shatri, Hussein
    Li, Xiang
    Weber, Tobias
    Klein, Anja
    IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2017, 1 (03): : 309 - 319
  • [2] Reinforcement learning for cooperative actions in a partially observable multi-agent system
    Taniguchi, Yuki
    Mori, Takeshi
    Ishii, Shin
    ARTIFICIAL NEURAL NETWORKS - ICANN 2007, PT 1, PROCEEDINGS, 2007, 4668 : 229 - +
  • [3] Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning
    Mao, Weichao
    Zhang, Kaiqing
    Miehling, Erik
    Basar, Tamer
    2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 6124 - 6131
  • [4] Balancing Performance and Cost for Two-Hop Cooperative Communications: Stackelberg Game and Distributed Multi-Agent Reinforcement Learning
    Geng, Yuanzhe
    Liu, Erwu
    Ni, Wei
    Wang, Rui
    Liu, Yan
    Xu, Hao
    Cai, Chen
    Jamalipour, Abbas
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2024, 10 (06) : 2193 - 2208
  • [5] A reinforcement learning scheme for a partially-observable multi-agent game
    Ishii, S
    Fujita, H
    Mitsutake, M
    Yamazaki, T
    Matsuda, J
    Matsuno, Y
    MACHINE LEARNING, 2005, 59 (1-2) : 31 - 54
  • [6] A Reinforcement Learning Scheme for a Partially-Observable Multi-Agent Game
    Shin Ishii
    Hajime Fujita
    Masaoki Mitsutake
    Tatsuya Yamazaki
    Jun Matsuda
    Yoichiro Matsuno
    Machine Learning, 2005, 59 : 31 - 54
  • [7] Throughput Maximization in Two-Hop Energy Harvesting Communications
    Ortiz, Andrea
    Al-Shatri, Hussein
    Li, Xiang
    Weber, Tobias
    Klein, Anja
    2015 12TH INTERNATIONAL SYMPOSIUM ON WIRELESS COMMUNICATION SYSTEMS (ISWCS), 2015,
  • [8] Partially Observable Multi-Agent Deep Reinforcement Learning for Cognitive Resource Management
    Yang, Ning
    Zhang, Haijun
    Berry, Randall
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [9] Multi-agent reinforcement learning algorithm to solve a partially-observable multi-agent problem in disaster response
    Lee, Hyun-Rok
    Lee, Taesik
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 291 (01) : 296 - 308
  • [10] Energy Harvesting Aware Multi-hop Routing Policy in Distributed IoT System Based on Multi-agent Reinforcement Learning
    Zhang, Wen
    Liu, Tao
    Xie, Mimi
    Li, Longzhuang
    Kar, Dulal
    Pan, Chen
    27TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2022, 2022, : 562 - 567