Hierarchical Multi-Agent Multi-Armed Bandit for Resource Allocation in Multi-LEO Satellite Constellation Networks

被引:1
|
作者
Shen, Li-Hsiang [1 ]
Ho, Yun [2 ]
Peng, Kai -Ten [2 ]
Yang, Lie-Liang [3 ]
Wu, Sau-Hsuan [2 ]
Wu, Jen-Ming [4 ]
机构
[1] Univ Calif Berkeley, Calif PATH, Berkeley, CA 94720 USA
[2] Natl Yang Ming Chiao Tung Univ, Dept Elect & Elect Engn, Hsinchu, Taiwan
[3] Univ Southampton, Next Generat Wireless, Southampton, England
[4] Hon Hai Res Inst, Next Generat Commun Res Ctr, Taipei, Taiwan
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1109/VTC2023-Spring57618.2023.10200264
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Low Earth orbit (LEO) satellite constellation is capable of providing global coverage area with high-rate services in the next sixth-generation (6G) non-terrestrial network (NTN). Due to limited onboard resources of operating power, beams, and channels, resilient and efficient resource management has become compellingly imperative under complex interference cases. However, different from conventional terrestrial base stations, LEO is deployed at considerable height and under high mobility, inducing substantially long delay and interference during transmission. As a result, acquiring the accurate channel state information between LEOs and ground users is challenging. Therefore, we construct a framework with a two-way transmission under unknown channel information and no data collected at long-delay ground gateway. In this paper, we propose hierarchical multi-agent multi-armed bandit resource allocation for LEO constellation (mmRAL) by appropriately assigning available radio resources. LEOs are considered as collaborative multiple macro-agents attempting unknown trials of various actions of micro-agents of respective resources, asymptotically achieving suitable allocation with only throughput information. In simulations, we evaluate mmRAL in various cases of LEO deployment, serving numbers of users and LEOs, hardware cost and outage probability. Benefited by efficient and resilient allocation, the proposed mmRAL system is capable of operating in homogeneous or heterogeneous orbital planes or constellations, achieving the highest throughput performance compared to the existing benchmarks in open literature.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Multi-agent Multi-armed Bandit Learning for Content Caching in Edge Networks
    Su, Lina
    Zhou, Ruiting
    Wang, Ne
    Chen, Junmei
    Li, Zongpeng
    2022 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (IEEE ICWS 2022), 2022, : 11 - 16
  • [2] Intelligent Load Balancing and Resource Allocation in O-RAN: A Multi-Agent Multi-Armed Bandit Approach
    Lai, Chia-Hsiang
    Shen, Li-Hsiang
    Feng, Kai-Ten
    2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC, 2023,
  • [3] MULTI-ARMED BANDITS IN MULTI-AGENT NETWORKS
    Shahrampour, Shahin
    Rakhlin, Alexander
    Jadbabaie, Ali
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2786 - 2790
  • [4] Multi-Armed Bandit Framework for Resource Allocation in Uplink NOMA Networks
    Benamor, Amani
    Habachi, Oussama
    Kammoun, Ines
    Cances, Jean-Pierre
    2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
  • [5] A Dynamic Observation Strategy for Multi-agent Multi-armed Bandit Problem
    Madhushani, Udari
    Leonard, Naomi Ehrich
    2020 EUROPEAN CONTROL CONFERENCE (ECC 2020), 2020, : 1677 - 1682
  • [6] MULTI-ARMED BANDIT ALLOCATION INDEXES
    JONES, PW
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1989, 40 (12) : 1158 - 1159
  • [7] An Efficient Algorithm for Fair Multi-Agent Multi-Armed Bandit with Low Regret
    Jones, Matthew
    Huy Nguyen
    Thy Nguyen
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 7, 2023, : 8159 - 8167
  • [8] Cooperative Learning for Adversarial Multi-Armed Bandit on Open Multi-Agent Systems
    Nakamura, Tomoki
    Hayashi, Naoki
    Inuiguchi, Masahiro
    IEEE CONTROL SYSTEMS LETTERS, 2023, 7 : 1712 - 1717
  • [9] Decentralized Randomly Distributed Multi-agent Multi-armed Bandit with Heterogeneous Rewards
    Xu, Mengfan
    Klabjan, Diego
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [10] Decentralized Multi-Agent Multi-Armed Bandit Learning With Calibration for Multi-Cell Caching
    Xu, Xianzhe
    Tao, Meixia
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (04) : 2457 - 2472