Dynamic Resource Allocation in Systems-of-Systems Using a Heuristic-Based Interpretable Deep Reinforcement Learning

被引：1

作者：

Chen, Qiliang ^{[1
]}

Heydari, Babak ^{[2
]}

机构：

[1] Northeastern Univ, Dept Mech & Ind Engn, MultiAGent Intelligent Complex Syst MAGICS Lab, Boston, MA 02115 USA

[2] Northeastern Univ, Inst Experiential AI, Dept Mech & Ind Engn, MultiAGent Intelligent Complex Syst MAGICS Lab, Boston, MA 02115 USA

来源：

JOURNAL OF MECHANICAL DESIGN | 2022年 / 144卷 / 09期

关键词：

artificial intelligence; machine learning; systems design; systems engineering; reinforcement learning; interpretable AI; resource allocation; SOCIOTECHNICAL SYSTEMS; RADIO; DESIGN; GAME;

D O I：

10.1115/1.4055057

中图分类号：

TH [机械、仪表工业];

学科分类号：

0802 ;

摘要：

Systems-of-systems (SoS) often include multiple agents that interact in both cooperative and competitive modes. Moreover, they involve multiple resources, including energy, information, and bandwidth. If these resources are limited, agents need to decide how to share resources cooperatively to reach the system-level goal, while performing the tasks assigned to them autonomously. This paper takes a step toward addressing these challenges by proposing a dynamic two-tier learning framework, based on deep reinforcement learning that enables dynamic resource allocation while acknowledging the autonomy of systems constituents. The two-tier learning framework that decouples the learning process of the SoS constituents from that of the resource manager ensures that the autonomy and learning of the SoS constituents are not compromised as a result of interventions executed by the resource manager. We apply the proposed two-tier learning framework on a customized OpenAI Gym environment and compare the results of the proposed framework to baseline methods of resource allocation to show the superior performance of the two-tier learning scheme across a different set of SoS key parameters. We then use the results of this experiment and apply our heuristic inference method to interpret the decisions of the resource manager for a range of environment and agent parameters.

引用

页数：14

共 50 条

[31] Resource allocation algorithm for MEC based on Deep Reinforcement Learning
Wang, Yijie
Chen, Xin
Chen, Ying
Du, Shougang
2021 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE (IPCCC), 2021,
[32] Deep reinforcement learning for resource allocation of mobile communication systems with device-to-device underlay
de Freitas Cardoso, Gabriel Pimenta
Portela de Carvalho, Paulo Henrique
de Lira Gondim, Paulo Roberto
INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2023, 38 (01)
[33] Deep Reinforcement Learning-Assisted Optimization for Resource Allocation in Downlink OFDMA Cooperative Systems
Tefera, Mulugeta Kassaw
Zhang, Shengbing
Jin, Zengwang
ENTROPY, 2023, 25 (03)
[34] Resource Allocation in THz-NOMA-Enabled HAP Systems: A Deep Reinforcement Learning Approach
Le, Mai
Pham, Quoc-Viet
Vinh Do, Quang
Han, Zhu
Hwang, Won-Joo
IEEE Transactions on Consumer Electronics, 2024, 70 (04) : 6808 - 6816
[35] Dynamic Clustering and Resource Allocation Using Deep Reinforcement Learning for Smart-Duplex Networks
Wang, Dan
Huang, Chuan
Zhang, Han
Jiang, Shengpei
Shi, Guowei
Li, Tengfei
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (01) : 42 - 56
[36] Vehicular Fog Resource Allocation Approach for VANETs Based on Deep Adaptive Reinforcement Learning Combined with Heuristic Information
Cheng, Yunli
Vijayaraj, A.
Sree Pokkuluri, Kiran
Salehnia, Taybeh
Montazerolghaem, Ahmadreza
Rateb, Roqia
IEEE Access, 2024, 12 : 139056 - 139075
[37] A Heuristic-based Resource Allocation Approach for Parallel Execution of Interacting Tasks
Sen, Uddalok
Sarkar, Madhulina
Mukherjee, Nandini
2017 7TH IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2017, : 764 - 771
[38] Adversarial Robustness of Deep Reinforcement Learning Based Dynamic Recommender Systems
Wang, Siyu
Cao, Yuanjiang
Chen, Xiaocong
Yao, Lina
Wang, Xianzhi
Sheng, Quan Z.
FRONTIERS IN BIG DATA, 2022, 5
[39] Heuristic-Based Architecting for Autonomous Vehicle Systems
Bansal, Manpreet
Drogosch, Bradley
Monarrez, Omar Lara
Plantharan, Edwin
Nikolik, Zdravko
Weaver, Jonathan M.
INCOSE International Symposium, 2022, 32 (01) : 946 - 960
[40] Deep Reinforcement Learning-Based Resource Management in Maritime Communication Systems
Yao, Xi
Hu, Yingdong
Xu, Yicheng
Gao, Ruifeng
SENSORS, 2024, 24 (07)

← 1 2 3 4 5 →