Dynamic Resource Allocation in Systems-of-Systems Using a Heuristic-Based Interpretable Deep Reinforcement Learning

被引:1
|
作者
Chen, Qiliang [1 ]
Heydari, Babak [2 ]
机构
[1] Northeastern Univ, Dept Mech & Ind Engn, MultiAGent Intelligent Complex Syst MAGICS Lab, Boston, MA 02115 USA
[2] Northeastern Univ, Inst Experiential AI, Dept Mech & Ind Engn, MultiAGent Intelligent Complex Syst MAGICS Lab, Boston, MA 02115 USA
关键词
artificial intelligence; machine learning; systems design; systems engineering; reinforcement learning; interpretable AI; resource allocation; SOCIOTECHNICAL SYSTEMS; RADIO; DESIGN; GAME;
D O I
10.1115/1.4055057
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Systems-of-systems (SoS) often include multiple agents that interact in both cooperative and competitive modes. Moreover, they involve multiple resources, including energy, information, and bandwidth. If these resources are limited, agents need to decide how to share resources cooperatively to reach the system-level goal, while performing the tasks assigned to them autonomously. This paper takes a step toward addressing these challenges by proposing a dynamic two-tier learning framework, based on deep reinforcement learning that enables dynamic resource allocation while acknowledging the autonomy of systems constituents. The two-tier learning framework that decouples the learning process of the SoS constituents from that of the resource manager ensures that the autonomy and learning of the SoS constituents are not compromised as a result of interventions executed by the resource manager. We apply the proposed two-tier learning framework on a customized OpenAI Gym environment and compare the results of the proposed framework to baseline methods of resource allocation to show the superior performance of the two-tier learning scheme across a different set of SoS key parameters. We then use the results of this experiment and apply our heuristic inference method to interpret the decisions of the resource manager for a range of environment and agent parameters.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Resource allocation algorithm for MEC based on Deep Reinforcement Learning
    Wang, Yijie
    Chen, Xin
    Chen, Ying
    Du, Shougang
    2021 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE (IPCCC), 2021,
  • [32] Deep reinforcement learning for resource allocation of mobile communication systems with device-to-device underlay
    de Freitas Cardoso, Gabriel Pimenta
    Portela de Carvalho, Paulo Henrique
    de Lira Gondim, Paulo Roberto
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2023, 38 (01)
  • [33] Deep Reinforcement Learning-Assisted Optimization for Resource Allocation in Downlink OFDMA Cooperative Systems
    Tefera, Mulugeta Kassaw
    Zhang, Shengbing
    Jin, Zengwang
    ENTROPY, 2023, 25 (03)
  • [34] Resource Allocation in THz-NOMA-Enabled HAP Systems: A Deep Reinforcement Learning Approach
    Le, Mai
    Pham, Quoc-Viet
    Vinh Do, Quang
    Han, Zhu
    Hwang, Won-Joo
    IEEE Transactions on Consumer Electronics, 2024, 70 (04) : 6808 - 6816
  • [35] Dynamic Clustering and Resource Allocation Using Deep Reinforcement Learning for Smart-Duplex Networks
    Wang, Dan
    Huang, Chuan
    Zhang, Han
    Jiang, Shengpei
    Shi, Guowei
    Li, Tengfei
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (01) : 42 - 56
  • [36] Vehicular Fog Resource Allocation Approach for VANETs Based on Deep Adaptive Reinforcement Learning Combined with Heuristic Information
    Cheng, Yunli
    Vijayaraj, A.
    Sree Pokkuluri, Kiran
    Salehnia, Taybeh
    Montazerolghaem, Ahmadreza
    Rateb, Roqia
    IEEE Access, 2024, 12 : 139056 - 139075
  • [37] A Heuristic-based Resource Allocation Approach for Parallel Execution of Interacting Tasks
    Sen, Uddalok
    Sarkar, Madhulina
    Mukherjee, Nandini
    2017 7TH IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2017, : 764 - 771
  • [38] Adversarial Robustness of Deep Reinforcement Learning Based Dynamic Recommender Systems
    Wang, Siyu
    Cao, Yuanjiang
    Chen, Xiaocong
    Yao, Lina
    Wang, Xianzhi
    Sheng, Quan Z.
    FRONTIERS IN BIG DATA, 2022, 5
  • [39] Heuristic-Based Architecting for Autonomous Vehicle Systems
    Bansal, Manpreet
    Drogosch, Bradley
    Monarrez, Omar Lara
    Plantharan, Edwin
    Nikolik, Zdravko
    Weaver, Jonathan M.
    INCOSE International Symposium, 2022, 32 (01) : 946 - 960
  • [40] Deep Reinforcement Learning-Based Resource Management in Maritime Communication Systems
    Yao, Xi
    Hu, Yingdong
    Xu, Yicheng
    Gao, Ruifeng
    SENSORS, 2024, 24 (07)