Task Allocation on Layered Multiagent Systems: When Evolutionary Many-Objective Optimization Meets Deep Q-Learning

被引:29
|
作者
Li, Mincan [1 ,2 ]
Wang, Zidong [3 ]
Li, Kenli [1 ,2 ]
Liao, Xiangke [4 ]
Hone, Kate [3 ]
Liu, Xiaohui [3 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China
[2] Hunan Univ, Natl Supercomp Ctr Changsha, Changsha 410082, Peoples R China
[3] Brunel Univ London, Dept Comp Sci, Uxbridge UB8 3PH, Middx, England
[4] Natl Univ Def Technol, Collaborat Innovat Ctr High Performance Comp, Changsha 410073, Peoples R China
基金
中国国家自然科学基金; 欧盟地平线“2020”;
关键词
Deep Q-learning (DQL); evolutionary computation; many-objective optimization; multiagent systems (MAS); task allocation; NEGOTIATION; BACKPROPAGATION; ALGORITHM;
D O I
10.1109/TEVC.2021.3049131
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article is concerned with the multitask multiagent allocation problem via many-objective optimization for multiagent systems (MASs). First, a novel layered MAS model is constructed to address the multitask multiagent allocation problem that includes both the original task simplification and the many-objective allocation. In the first layer of the model, the deep Q-learning method is introduced to simplify the prioritization of the original task set. In the second layer of the model, the modified shift-based density estimation (MSDE) method is put forward to improve the conventional strength Pareto evolutionary algorithm 2 (SPEA2) in order to achieve many-objective optimization on task assignments. Then, an MSDE-SPEA2-based method is proposed to tackle the many-objective optimization problem with objectives including task allocation, makespan, agent satisfaction, resource utilization, task completion, and task waiting time. As compared with the existing allocation methods, the developed method in this article exhibits an outstanding feature that the task assignment and the task scheduling are carried out simultaneously. Finally, extensive experiments are conducted to: 1) verify the validity of the proposed model and the effectiveness of two main algorithms and 2) illustrate the optimal solution for task allocation and efficient strategy for task scheduling under different scenarios.
引用
收藏
页码:842 / 855
页数:14
相关论文
共 46 条
  • [21] Learning-driven many-objective evolutionary algorithms for satellite-ground time synchronization task planning problem
    Zhang, Zhongshan
    Chen, Yuning
    He, Lei
    Xing, Lining
    Tan, Yuejin
    SWARM AND EVOLUTIONARY COMPUTATION, 2019, 47 : 72 - 79
  • [22] A trust-aware task allocation method using deep q-learning for uncertain mobile crowdsourcing
    Sun, Yong
    Tan, Wenan
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2019, 9 (01)
  • [23] Optimization of NB-IoT Uplink Resource Allocation via Double Deep Q-Learning
    Zhong, Han
    Zhang, Runzhou
    Jin, Fan
    Ning, Lei
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, VOL. 1, 2022, 878 : 775 - 781
  • [24] Exploring near-optimal locations for bioretention systems in catchment scale using many-objective evolutionary optimization
    Hamedani, Abtin Shahrokh
    Do Lago, Cesar
    Giacomoni, Marcio H. H.
    URBAN WATER JOURNAL, 2023, : 813 - 830
  • [25] A Hybrid Many-Objective Optimization Algorithm for Task Offloading and Resource Allocation in Multi-Server Mobile Edge Computing Networks
    Zhang, Jiangjiang
    Gong, Bei
    Waqas, Muhammad
    Tu, Shanshan
    Han, Zhu
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (05) : 3101 - 3114
  • [26] Priority-Based Joint Resource Allocation With Deep Q-Learning for Heterogeneous NOMA Systems
    Rezwan, Sifat
    Choi, Wooyeol
    IEEE ACCESS, 2021, 9 : 41468 - 41481
  • [27] Analysis of thermoelastic stresses in filament wound composite pressure vessels using evolutionary deep learning and many-objective optimisation
    Vondracek, Dominik
    Padovec, Zdenek
    Mares, Tomas
    Chakraborti, Nirupam
    PHILOSOPHICAL MAGAZINE LETTERS, 2025, 105 (01)
  • [28] Deep Q-Learning for Channel Optimization in MRCP BMI Systems: A Teleoperated Robot Implementation
    Pongthanisorn, Goragod
    Capi, Genci
    IEEE ACCESS, 2024, 12 : 73769 - 73778
  • [29] Energy Efficient Routing for Wireless Mesh Networks with Directional Antennas: When Q-learning meets Ant systems
    Lahsen-Cherif, Iyad
    Zitoune, Lynda
    Veque, Veronique
    AD HOC NETWORKS, 2021, 121 (121)
  • [30] A Deep Learning Hybrid Framework Combining an Efficient Evolutionary Algorithm for Complex Many-Objective Optimization of Sustainable Triple CO2 Feed Methanol Production
    Cao, Hongtao
    Li, Yue
    Chang, Chenglin
    Zhang, Xiangping
    Yang, Ao
    Shen, Weifeng
    ACS SUSTAINABLE CHEMISTRY & ENGINEERING, 2024, 12 (17): : 6682 - 6696