System-of-systems approach to spatio-temporal crowdsourcing design using improved PPO algorithm based on an invalid action masking

被引:3
|
作者
Ding, Wei [1 ]
Ming, Zhenjun [1 ]
Wang, Guoxin [1 ]
Yan, Yan [1 ]
机构
[1] Beijing Inst Technol, Sch Mech Engn, 5 Zhongguancun South St, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
system-of-systems design; spation-temporal crowdsourcing; improved PPO algorithm; invalid action masking; task allocation; TASK ASSIGNMENT;
D O I
10.1016/j.knosys.2024.111381
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spatio-temporal crowdsourcing (STC) is a typical case of complex system-of-systems (SoSs) design, wherein the primary objective is to allocate real-time tasks to suitable groups of workers. Over time, the STC allocation has gradually evolved into a dynamic matching involving three distinct entities: tasks, workers, and workplaces. Aiming at addressing the problems of poor convergence, slow response and sparse actions caused by the spatial complexity and time dynamics of the STC, this paper proposes an improved proximal policy optimization algorithm based on an invalid action masking (IAM-IPPO) for the SoSs design of the STC. Initially, the ternary dynamic matching (TDM) of tasks, workers and workplaces in the STC is described. Furthermore, the STC allocation is formulated as a Markov decision process, with the corresponding definition of state space, action space, and reward mechanism. On this basis, an invalid action masking (IAM) method is mainly introduced to update the policy-based network of proximal policy optimization (PPO), realizing sampling only from valid actions to masking invalid action selection. Subsequently, the algorithmic framework of IAM-IPPO is elaborated upon, and the model is trained to generate an effective allocation scheme. Comparative experiments are conducted on authentic datasets, aiming to assess performance indicators of the presented approach. The findings demonstrate a substantial enhancement in performance for the IAM-IPPO algorithm compared to other baselines, which is helpful in exploring excellent design schemes of the crowdsourcing SoSs, especially in dynamic large-scale cases.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] System-of-systems approach to spatio-temporal crowdsourcing design using improved PPO algorithm based on an invalid action masking
    Ding, Wei
    Ming, Zhenjun
    Wang, Guoxin
    Yan, Yan
    Knowledge-Based Systems, 2024, 285
  • [2] A task allocation algorithm based on reinforcement learning in spatio-temporal crowdsourcing
    Bingxu Zhao
    Hongbin Dong
    Yingjie Wang
    Tingwei Pan
    Applied Intelligence, 2023, 53 : 13452 - 13469
  • [3] A task allocation algorithm based on reinforcement learning in spatio-temporal crowdsourcing
    Zhao, Bingxu
    Dong, Hongbin
    Wang, Yingjie
    Pan, Tingwei
    APPLIED INTELLIGENCE, 2023, 53 (11) : 13452 - 13469
  • [4] STFCM: Spatio-Temporal Clustering Algorithm Based on Improved FCM
    Wang, Ling
    Gui, Lingpeng
    Liu, Wei
    Zhang, Naiwen
    2022 5TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND NATURAL LANGUAGE PROCESSING, MLNLP 2022, 2022, : 94 - 98
  • [5] A Spatial Crowdsourcing Task Assignment Approach Based on Spatio-Temporal Location Prediction
    Xu T.
    Qiao S.
    Wu J.
    Han N.
    Yue K.
    Yi Y.
    Huang F.
    Yuan C.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (02): : 310 - 328
  • [6] Study of human action recognition based on improved spatio-temporal features
    Ji X.-F.
    Wu Q.-Q.
    Ju Z.-J.
    Wang Y.-Y.
    International Journal of Automation and Computing, 2014, 11 (05) : 500 - 509
  • [7] Study of Human Action Recognition Based on Improved Spatio-temporal Features
    Xiao-Fei Ji
    Qian-Qian Wu
    Zhao-Jie Ju
    Yang-Yang Wang
    International Journal of Automation and Computing, 2014, (05) : 500 - 509
  • [8] Graph-based approach for human action recognition using spatio-temporal features
    Ben Aoun, Najib
    Mejdoub, Mahmoud
    Ben Amar, Chokri
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2014, 25 (02) : 329 - 338
  • [9] Action recognition using spatio-temporal regularity based features
    Goodhart, Taylor
    Yan, Pingkun
    Shah, Mubarak
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 745 - 748
  • [10] Human Action Recognition Algorithm Based on Spatio-Temporal Interactive Attention Model
    Pan Na
    Jiang Min
    Kong Jun
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (18)