Deep Reinforcement Learning Based Iterative Participant Selection Method for Industrial IoT Big Data Mobile Crowdsourcing

被引:1
|
作者
Wang, Yan [1 ]
Tian, Yun [2 ]
Zhang, Xuyun [3 ]
He, Xiaonan [4 ]
Li, Shu [5 ]
Zhu, Jia [6 ]
机构
[1] Tencent, Shenzhen, Peoples R China
[2] Shanghaitech Univ, Shanghai, Peoples R China
[3] Macquarie Univ, Sydney, NSW, Australia
[4] Baidu, Beijing, Peoples R China
[5] Nanjing Univ, Nanjing, Peoples R China
[6] Tongji Univ, Shanghai, Peoples R China
关键词
Reinforcement learning; Mobile crowdsourcing;
D O I
10.1007/978-3-030-95405-5_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the massive deployment of mobile devices, crowdsourcing has become a new service paradigm in which a task requester can proactively recruit a batch of participants with a mobile IoT device from our system for quick and accurate results. In a mobile industrial crowdsourcing platform, a large amount of data is collected, extracted information, and distributed to requesters. In an entire task process, the system receives a task, allocates some suitable participants to complete it, and collects feedback from the requesters. We present a participant selection method, which adopts an end-to-end deep neural network to iteratively update the participant selection policy. The neural network consists of three main parts: (1) task and participant ability prediction part which adopts a bag of words method to extract the semantic information of a query, (2) feature transformation part which adopts a series of linear and nonlinear transformations and (3) evaluation part which uses requesters' feedback to update the network. In addition, the policy gradient method which is proved effective in the deep reinforcement learning field is adopted to update our participant selection method with the help of requesters' feedback. Finally, we conduct an extensive performance evaluation based on the combination of real traces and a real question and answer dataset and numerical results demonstrate that our method can achieve superior performance and improve more than 150% performance gain over a baseline method.
引用
收藏
页码:258 / 272
页数:15
相关论文
共 50 条
  • [21] Resource Allocation Method of Edge IoT Agent Based on Deep Reinforcement Learning
    Zhong, Jiayong
    Hu, Ke
    Lv, Xiaohong
    Chen, Yongtao
    Gao, Jin
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2024, 33 (05)
  • [22] A novel mobile robot navigation method based on deep reinforcement learning
    Quan, Hao
    Li, Yansheng
    Zhang, Yi
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (03):
  • [23] Navigation method for mobile robot based on hierarchical deep reinforcement learning
    Wang T.
    Li A.
    Song H.-L.
    Liu W.
    Wang M.-H.
    Kongzhi yu Juece/Control and Decision, 2022, 37 (11): : 2799 - 2807
  • [24] Research on big data personalised recommendation model based on deep reinforcement learning
    Shi H.
    Shang L.
    International Journal of Networking and Virtual Organisations, 2023, 28 (2-4) : 364 - 380
  • [25] A privacy-protected intelligent crowdsourcing application of IoT based on the reinforcement learning
    Ren, Yingying
    Liu, Wei
    Liu, Anfeng
    Wang, Tian
    Li, Ang
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 127 : 56 - 69
  • [26] Multi-task Deep Reinforcement Learning for IoT Service Selection
    Matsuoka, Hiroki
    Moustafa, Ahmed
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 3, 2022, : 548 - 554
  • [27] Cell Selection with Deep Reinforcement Learning in Sparse Mobile Crowdsensing
    Wang, Leye
    Liu, Wenbin
    Zhang, Daqing
    Wang, Yasha
    Wang, En
    Yang, Yongjian
    2018 IEEE 38TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2018, : 1543 - 1546
  • [28] EdgeKE: An On-Demand Deep Learning IoT System for Cognitive Big Data on Industrial Edge Devices
    Fang, Weiwei
    Xue, Feng
    Ding, Yi
    Xiong, Naixue
    Leung, Victor C. M.
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (09) : 6144 - 6152
  • [29] Deep Learning for IoT Big Data and Streaming Analytics: A Survey
    Mohammadi, Mehdi
    Al-Fuqaha, Ala
    Sorour, Sameh
    Guizani, Mohsen
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2018, 20 (04): : 2923 - 2960
  • [30] A Deep Reinforcement Learning-Based Caching Strategy for IoT Networks With Transient Data
    Wu, Hongda
    Nasehzadeh, Ali
    Wang, Ping
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (12) : 13310 - 13319