Deep Reinforcement Learning Based Iterative Participant Selection Method for Industrial IoT Big Data Mobile Crowdsourcing

被引：1

作者：

Wang, Yan ^{[1
]}

Tian, Yun ^{[2
]}

Zhang, Xuyun ^{[3
]}

He, Xiaonan ^{[4
]}

Li, Shu ^{[5
]}

Zhu, Jia ^{[6
]}

机构：

[1] Tencent, Shenzhen, Peoples R China

[2] Shanghaitech Univ, Shanghai, Peoples R China

[3] Macquarie Univ, Sydney, NSW, Australia

[4] Baidu, Beijing, Peoples R China

[5] Nanjing Univ, Nanjing, Peoples R China

[6] Tongji Univ, Shanghai, Peoples R China

来源：

ADVANCED DATA MINING AND APPLICATIONS, ADMA 2021, PT I | 2022年 / 13087卷

关键词：

Reinforcement learning; Mobile crowdsourcing;

D O I：

10.1007/978-3-030-95405-5_19

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With the massive deployment of mobile devices, crowdsourcing has become a new service paradigm in which a task requester can proactively recruit a batch of participants with a mobile IoT device from our system for quick and accurate results. In a mobile industrial crowdsourcing platform, a large amount of data is collected, extracted information, and distributed to requesters. In an entire task process, the system receives a task, allocates some suitable participants to complete it, and collects feedback from the requesters. We present a participant selection method, which adopts an end-to-end deep neural network to iteratively update the participant selection policy. The neural network consists of three main parts: (1) task and participant ability prediction part which adopts a bag of words method to extract the semantic information of a query, (2) feature transformation part which adopts a series of linear and nonlinear transformations and (3) evaluation part which uses requesters' feedback to update the network. In addition, the policy gradient method which is proved effective in the deep reinforcement learning field is adopted to update our participant selection method with the help of requesters' feedback. Finally, we conduct an extensive performance evaluation based on the combination of real traces and a real question and answer dataset and numerical results demonstrate that our method can achieve superior performance and improve more than 150% performance gain over a baseline method.

引用

页码：258 / 272

页数：15

共 50 条

[21] Resource Allocation Method of Edge IoT Agent Based on Deep Reinforcement Learning
Zhong, Jiayong
Hu, Ke
Lv, Xiaohong
Chen, Yongtao
Gao, Jin
JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2024, 33 (05)
[22] A novel mobile robot navigation method based on deep reinforcement learning
Quan, Hao
Li, Yansheng
Zhang, Yi
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (03):
[23] Navigation method for mobile robot based on hierarchical deep reinforcement learning
Wang T.
Li A.
Song H.-L.
Liu W.
Wang M.-H.
Kongzhi yu Juece/Control and Decision, 2022, 37 (11): : 2799 - 2807
[24] Research on big data personalised recommendation model based on deep reinforcement learning
Shi H.
Shang L.
International Journal of Networking and Virtual Organisations, 2023, 28 (2-4) : 364 - 380
[25] A privacy-protected intelligent crowdsourcing application of IoT based on the reinforcement learning
Ren, Yingying
Liu, Wei
Liu, Anfeng
Wang, Tian
Li, Ang
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 127 : 56 - 69
[26] Multi-task Deep Reinforcement Learning for IoT Service Selection
Matsuoka, Hiroki
Moustafa, Ahmed
ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 3, 2022, : 548 - 554
[27] Cell Selection with Deep Reinforcement Learning in Sparse Mobile Crowdsensing
Wang, Leye
Liu, Wenbin
Zhang, Daqing
Wang, Yasha
Wang, En
Yang, Yongjian
2018 IEEE 38TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2018, : 1543 - 1546
[28] EdgeKE: An On-Demand Deep Learning IoT System for Cognitive Big Data on Industrial Edge Devices
Fang, Weiwei
Xue, Feng
Ding, Yi
Xiong, Naixue
Leung, Victor C. M.
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (09) : 6144 - 6152
[29] Deep Learning for IoT Big Data and Streaming Analytics: A Survey
Mohammadi, Mehdi
Al-Fuqaha, Ala
Sorour, Sameh
Guizani, Mohsen
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2018, 20 (04): : 2923 - 2960
[30] A Deep Reinforcement Learning-Based Caching Strategy for IoT Networks With Transient Data
Wu, Hongda
Nasehzadeh, Ali
Wang, Ping
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (12) : 13310 - 13319

← 1 2 3 4 5 →