Using Deep Reinforcement Learning to Improve Sensor Selection in the Internet of Things

被引:1
|
作者
Rashtian, Hootan [1 ]
Gopalakrishnan, Sathish [1 ,2 ]
机构
[1] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V6T 1Z4, Canada
[2] Univ British Columbia, Peter Wall Inst Adv Studies, Vancouver, BC V6T 1Z4, Canada
来源
IEEE ACCESS | 2020年 / 8卷
基金
加拿大自然科学与工程研究理事会;
关键词
Machine learning; Internet of Things; Correlation; Production facilities; Temperature sensors; Complexity theory; Temperature distribution; asynchronous advantage actor-critic networks; soft scheduling; deep reinforcement learning; soft real-time systems; ALGORITHMS;
D O I
10.1109/ACCESS.2020.2994600
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study the problem of handling timeliness and criticality trade-off when gathering data from multiple resources in complex environments. In IoT environments, where several sensors transmitting data packets - with various criticality and timeliness, the rate of data collection could be limited due to associated costs (e.g., bandwidth limitations and energy considerations). Besides, environment complexity regarding data generation could impose additional challenges to balance criticality and timeliness when gathering data. For instance, when data packets (either regarding criticality or timeliness) of two or more sensors are correlated, or there exists temporal dependency among sensors, incorporating such patterns can expose challenges to trivial policies for data gathering. Motivated by the success of the Asynchronous Advantage Actor-Critic (A3C) approach, we first mapped vanilla A3C into our problem to compare its performance in terms of <italic>criticality-weighted deadline miss ratio</italic> to the considered baselines in multiple scenarios. We observed degradation of the A3C performance in complex scenarios. Therefore, we modified the A3C network by embedding long short term memory (LSTM) to improve performance in cases that vanilla A3C could not capture repeating patterns in data streams. Simulation results show that the modified A3C reduces the criticality-weighted deadline miss ratio from 0.3 to 0.19.
引用
下载
收藏
页码:95208 / 95222
页数:15
相关论文
共 50 条
  • [11] Caching Transient Data for Internet of Things: A Deep Reinforcement Learning Approach
    Zhu, Hao
    Cao, Yang
    Wei, Xiao
    Wang, Wei
    Jiang, Tao
    Jin, Shi
    IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (02): : 2074 - 2083
  • [12] Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges
    Lei, Lei
    Tan, Yue
    Zheng, Kan
    Liu, Shiwen
    Zhang, Kuan
    Shen, Xuemin
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2020, 22 (03): : 1722 - 1760
  • [13] Collaborative multi-agents in dynamic industrial internet of things using deep reinforcement learning
    Ali Raza
    Munam Ali Shah
    Hasan Ali Khattak
    Carsten Maple
    Fadi Al-Turjman
    Hafiz Tayyab Rauf
    Environment, Development and Sustainability, 2022, 24 : 9481 - 9499
  • [14] Energy-saving Service Offloading for the Internet of Medical Things Using Deep Reinforcement Learning
    Jiang, Jielin
    Guo, Jiajie
    Khan, Maqbool
    Cui, Yan
    Lin, Wenmin
    ACM TRANSACTIONS ON SENSOR NETWORKS, 2023, 19 (03)
  • [15] Managing Earth Hazards Using the Deep Reinforcement Learning Algorithm for the Industrial Internet of Things Network
    Liu, Weiwei
    PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2022, 88 (11): : 707 - 714
  • [16] Collaborative multi-agents in dynamic industrial internet of things using deep reinforcement learning
    Raza, Ali
    Shah, Munam Ali
    Khattak, Hasan Ali
    Maple, Carsten
    Al-Turjman, Fadi
    Rauf, Hafiz Tayyab
    ENVIRONMENT DEVELOPMENT AND SUSTAINABILITY, 2022, 24 (07) : 9481 - 9499
  • [17] USING DEEP LEARNING TECHNOLOGY FOR HEALTHCARE APPLICATIONS IN INTERNET OF THINGS SENSOR MONITORING SYSTEM
    Wu, Guanqaun
    Zeng, Desheng
    Chen, Rongli
    Zhao, Dong Min
    Ge, Dan
    Chen, Xiaozhong
    JOURNAL OF MECHANICS IN MEDICINE AND BIOLOGY, 2023, 23 (04)
  • [18] Multi-objective intelligent clustering routing schema for internet of things enabled wireless sensor networks using deep reinforcement learning
    Ghamry, Walid K.
    Shukry, Suzan
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (04): : 4941 - 4961
  • [19] Improving the Performance of ALOHA with Internet of Things Using Reinforcement Learning
    Acik, Sami
    Kosunalp, Selahattin
    Tabakcioglu, Mehmet Baris
    Iliev, Teodor
    ELECTRONICS, 2023, 12 (17)
  • [20] Deep Learning for the Internet of Things
    Yao, Shuochao
    Zhao, Yiran
    Zhang, Aston
    Hu, Shaohan
    Shao, Huajie
    Zhang, Chao
    Su, Lu
    Abdelzaher, Tarek
    COMPUTER, 2018, 51 (05) : 32 - 41