Using Deep Reinforcement Learning to Improve Sensor Selection in the Internet of Things

被引:1
|
作者
Rashtian, Hootan [1 ]
Gopalakrishnan, Sathish [1 ,2 ]
机构
[1] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V6T 1Z4, Canada
[2] Univ British Columbia, Peter Wall Inst Adv Studies, Vancouver, BC V6T 1Z4, Canada
来源
IEEE ACCESS | 2020年 / 8卷
基金
加拿大自然科学与工程研究理事会;
关键词
Machine learning; Internet of Things; Correlation; Production facilities; Temperature sensors; Complexity theory; Temperature distribution; asynchronous advantage actor-critic networks; soft scheduling; deep reinforcement learning; soft real-time systems; ALGORITHMS;
D O I
10.1109/ACCESS.2020.2994600
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study the problem of handling timeliness and criticality trade-off when gathering data from multiple resources in complex environments. In IoT environments, where several sensors transmitting data packets - with various criticality and timeliness, the rate of data collection could be limited due to associated costs (e.g., bandwidth limitations and energy considerations). Besides, environment complexity regarding data generation could impose additional challenges to balance criticality and timeliness when gathering data. For instance, when data packets (either regarding criticality or timeliness) of two or more sensors are correlated, or there exists temporal dependency among sensors, incorporating such patterns can expose challenges to trivial policies for data gathering. Motivated by the success of the Asynchronous Advantage Actor-Critic (A3C) approach, we first mapped vanilla A3C into our problem to compare its performance in terms of <italic>criticality-weighted deadline miss ratio</italic> to the considered baselines in multiple scenarios. We observed degradation of the A3C performance in complex scenarios. Therefore, we modified the A3C network by embedding long short term memory (LSTM) to improve performance in cases that vanilla A3C could not capture repeating patterns in data streams. Simulation results show that the modified A3C reduces the criticality-weighted deadline miss ratio from 0.3 to 0.19.
引用
下载
收藏
页码:95208 / 95222
页数:15
相关论文
共 50 条
  • [1] Reinforcement and deep reinforcement learning for wireless Internet of Things: A survey
    Frikha, Mohamed Said
    Gammar, Sonia Mettali
    Lahmadi, Abdelkader
    Andrey, Laurent
    COMPUTER COMMUNICATIONS, 2021, 178 : 98 - 113
  • [2] Deep Reinforcement Learning for Joint Channel Selection and Power Allocation in Cognitive Internet of Things
    Zheng, Weijun
    Wu, Guoqing
    Qie, Wenbo
    Zhang, Yong
    HUMAN CENTERED COMPUTING, 2019, 11956 : 683 - 692
  • [3] Deep Reinforcement Learning for Internet of Things: A Comprehensive Survey
    Chen, Wuhui
    Qiu, Xiaoyu
    Cai, Ting
    Dai, Hong-Ning
    Zheng, Zibin
    Zhang, Yan
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2021, 23 (03): : 1659 - 1692
  • [4] On deep reinforcement learning security for Industrial Internet of Things
    Liu, Xing
    Yu, Wei
    Liang, Fan
    Griffith, David
    Golmie, Nada
    COMPUTER COMMUNICATIONS, 2021, 168 : 20 - 32
  • [5] Energy Conservation for Internet of Things Tracking Applications Using Deep Reinforcement Learning
    Sultan, Salman Md
    Waleed, Muhammad
    Pyun, Jae-Young
    Um, Tai-Won
    SENSORS, 2021, 21 (09)
  • [6] Prism blockchain enabled Internet of Things with deep reinforcement learning
    Gadiraju, Divija Swetha
    Aggarwal, Vaneet
    BLOCKCHAIN-RESEARCH AND APPLICATIONS, 2024, 5 (03):
  • [7] Point of Interest recommendation for social network using the Internet of Things and deep reinforcement learning
    Wang, Shuguang
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (09) : 17428 - 17445
  • [8] Edge QoE: Computation Offloading With Deep Reinforcement Learning for Internet of Things
    Lu, Haodong
    He, Xiaoming
    Du, Miao
    Ruan, Xiukai
    Sun, Yanfei
    Wang, Kun
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (10) : 9255 - 9265
  • [9] Dynamic multiple access based on deep reinforcement learning for Internet of Things
    Liu, Xin
    Li, Zengqi
    COMPUTER COMMUNICATIONS, 2023, 210 : 331 - 341
  • [10] A Deep Reinforcement Learning-Based Caching Strategy for Internet of Things
    Nasehzadeh, Ali
    Wang, Ping
    2020 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2020, : 969 - 974