Using Deep Reinforcement Learning to Improve Sensor Selection in the Internet of Things

Cited by: 1
Authors
Rashtian, Hootan [1]
Gopalakrishnan, Sathish [1, 2]
Affiliations
[1] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V6T 1Z4, Canada
[2] Univ British Columbia, Peter Wall Inst Adv Studies, Vancouver, BC V6T 1Z4, Canada
Source
IEEE ACCESS | 2020 / Vol. 8
Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
Machine learning; Internet of Things; Correlation; Production facilities; Temperature sensors; Complexity theory; Temperature distribution; asynchronous advantage actor-critic networks; soft scheduling; deep reinforcement learning; soft real-time systems; ALGORITHMS;
DOI
10.1109/ACCESS.2020.2994600
Chinese Library Classification
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
We study the problem of handling the trade-off between timeliness and criticality when gathering data from multiple resources in complex environments. In IoT environments, where several sensors transmit data packets with varying criticality and timeliness, the rate of data collection may be limited by associated costs (e.g., bandwidth limitations and energy considerations). In addition, the complexity of the environment with respect to data generation can make it harder to balance criticality and timeliness when gathering data. For instance, when the data packets of two or more sensors are correlated (in either criticality or timeliness), or when there are temporal dependencies among sensors, such patterns can defeat trivial data-gathering policies. Motivated by the success of the Asynchronous Advantage Actor-Critic (A3C) approach, we first mapped vanilla A3C onto our problem and compared its performance, in terms of criticality-weighted deadline miss ratio, against the considered baselines in multiple scenarios. We observed that the performance of A3C degraded in complex scenarios. We therefore modified the A3C network by embedding long short-term memory (LSTM) to improve performance in cases where vanilla A3C could not capture repeating patterns in data streams. Simulation results show that the modified A3C reduces the criticality-weighted deadline miss ratio from 0.3 to 0.19.
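The abstract names its evaluation metric without defining it. As a rough illustration only, the sketch below assumes one plausible reading: the summed criticality of packets that miss their deadline, divided by the summed criticality of all packets. The `Packet` class, its fields, and `weighted_miss_ratio` are hypothetical names introduced here, and the paper's exact definition may differ.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class Packet:
    """Hypothetical packet record; field names are assumptions, not from the paper."""
    criticality: float             # weight of the packet (larger = more critical)
    deadline: float                # latest time at which collection still counts
    collected_at: Optional[float]  # collection time, or None if never collected


def weighted_miss_ratio(packets: Iterable[Packet]) -> float:
    """Assumed metric: criticality of missed packets / criticality of all packets."""
    packets = list(packets)
    total = sum(p.criticality for p in packets)
    if total == 0:
        return 0.0  # no weighted packets, so nothing can be missed
    missed = sum(
        p.criticality
        for p in packets
        # A packet is missed if it was never collected or was collected late.
        if p.collected_at is None or p.collected_at > p.deadline
    )
    return missed / total


example = [
    Packet(criticality=3.0, deadline=10.0, collected_at=8.0),   # on time
    Packet(criticality=1.0, deadline=5.0, collected_at=7.0),    # late
    Packet(criticality=1.0, deadline=6.0, collected_at=None),   # never collected
]
ratio = weighted_miss_ratio(example)
```

Under this assumed definition, missing a high-criticality packet costs more than missing a low-criticality one, which is the trade-off the abstract describes.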
Pages: 95208 - 95222
Page count: 15
Related papers
50 in total
  • [21] Applying Deep Reinforcement Learning for Detection of Internet-of-Things Cyber Attacks
    Rookard, Curtis
    Khojandi, Anahita
    2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 389 - 395
  • [22] DeepWiERL: Bringing Deep Reinforcement Learning to the Internet of Self-Adaptive Things
    Restuccia, Francesco
    Melodia, Tommaso
    IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2020, : 844 - 853
  • [23] Deep reinforcement learning based mobile edge computing for intelligent Internet of Things
    Zhao, Rui
    Wang, Xinjie
    Xia, Junjuan
    Fan, Liseng
    PHYSICAL COMMUNICATION, 2020, 43
  • [24] A Multiagent Deep Reinforcement Learning Autonomous Security Management Approach for Internet of Things
    Ren, Bin
    Tang, Yunlong
    Wang, Huan
    Wang, Yichuan
    Liu, Jianxiong
    Gao, Ge
    Wei, Wei
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (15): : 25600 - 25612
  • [25] Autonomous Rate Control for Mobile Internet of Things: A Deep Reinforcement Learning Approach
    Xu, Wenchao
    Zhou, Haibo
    Cheng, Nan
    Lu, Ning
    Xu, Lijuan
    Qin, Meng
    Guo, Song
    2020 IEEE 92ND VEHICULAR TECHNOLOGY CONFERENCE (VTC2020-FALL), 2020,
  • [26] Federated Deep Reinforcement Learning for Internet of Things With Decentralized Cooperative Edge Caching
    Wang, Xiaofei
    Wang, Chenyang
    Li, Xiuhua
    Leung, Victor C. M.
    Taleb, Tarik
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (10) : 9441 - 9455
  • [27] Deep Reinforcement Learning Based Intelligent Job Batching in Industrial Internet of Things
    Jiang, Chengling
    Luo, Zihui
    Liu, Liang
    Zheng, Xiaolong
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2021, PT II, 2021, 12938 : 481 - 493
  • [28] Federated deep reinforcement learning based secure data sharing for Internet of Things
    Miao, Qinyang
    Lin, Hui
    Wang, Xiaoding
    Hassan, Mohammad Mehedi
    COMPUTER NETWORKS, 2021, 197
  • [29] Deep Reinforcement Learning Enables Joint Trajectory and Communication in Internet of Robotic Things
    Luo, Ruyu
    Tian, Hui
    Ni, Wanli
    Cheng, Julian
    Chen, Kwang-Cheng
    IEEE Transactions on Wireless Communications, 2024, 23 (12) : 18154 - 18168
  • [30] Security defense strategy algorithm for Internet of Things based on deep reinforcement learning
    Feng, Xuecai
    Han, Jikai
    Zhang, Rui
    Xu, Shuo
    Xia, Hui
    HIGH-CONFIDENCE COMPUTING, 2024, 4 (01):