Reinforcement Learning Based Stochastic Shortest Path Finding in Wireless Sensor Networks

Cited by: 21
Authors
Xia, Wenwen [1]
Di, Chong [1]
Guo, Haonan [1]
Li, Shenghong [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Sch Cyber Secur, Shanghai 200240, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Stochastic shortest path finding; reinforcement learning; Q-learning; SARSA; convergence proof; AUTOMATA;
DOI
10.1109/ACCESS.2019.2950055
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Many factors, such as physical distance and network load, influence the connection states between nodes of wireless sensor networks, making the network's edge lengths dynamic in many scenarios. This dynamic property means the network essentially forms a graph with stochastic edge lengths. In this paper, we study the stochastic shortest path problem on a directed graph with stochastic edge lengths, using reinforcement learning algorithms. We regard each edge length as a random variable following an unknown probability distribution and aim to find the stochastic shortest path on this stochastic graph. We evaluate the performance of path-finding algorithms using regret, which represents the cumulative reward difference between the practical path-finding algorithm and the optimal strategy that chooses the global stochastic shortest path every time. We model the path-finding procedure as a Markov decision process and propose two online path-finding algorithms, the Q(SSP) algorithm and the SARSA(SSP) algorithm, both combined with a specifically devised average-reward mechanism. We theoretically justify the convergence and correctness of the proposed algorithms. Experiments conducted on two benchmark graphs show that the proposed Q(SSP) algorithm outperforms the SARSA(SSP) algorithm and other competitive algorithms in terms of regret.
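The setting described in the abstract can be illustrated with a minimal sketch. This is plain tabular Q-learning with epsilon-greedy exploration, not the authors' Q(SSP) algorithm, and it omits their average-reward mechanism. The toy graph, the uniform edge-length distributions, the hyperparameters, and all function names are assumptions made for illustration only. Regret is approximated here as the cumulative gap in expected path length between the chosen path and the best path.

```python
import random

# Hypothetical toy directed graph: node -> {successor: (mean_length, spread)}.
# Each edge length is drawn uniformly from [mean - spread, mean + spread];
# the learner never sees these parameters, only sampled lengths.
GRAPH = {
    "s": {"a": (1.0, 0.5), "b": (2.0, 0.5)},
    "a": {"t": (4.0, 1.0)},
    "b": {"t": (1.0, 0.5)},
    "t": {},
}
SOURCE, TARGET = "s", "t"


def sample_length(u, v, rng):
    """Draw one random length for edge (u, v)."""
    mean, spread = GRAPH[u][v]
    return rng.uniform(mean - spread, mean + spread)


def best_expected_length(node):
    """Expected length of the best path from node to TARGET (acyclic graph)."""
    if node == TARGET:
        return 0.0
    return min(GRAPH[node][v][0] + best_expected_length(v) for v in GRAPH[node])


def q_learning(episodes=2000, alpha=0.1, epsilon=0.1, seed=0):
    """Undiscounted tabular Q-learning; reward is the negative sampled edge length."""
    rng = random.Random(seed)
    q = {u: {v: 0.0 for v in nbrs} for u, nbrs in GRAPH.items()}
    optimal = best_expected_length(SOURCE)
    regret = 0.0
    for _ in range(episodes):
        node, expected_cost = SOURCE, 0.0
        while node != TARGET:
            actions = list(q[node])
            if rng.random() < epsilon:
                nxt = rng.choice(actions)                       # explore
            else:
                nxt = max(actions, key=lambda v: q[node][v])    # exploit
            reward = -sample_length(node, nxt, rng)
            future = max(q[nxt].values(), default=0.0)          # 0 at the target
            q[node][nxt] += alpha * (reward + future - q[node][nxt])
            expected_cost += GRAPH[node][nxt][0]
            node = nxt
        # Gap between the chosen path and the best path, in expected length.
        regret += expected_cost - optimal
    return q, regret


def greedy_path(q):
    """Follow the highest-valued edge from SOURCE until TARGET is reached."""
    path = [SOURCE]
    while path[-1] != TARGET:
        path.append(max(q[path[-1]], key=q[path[-1]].get))
    return path


if __name__ == "__main__":
    q_table, total_regret = q_learning()
    print("learned path:", greedy_path(q_table))
    print("cumulative regret:", total_regret)
```

In this toy graph the path through "b" has the smaller expected length, so a learner that converges should end up preferring it, and regret accumulates only in episodes where the path through "a" is taken.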
Pages: 157807-157817
Number of pages: 11
Related papers
50 in total
  • [21] Prioritized Shortest Path Computation Mechanism (PSPCM) for wireless sensor networks
    Onwuegbuzie, Innocent Uzougbo
    Abd Razak, Shukor
    Isnin, Ismail Fauzi
    Al-dhaqm, Arafat
    Anuar, Nor Badrul
    [J]. PLOS ONE, 2022, 17 (03):
  • [22] Community Structure Based Shortest Path Finding for Social Networks
    Chai, Yale
    Song, Chunyao
    Nie, Peng
    Yuan, Xiaojie
    Ge, Yao
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2018, PT I, 2018, 11029 : 303 - 319
  • [23] Finding the Shortest Path in Stochastic Graphs Using Learning Automata and Adaptive Stochastic Petri Nets
    Vahidipour, S. Mehdi
    Meybodi, Mohammad Reza
    Esnaashari, Mehdi
    [J]. INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2017, 25 (03) : 427 - 455
  • [24] Planning wireless networks by shortest path
    Mannino, C.
    Mattia, S.
    Sassano, A.
    [J]. COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2011, 48 (03) : 533 - 551
  • [25] Planning wireless networks by shortest path
    C. Mannino
    S. Mattia
    A. Sassano
    [J]. Computational Optimization and Applications, 2011, 48 : 533 - 551
  • [26] An Enhanced Tree Routing Based on Reinforcement Learning in Wireless Sensor Networks
    Kim, Beom-Su
    Suh, Beomkyu
    Seo, In Jin
    Lee, Han Byul
    Gong, Ji Seon
    Kim, Ki-Il
    [J]. SENSORS, 2023, 23 (01)
  • [27] Reinforcement Learning based Routing Protocol for Wireless Body Sensor Networks
    Kiani, Farzad
    [J]. 2017 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CLOUD AND SERVICE COMPUTING (SC2 2017), 2017, : 71 - 78
  • [28] An Intelligent Routing Algorithm in Wireless Sensor Networks based on Reinforcement Learning
    Guo, Wenjing
    Yan, Cairong
    Gan, Yanglan
    Lu, Ting
    [J]. ADVANCES IN MECHATRONICS AND CONTROL ENGINEERING III, 2014, 678 : 487 - 493
  • [29] Fault Tolerance in Wireless Sensor Networks: Finding Primary Path
    Parwekar, Pritee
    Rodda, Sireesha
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGIES, IC3T 2015, VOL 1, 2016, 379 : 593 - 604
  • [30] Many-objective stochastic path finding using reinforcement learning
    Tozer, Bentz
    Mazzuchi, Thomas
    Sarkani, Shahram
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 72 : 371 - 382