Reinforcement Learning Based Stochastic Shortest Path Finding in Wireless Sensor Networks

被引:21
|
作者
Xia, Wenwen [1 ]
Di, Chong [1 ]
Guo, Haonan [1 ]
Li, Shenghong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Cyber Secur, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
Stochastic shortest path finding; reinforcement learning; Q-learning; SARSA; convergence proof; AUTOMATA;
D O I
10.1109/ACCESS.2019.2950055
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many factors influence the connection states between nodes of wireless sensor networks, such as physical distance, and the network load, making the network's edge length dynamic in abundant scenarios. This dynamic property makes the network essentially form a graph with stochastic edge lengths. In this paper, we study the stochastic shortest path problem on a directional graph with stochastic edge lengths, using reinforcement learning algorithms. we regard each edge length as a random variable following unknown probability distribution and aim to find the stochastic shortest path on this stochastic graph. We evaluate the performance of path-finding algorithms using regret, which represents the cumulative reward difference between the practical path-finding algorithm and the optimal strategy that chooses the global stochastic shortest path every time. We model the path-finding procedure as a Markov decision process and propose two online path-finding algorithms: Q(SSP) algorithm and SARSA(SSP) algorithm, both combined with specifically-devised average reward mechanism. We justify the convergence property and correctness of the proposed algorithms theoretically. Experiments conducted on two benchmark graphs illustrate the superior performance of the proposed Q(SSP) algorithm which outperforms the SARSA(SSP) algorithm and other competitive algorithms about the regret metric.
引用
收藏
页码:157807 / 157817
页数:11
相关论文
共 50 条
  • [1] Finding the shortest path in stochastic networks
    Peer, S. K.
    Sharma, Dinesh K.
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2007, 53 (05) : 729 - 740
  • [2] Clustering Algorithm in wireless sensor networks based on shortest path
    El Khediri, Salim
    Thaljaoui, Adel
    Dallali, Adel
    Kachouri, Abdennaceur
    [J]. 2018 30TH INTERNATIONAL CONFERENCE ON MICROELECTRONICS (ICM), 2018, : 335 - 338
  • [3] A novel connectivity algorithm based on shortest path for wireless sensor networks
    El Khediri, Salim
    Thaljaoui, Adel
    Dallali, Adel
    Harakti, Souli
    Kachouri, Abdennaceur
    [J]. 2018 1ST INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS & INFORMATION SECURITY (ICCAIS' 2018), 2018,
  • [4] A reinforcement learning approach involving a shortest path finding algorithm
    Kwon, WY
    Lee, S
    Suh, IH
    [J]. IROS 2003: PROCEEDINGS OF THE 2003 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2003, : 436 - 441
  • [5] An iterative stochastic algorithm based on distributed learning automata for finding the stochastic shortest path in stochastic graphs
    Beigy, Hamid
    Meybodi, Mohammad Reza
    [J]. JOURNAL OF SUPERCOMPUTING, 2020, 76 (07): : 5540 - 5562
  • [6] An iterative stochastic algorithm based on distributed learning automata for finding the stochastic shortest path in stochastic graphs
    Hamid Beigy
    Mohammad Reza Meybodi
    [J]. The Journal of Supercomputing, 2020, 76 : 5540 - 5562
  • [7] Stochastic Shortest Path Finding in Path-Centric Uncertain Road Networks
    Andonov, Georgi
    Yang, Bin
    [J]. 2018 19TH IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2018), 2018, : 40 - 45
  • [8] Ensemble path finding in wireless sensor networks
    Prasad, N.
    Baba, A. M. K. Kanna
    Rao, S. Krishna
    [J]. INTERNATIONAL CONFERENCE ON COMPUTER VISION AND MACHINE LEARNING, 2019, 1228
  • [9] A novel connectivity and coverage algorithm based on shortest path for wireless sensor networks
    Sun, Geng
    Liu, Yanheng
    Li, Han
    Wang, Aimin
    Liang, Shuang
    Zhang, Ying
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2018, 71 : 1025 - 1039
  • [10] Shortest path planning of a data mule in wireless sensor networks
    Yanzhi Hu
    Fengbin Zhang
    Tian Tian
    Dawei Ma
    Zhiyong Shi
    [J]. Wireless Networks, 2022, 28 : 1129 - 1145