A Model-Based Reinforcement Learning Algorithm for Routing in Energy Harvesting Mobile Ad-Hoc Networks

被引:0
|
作者
Meisam Maleki
Vesal Hakami
Mehdi Dehghan
机构
[1] Amirkabir University of Technology,Mobile Ad Hoc and Wireless Sensor Networks Lab, Department of Computer Engineering and Information Technology
[2] Iran University of Science and Technology,Department of Computer Engineering
来源
关键词
Mobile ad-hoc networks; Routing; Model-based reinforcement learning; MDP; Energy harvesting;
D O I
暂无
中图分类号
学科分类号
摘要
Dynamic topology, lack of a fixed infrastructure and limited energy in mobile ad-hoc networks (MANETs) give rise to a challenging operational environment. MANET routing protocols should consider dynamic network changes (e.g., link qualities and nodes residual energy) in such circumstances and be able to adapt to these changes to efficiently handle the traffic flows. In this paper, we assume an energy harvesting MANET in which the nodes have recharging capability and thus their residual energy level is randomly changing with time. We present a bi-objective intelligent routing protocol that aims at reducing an expected long-run cost function composed of end-to-end delay and the path energy cost. We formulate the routing problem as a Markov decision process which captures both the link state dynamics due to node mobility and energy state dynamics due to nodes rechargeable energy sources. We propose a multi-agent reinforcement learning-based algorithm to approximate the optimal routing policy in the absence of a priori knowledge of the system statistics. The proposed algorithm is built using the principles of model-based RL. More specifically, we model each node’s cost function by deriving an expression for the expected value of end-to-end costs. Also the transition probabilities are estimated online using a tabular maximum likelihood method. Simulation results show that our model-based scheme outperforms its model-free counterpart and operates closely to standard value-iteration which assumes perfect statistics.
引用
收藏
页码:3119 / 3139
页数:20
相关论文
共 50 条
  • [41] Routing protocols in Mobile Ad-hoc Networks
    Mikaric, Bratislav
    Rancic, Dejan
    Ilic, Slavisa
    [J]. PRZEGLAD ELEKTROTECHNICZNY, 2020, 96 (08): : 106 - 111
  • [42] Secure Routing in Multihop Ad-Hoc Networks With SRR-Based Reinforcement Learning
    Lu, Jianzhong
    He, Dongxuan
    Wang, Zhaocheng
    [J]. IEEE WIRELESS COMMUNICATIONS LETTERS, 2022, 11 (02) : 362 - 366
  • [43] A RED based minimum energy routing algorithm for wireless ad-hoc networks
    Jin, XY
    Cai, WY
    Zhang, Y
    [J]. 2005 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING PROCEEDINGS, VOLS 1 AND 2, 2005, : 757 - 761
  • [44] Selective route discovery routing algorithm for mobile ad-hoc networks
    Kim, TE
    Kim, WT
    Park, YJ
    [J]. INFORMATION NETWORKING: CONVERGENCE IN BROADBAND AND MOBILE NETWORKING, 2005, 3391 : 152 - 159
  • [45] Research of QoS Routing Algorithm in Ad Hoc Networks based on Reinforcement Learning
    Fu, Yuchen
    Liu, Quan
    [J]. ELEKTRONIKA IR ELEKTROTECHNIKA, 2013, 19 (02) : 83 - 87
  • [46] A tabu search algorithm for routing optimization in mobile ad-hoc networks
    Kil-Woong Jang
    [J]. Telecommunication Systems, 2012, 51 : 177 - 191
  • [47] Condensation-based routing in mobile ad-hoc networks
    Palmieri, Francesco
    Castiglione, Aniello
    [J]. MOBILE INFORMATION SYSTEMS, 2012, 8 (03) : 199 - 211
  • [48] Associativity-Based Routing for Ad-Hoc Mobile Networks
    Toh C.-K.
    [J]. Wireless Personal Communications, 1997, 4 (2) : 103 - 139
  • [49] Adaptive ant colony routing algorithm for mobile ad-hoc networks
    Zeng Yuan-yuan
    Guan Ji-hong
    [J]. PROCEEDINGS OF 2005 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1 AND 2, 2005, : 1491 - 1494
  • [50] Distributed Trust Based Routing in Mobile Ad-Hoc Networks
    Jain, Shalabh
    Baras, John S.
    [J]. 2013 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM 2013), 2013, : 1801 - 1807