A Model-Based Reinforcement Learning Algorithm for Routing in Energy Harvesting Mobile Ad-Hoc Networks

被引:13
|
作者
Maleki, Meisam [1 ]
Hakami, Vesal [2 ]
Dehghan, Mehdi [1 ]
机构
[1] Amirkabir Univ Technol, Mobile Ad Hoc & Wireless Sensor Networks Lab, Dept Comp Engn & Informat Technol, 424 Hafez Ave,POB 15875-4413, Tehran, Iran
[2] Iran Univ Sci & Technol, Dept Comp Engn, Hengam St,Resalat Sq, Tehran 1684613114, Iran
关键词
Mobile ad-hoc networks; Routing; Model-based reinforcement learning; MDP; Energy harvesting;
D O I
10.1007/s11277-017-3987-8
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Dynamic topology, lack of a fixed infrastructure and limited energy in mobile ad-hoc networks (MANETs) give rise to a challenging operational environment. MANET routing protocols should consider dynamic network changes (e.g., link qualities and nodes residual energy) in such circumstances and be able to adapt to these changes to efficiently handle the traffic flows. In this paper, we assume an energy harvesting MANET in which the nodes have recharging capability and thus their residual energy level is randomly changing with time. We present a bi-objective intelligent routing protocol that aims at reducing an expected long-run cost function composed of end-to-end delay and the path energy cost. We formulate the routing problem as a Markov decision process which captures both the link state dynamics due to node mobility and energy state dynamics due to nodes rechargeable energy sources. We propose a multi-agent reinforcement learning-based algorithm to approximate the optimal routing policy in the absence of a priori knowledge of the system statistics. The proposed algorithm is built using the principles of model-based RL. More specifically, we model each node's cost function by deriving an expression for the expected value of end-to-end costs. Also the transition probabilities are estimated online using a tabular maximum likelihood method. Simulation results show that our model-based scheme outperforms its model-free counterpart and operates closely to standard value-iteration which assumes perfect statistics.
引用
收藏
页码:3119 / 3139
页数:21
相关论文
共 50 条
  • [21] Reinforcement Learning Based Mobility Adaptive Routing for Vehicular Ad-Hoc Networks
    Jinqiao Wu
    Min Fang
    Xiao Li
    [J]. Wireless Personal Communications, 2018, 101 : 2143 - 2171
  • [22] A Dynamic Ant Colony Based Routing Algorithm for Mobile Ad-hoc Networks
    Khosrowshahi-Asl, Ehsan
    Noorhosseini, Majid
    Pirouz, Atieh Saberi
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2011, 27 (05) : 1581 - 1596
  • [23] Routing with a density-based probabilistic algorithm for mobile ad-hoc networks
    Ong, Hean-Loong
    Natsheh, Essam
    Wan, Tat-Chee
    [J]. JOURNAL OF HIGH SPEED NETWORKS, 2011, 18 (02) : 83 - 114
  • [24] Ant routing algorithm for mobile ad-hoc networks based on adaptive improvement
    Zeng, YY
    He, YX
    [J]. 2005 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING PROCEEDINGS, VOLS 1 AND 2, 2005, : 678 - 681
  • [25] A multicast routing algorithm based on mobile multicast agents in Ad-hoc networks
    Wang, X
    Li, F
    Ishihara, S
    Mizuno, T
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 2001, E84B (08) : 2087 - 2094
  • [26] A position based ant colony routing algorithm for mobile ad-hoc networks
    Department of Computer Science and Software Engineering, Concordia University, Montreal, QC, Canada
    [J]. Journal of Networks, 2008, 3 (04) : 31 - 41
  • [27] An energy aware routing protocol for mobile ad-hoc networks
    Chattopadhyay, Alokes
    Thomas, Markose
    Gupta, Arobinda
    [J]. ADCOM 2007: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATIONS, 2007, : 262 - +
  • [28] Energy-Aware Routing In Mobile Ad-Hoc Networks
    Murali, P.
    Rakesh, K.
    Hota, Chittaranjan
    Antti, Yla-Jaaski
    [J]. 2008 1ST IFIP WIRELESS DAYS (WD), 2008, : 300 - +
  • [29] Model-based prototyping of an interoperability protocol for mobile ad-hoc networks
    Kristensen, LM
    Westergaard, M
    Norgaard, PC
    [J]. INTEGRATED FORMAL METHODS, PROCEEDINGS, 2005, 3771 : 266 - 286
  • [30] Energy and Velocity Based Tree Multicast Routing in Mobile Ad-Hoc Networks
    Abu Sufian
    Anuradha Banerjee
    Paramartha Dutta
    [J]. Wireless Personal Communications, 2019, 107 : 2191 - 2209