AntNet with Reward-Penalty Reinforcement Learning

被引:21
|
作者
Lalbakhsh, Pooia [1 ]
Zaeri, Bahram [2 ]
Lalbakhsh, Ali [3 ]
Fesharaki, Mehdi N. [4 ]
机构
[1] Islamic Azad Univ, Dept Comp Engn, Borujerd Branch, Borujerd, Lorestan, Iran
[2] Islamic Azad Univ Arak Branch, Young Res Club YRC, Arak, Iran
[3] Islamic Azad Univ Sci & Res Campus, Dept Telecommun Engn, Tehran, Iran
[4] Islamic Azad Univ Sci & Res Campus, Dept Comp Engn, Tehran, Iran
来源
2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, COMMUNICATION SYSTEMS AND NETWORKS (CICSYN) | 2010年
关键词
Ant colony optimization; AntNet; reward-penalty reinforcement learning; swarm intelligence;
D O I
10.1109/CICSyN.2010.11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper deals with a modification in the learning phase of AntNet routing algorithm, which improves the system adaptability in the presence of undesirable events. Unlike most of the ACO algorithms which consider reward-inaction reinforcement learning, the proposed strategy considers both reward and penalty onto the action probabilities. As simulation results show, considering penalty in AntNet routing algorithm increases the exploration towards other possible and sometimes much optimal selections, which leads to a more adaptive strategy. The proposed algorithm also uses a self-monitoring solution called Occurrence-Detection, to sense traffic fluctuations and make decision about the level of undesirability of the current status. The proposed algorithm makes use of the two mentioned strategies to prepare a self-healing version of AntNet routing algorithm to face undesirable and unpredictable traffic conditions.
引用
收藏
页码:17 / 21
页数:5
相关论文
共 50 条
  • [31] Analyses of the reward-penalty mechanism in green closed-loop supply chains with product remanufacturing
    Chen, Cheng-Kang
    Akmalul'Ulya, M.
    INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2019, 210 : 211 - 223
  • [32] Government Reward-Penalty Mechanism in Dual-Channel Closed-Loop Supply Chain
    Chen, Haitao
    Dong, Zhaohui
    Li, Gendao
    SUSTAINABILITY, 2020, 12 (20) : 1 - 15
  • [33] Pricing and channel selection decisions for new and remanufactured products under carbon emission reward-penalty
    Liang J.
    Fan L.
    Wang N.
    Zhou P.
    Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2023, 43 (04): : 1116 - 1131
  • [34] Reliability assessment of utilities using an enhanced reward-penalty model in performance based regulation system
    Fotuhi, M.
    Shourkaei, H. M.
    Kharazi, M. B.
    Salimi, A.
    2006 INTERNATIONAL CONFERENCE ON POWER SYSTEMS TECHNOLOGY: POWERCON, VOLS 1- 6, 2006, : 2890 - +
  • [35] A Multi-Period Regulation Methodology for Reliability as Service Quality Considering Reward-Penalty Scheme
    Alizadeh, Ali
    Fereidunian, Alireza
    Kamwa, Innocent
    Mohseni-Bonab, Seyed Masoud
    Lesani, Hamid
    IEEE TRANSACTIONS ON POWER DELIVERY, 2023, 38 (02) : 1440 - 1451
  • [36] Reward-Penalty Weighted Ensemble for Emotion State Classification from Multi-Modal Data Streams
    Nandi, Arijit
    Xhafa, Fatos
    Subirats, Laia
    Fort, Santi
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2022, 32 (12)
  • [37] CONSTRAINED GENETIC OPTIMIZATION VIA DYNAMIC REWARD-PENALTY BALANCING AND ITS USE IN PATTERN-RECOGNITION
    SIEDLECKI, W
    SKLANSKY, J
    PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON GENETIC ALGORITHMS, 1989, : 141 - 150
  • [38] Pricing strategies for end-of-life vehicle regarding reward-penalty mechanism and customers' environmental awareness
    Sun, Hongxia
    Li, Hui
    RAIRO-OPERATIONS RESEARCH, 2024, 58 (01) : 397 - 421
  • [39] Government Reward-Penalty Mechanism in Closed-Loop Supply Chain Based on Dynamics Game Theory
    Zhang, Xiaoqing
    Su, Yingsheng
    Yuan, Xigang
    DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2018, 2018
  • [40] Supply chain pricing and delivery channel selection considering lead time under reward-penalty mechanism
    Li D.
    Wei A.
    Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2023, 43 (03): : 841 - 856