AntNet with Reward-Penalty Reinforcement Learning

被引:21
|
作者
Lalbakhsh, Pooia [1 ]
Zaeri, Bahram [2 ]
Lalbakhsh, Ali [3 ]
Fesharaki, Mehdi N. [4 ]
机构
[1] Islamic Azad Univ, Dept Comp Engn, Borujerd Branch, Borujerd, Lorestan, Iran
[2] Islamic Azad Univ Arak Branch, Young Res Club YRC, Arak, Iran
[3] Islamic Azad Univ Sci & Res Campus, Dept Telecommun Engn, Tehran, Iran
[4] Islamic Azad Univ Sci & Res Campus, Dept Comp Engn, Tehran, Iran
来源
2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, COMMUNICATION SYSTEMS AND NETWORKS (CICSYN) | 2010年
关键词
Ant colony optimization; AntNet; reward-penalty reinforcement learning; swarm intelligence;
D O I
10.1109/CICSyN.2010.11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper deals with a modification in the learning phase of AntNet routing algorithm, which improves the system adaptability in the presence of undesirable events. Unlike most of the ACO algorithms which consider reward-inaction reinforcement learning, the proposed strategy considers both reward and penalty onto the action probabilities. As simulation results show, considering penalty in AntNet routing algorithm increases the exploration towards other possible and sometimes much optimal selections, which leads to a more adaptive strategy. The proposed algorithm also uses a self-monitoring solution called Occurrence-Detection, to sense traffic fluctuations and make decision about the level of undesirability of the current status. The proposed algorithm makes use of the two mentioned strategies to prepare a self-healing version of AntNet routing algorithm to face undesirable and unpredictable traffic conditions.
引用
收藏
页码:17 / 21
页数:5
相关论文
共 50 条
  • [21] A hierarchical approach to designing an electricity distribution reward-penalty scheme for service quality improvement
    Ghasemi, Mostafa
    Dashti, Reza
    Amirioun, Mohammad Hassan
    INTERNATIONAL TRANSACTIONS ON ELECTRICAL ENERGY SYSTEMS, 2021, 31 (12)
  • [22] Reward-Penalty Assignments and Genetic Algorithms for Ordinal Interval Number Group Decision Making
    Tambouratzis, Tatiana
    Canellidis, Vassileios
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2014, 29 (08) : 727 - 750
  • [23] Performance Evaluation of Network Productivity for LTE Heterogeneous Networks with Reward-Penalty Weights Assessment
    Sawant, Uttara
    Akl, Robert
    2017 IEEE 7TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE IEEE CCWC-2017, 2017,
  • [24] Adaptive Call Admission Control Based on Reward-Penalty Model in Wireless/Mobile Network
    Jian-Hui Huang
    De-Pei Qian
    Sheng-Ling Wang
    Journal of Computer Science and Technology, 2007, 22 : 527 - 531
  • [25] Reliability incentive regulation based on reward-penalty mechanism using distribution feeders clustering
    Khonakdar-Tarsi, Iman
    Fotuhi-Firuzabad, Mahmud
    Ehsan, Mehdi
    Mohammadnezhad-Shourkaei, Hosein
    Jooshaki, Mohammad
    INTERNATIONAL TRANSACTIONS ON ELECTRICAL ENERGY SYSTEMS, 2021, 31 (08)
  • [26] Reward design for multi-agent reinforcement learning with a penalty based on the payment mechanism
    Matsunami N.
    Okuhara S.
    Ito T.
    Transactions of the Japanese Society for Artificial Intelligence, 2021, 36 (05)
  • [27] Adaptive call admission control based oil reward-penalty model in wireless/mobile network
    Huang, Jian-Hui
    Qian, De-Pei
    Wang, Sheng-Ling
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2007, 22 (04) : 527 - 531
  • [28] Reward-Penalty Mechanism Based on Daily Energy Consumption for Net-Zero Energy Buildings
    Zhang, Yang
    Lu, Yuehong
    Wang, Changlong
    Huang, Zhijia
    Lv, Tao
    SUSTAINABILITY, 2021, 13 (22)
  • [29] Reward-penalty mechanism in a closed-loop supply chain with sequential manufacturers' price competition
    Wang, Wenbin
    Fan, Lingling
    Ma, Peng
    Zhang, Peng
    Lu, Zhenye
    JOURNAL OF CLEANER PRODUCTION, 2017, 168 : 118 - 130
  • [30] CARBON ABATEMENT AND RECYCLING DECISIONS IN CLOSED-LOOP SUPPLY CHAIN WITH THE REWARD-PENALTY MECHANISMS
    Jiang, Lan
    Zhen, Zhiyuan
    JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2024, 20 (04) : 1483 - 1510