AntNet with Reward-Penalty Reinforcement Learning

被引：21

作者：

Lalbakhsh, Pooia ^{[1
]}

Zaeri, Bahram ^{[2
]}

Lalbakhsh, Ali ^{[3
]}

Fesharaki, Mehdi N. ^{[4
]}

机构：

[1] Islamic Azad Univ, Dept Comp Engn, Borujerd Branch, Borujerd, Lorestan, Iran

[2] Islamic Azad Univ Arak Branch, Young Res Club YRC, Arak, Iran

[3] Islamic Azad Univ Sci & Res Campus, Dept Telecommun Engn, Tehran, Iran

[4] Islamic Azad Univ Sci & Res Campus, Dept Comp Engn, Tehran, Iran

来源：

2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, COMMUNICATION SYSTEMS AND NETWORKS (CICSYN) | 2010年

关键词：

Ant colony optimization; AntNet; reward-penalty reinforcement learning; swarm intelligence;

D O I：

10.1109/CICSyN.2010.11

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The paper deals with a modification in the learning phase of AntNet routing algorithm, which improves the system adaptability in the presence of undesirable events. Unlike most of the ACO algorithms which consider reward-inaction reinforcement learning, the proposed strategy considers both reward and penalty onto the action probabilities. As simulation results show, considering penalty in AntNet routing algorithm increases the exploration towards other possible and sometimes much optimal selections, which leads to a more adaptive strategy. The proposed algorithm also uses a self-monitoring solution called Occurrence-Detection, to sense traffic fluctuations and make decision about the level of undesirability of the current status. The proposed algorithm makes use of the two mentioned strategies to prepare a self-healing version of AntNet routing algorithm to face undesirable and unpredictable traffic conditions.

引用

页码：17 / 21

页数：5

共 50 条

[31] Analyses of the reward-penalty mechanism in green closed-loop supply chains with product remanufacturing
Chen, Cheng-Kang
Akmalul'Ulya, M.
INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2019, 210 : 211 - 223
[32] Government Reward-Penalty Mechanism in Dual-Channel Closed-Loop Supply Chain
Chen, Haitao
Dong, Zhaohui
Li, Gendao
SUSTAINABILITY, 2020, 12 (20) : 1 - 15
[33] Pricing and channel selection decisions for new and remanufactured products under carbon emission reward-penalty
Liang J.
Fan L.
Wang N.
Zhou P.
Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2023, 43 (04): : 1116 - 1131
[34] Reliability assessment of utilities using an enhanced reward-penalty model in performance based regulation system
Fotuhi, M.
Shourkaei, H. M.
Kharazi, M. B.
Salimi, A.
2006 INTERNATIONAL CONFERENCE ON POWER SYSTEMS TECHNOLOGY: POWERCON, VOLS 1- 6, 2006, : 2890 - +
[35] A Multi-Period Regulation Methodology for Reliability as Service Quality Considering Reward-Penalty Scheme
Alizadeh, Ali
Fereidunian, Alireza
Kamwa, Innocent
Mohseni-Bonab, Seyed Masoud
Lesani, Hamid
IEEE TRANSACTIONS ON POWER DELIVERY, 2023, 38 (02) : 1440 - 1451
[36] Reward-Penalty Weighted Ensemble for Emotion State Classification from Multi-Modal Data Streams
Nandi, Arijit
Xhafa, Fatos
Subirats, Laia
Fort, Santi
INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2022, 32 (12)
[37] CONSTRAINED GENETIC OPTIMIZATION VIA DYNAMIC REWARD-PENALTY BALANCING AND ITS USE IN PATTERN-RECOGNITION
SIEDLECKI, W
SKLANSKY, J
PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON GENETIC ALGORITHMS, 1989, : 141 - 150
[38] Pricing strategies for end-of-life vehicle regarding reward-penalty mechanism and customers' environmental awareness
Sun, Hongxia
Li, Hui
RAIRO-OPERATIONS RESEARCH, 2024, 58 (01) : 397 - 421
[39] Government Reward-Penalty Mechanism in Closed-Loop Supply Chain Based on Dynamics Game Theory
Zhang, Xiaoqing
Su, Yingsheng
Yuan, Xigang
DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2018, 2018
[40] Supply chain pricing and delivery channel selection considering lead time under reward-penalty mechanism
Li D.
Wei A.
Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2023, 43 (03): : 841 - 856

← 1 2 3 4 5 →