AntNet with Reward-Penalty Reinforcement Learning

被引：21

作者：

Lalbakhsh, Pooia ^{[1
]}

Zaeri, Bahram ^{[2
]}

Lalbakhsh, Ali ^{[3
]}

Fesharaki, Mehdi N. ^{[4
]}

机构：

[1] Islamic Azad Univ, Dept Comp Engn, Borujerd Branch, Borujerd, Lorestan, Iran

[2] Islamic Azad Univ Arak Branch, Young Res Club YRC, Arak, Iran

[3] Islamic Azad Univ Sci & Res Campus, Dept Telecommun Engn, Tehran, Iran

[4] Islamic Azad Univ Sci & Res Campus, Dept Comp Engn, Tehran, Iran

来源：

2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, COMMUNICATION SYSTEMS AND NETWORKS (CICSYN) | 2010年

关键词：

Ant colony optimization; AntNet; reward-penalty reinforcement learning; swarm intelligence;

D O I：

10.1109/CICSyN.2010.11

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The paper deals with a modification in the learning phase of AntNet routing algorithm, which improves the system adaptability in the presence of undesirable events. Unlike most of the ACO algorithms which consider reward-inaction reinforcement learning, the proposed strategy considers both reward and penalty onto the action probabilities. As simulation results show, considering penalty in AntNet routing algorithm increases the exploration towards other possible and sometimes much optimal selections, which leads to a more adaptive strategy. The proposed algorithm also uses a self-monitoring solution called Occurrence-Detection, to sense traffic fluctuations and make decision about the level of undesirability of the current status. The proposed algorithm makes use of the two mentioned strategies to prepare a self-healing version of AntNet routing algorithm to face undesirable and unpredictable traffic conditions.

引用

页码：17 / 21

页数：5

共 50 条

[21] A hierarchical approach to designing an electricity distribution reward-penalty scheme for service quality improvement
Ghasemi, Mostafa
Dashti, Reza
Amirioun, Mohammad Hassan
INTERNATIONAL TRANSACTIONS ON ELECTRICAL ENERGY SYSTEMS, 2021, 31 (12)
[22] Reward-Penalty Assignments and Genetic Algorithms for Ordinal Interval Number Group Decision Making
Tambouratzis, Tatiana
Canellidis, Vassileios
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2014, 29 (08) : 727 - 750
[23] Performance Evaluation of Network Productivity for LTE Heterogeneous Networks with Reward-Penalty Weights Assessment
Sawant, Uttara
Akl, Robert
2017 IEEE 7TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE IEEE CCWC-2017, 2017,
[24] Adaptive Call Admission Control Based on Reward-Penalty Model in Wireless/Mobile Network
Jian-Hui Huang
De-Pei Qian
Sheng-Ling Wang
Journal of Computer Science and Technology, 2007, 22 : 527 - 531
[25] Reliability incentive regulation based on reward-penalty mechanism using distribution feeders clustering
Khonakdar-Tarsi, Iman
Fotuhi-Firuzabad, Mahmud
Ehsan, Mehdi
Mohammadnezhad-Shourkaei, Hosein
Jooshaki, Mohammad
INTERNATIONAL TRANSACTIONS ON ELECTRICAL ENERGY SYSTEMS, 2021, 31 (08)
[26] Reward design for multi-agent reinforcement learning with a penalty based on the payment mechanism
Matsunami N.
Okuhara S.
Ito T.
Transactions of the Japanese Society for Artificial Intelligence, 2021, 36 (05)
[27] Adaptive call admission control based oil reward-penalty model in wireless/mobile network
Huang, Jian-Hui
Qian, De-Pei
Wang, Sheng-Ling
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2007, 22 (04) : 527 - 531
[28] Reward-Penalty Mechanism Based on Daily Energy Consumption for Net-Zero Energy Buildings
Zhang, Yang
Lu, Yuehong
Wang, Changlong
Huang, Zhijia
Lv, Tao
SUSTAINABILITY, 2021, 13 (22)
[29] Reward-penalty mechanism in a closed-loop supply chain with sequential manufacturers' price competition
Wang, Wenbin
Fan, Lingling
Ma, Peng
Zhang, Peng
Lu, Zhenye
JOURNAL OF CLEANER PRODUCTION, 2017, 168 : 118 - 130
[30] CARBON ABATEMENT AND RECYCLING DECISIONS IN CLOSED-LOOP SUPPLY CHAIN WITH THE REWARD-PENALTY MECHANISMS
Jiang, Lan
Zhen, Zhiyuan
JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2024, 20 (04) : 1483 - 1510

← 1 2 3 4 5 →