A New Learning Model for Swarm Intelligence Based on Q-Learning

被引:0
|
作者
Li, Fuming [1 ]
He, Xiaoxian [2 ]
Xu, Jingjing [1 ]
机构
[1] YanShan Univ, Coll Econ & Management, Qinhuangdao, Peoples R China
[2] Cent South Univ, Coll Informat Sci & Engn, Changsha, Peoples R China
关键词
Neighbors' Discounted Information learning (NDI learning); i-interval neighbor; discounted reward; Q-learning; swarm intelligence; ANT; RETRIEVAL; TRANSPORT; PREY;
D O I
10.1109/WCICA.2010.5554902
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Inspired by cooperative transport behaviors of ants, on the basis of Q-learning, a new learning method, Neighbors' Discounted Information (NDI) learning method, is present in the paper. This is a swarm-based learning method, in which principles of swarm intelligence are strictly complied with. In NDI learning, the i-interval neighbor's information, namely its discounted reward, is referenced when an individual selects the next state, so that it can make the best decision in a computable local neighborhood. In application, different policies of NDI learning are recommended by controlling the parameters according to time-relativity of concrete tasks. By applying this learning method, the cooperative transport of ants is simulated. Experiment results show that the transport process in simulation is very similar to the phenomenon in natural world, which proves the designed learning mechanism's rationality.
引用
收藏
页码:2769 / 2775
页数:7
相关论文
共 13 条