Simultaneous perturbation stochastic approximation based neural networks for online learning

被引:0
|
作者
Choy, MC [1 ]
Srinivasan, D [1 ]
Cheu, RL [1 ]
机构
[1] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117576, Singapore
关键词
simultaneous perturbation stochastic approximation; online learning; multi-agents; traffic signal control;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a new application of simultaneous perturbation stochastic approximation (SPSA) for online learning and weight updates in multiple neural networks (SPSA-NN). A multi-agent system is implemented for dynamic control of traffic signals in a complex traffic network with numerous intersections. Neural networks are used to approximate the optimal traffic signal control strategies for each agent and the parameters of these neural networks are updated online using an enhanced version of SPSA. Many simulation runs have been carried out to evaluate the performance of the SPSA-NN against an existing traffic signal control technique. Results show that the SPSA-NN based multi-agent system manages to outperform the existing technique. The mean delay of all vehicles has been reduced by 44% compared to the existing technique.
引用
下载
收藏
页码:1038 / 1044
页数:7
相关论文
共 50 条
  • [11] Robust neural network tracking controller using simultaneous perturbation stochastic approximation
    Song, Qing
    Spall, James C.
    Soh, Yeng Chai
    Ni, Jie
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2008, 19 (05): : 817 - 835
  • [12] Simultaneous Perturbation Stochastic Approximation Algorithm Combined with Neural Network and Fuzzy Simulation
    宁玉富
    唐万生
    郭长友
    Transactions of Tianjin University, 2008, (01) : 43 - 49
  • [13] Simultaneous perturbation stochastic approximation algorithm combined with neural network and fuzzy simulation
    Ning Y.
    Tang W.
    Guo C.
    Transactions of Tianjin University, 2008, 14 (1) : 43 - 49
  • [14] Robust neural network tracking controller using simultaneous perturbation stochastic approximation
    Song, Q
    Spall, JC
    Soh, YC
    42ND IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-6, PROCEEDINGS, 2003, : 6194 - 6199
  • [15] Localization in Wireless Sensor Networks by Constrained Simultaneous Perturbation Stochastic Approximation Technique
    Azim, Mohammad Abdul
    Aung, Zeyar
    Xiao, Weidong
    Khadkikar, Vinod
    Jamalipour, Abbas
    6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS'2012), 2012,
  • [16] Simultaneous perturbation stochastic approximation for tidal models
    Altaf, Muhammad Umer
    Heemink, Arnold W.
    Verlaan, Martin
    Hoteit, Ibrahim
    OCEAN DYNAMICS, 2011, 61 (08) : 1093 - 1105
  • [17] A Stopping Rule for Simultaneous Perturbation Stochastic Approximation
    Wada, Takayuki
    Fujisaki, Yasumasa
    2013 EUROPEAN CONTROL CONFERENCE (ECC), 2013, : 644 - 649
  • [18] Adaptive stochastic approximation by the simultaneous perturbation method
    Spall, JC
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2000, 45 (10) : 1839 - 1853
  • [19] Simultaneous perturbation stochastic approximation of nonsmooth functions
    Bartkute, Vaida
    Sakalauskas, Leonidas
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 181 (03) : 1174 - 1188
  • [20] Simultaneous perturbation stochastic approximation for tidal models
    Muhammad Umer Altaf
    Arnold W. Heemink
    Martin Verlaan
    Ibrahim Hoteit
    Ocean Dynamics, 2011, 61 : 1093 - 1105