Actor-Critic Algorithm for Optimal Synchronization of Kuramoto Oscillator

被引:0
|
作者
Vrushabh, D. [1 ]
Shalini, K. [1 ]
Sonam, K. [1 ]
机构
[1] Veermata Jijabai Technol Inst, EED, Mumbai, Maharashtra, India
关键词
Reinforcement learning; Hamilton-Jacobi-Bellman; Approximate Dynamic Programming; Kuramoto oscillator; Mean-field game; Order parameter; NETWORKS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper constructs a reinforcement learning (RL) based algorithm of Actor-Critic (AC) for the optimal synchronism of the Kuramoto oscillator. This is accomplished through the Ott-Antonsen ansatz framework for the dynamics of large interactive unit networks. Besides, this approach reduces the infinite-dimensional dynamics to phase space flow, i.e., low dimensional dynamics for certain systems of globally coupled phase oscillators. The resulting Hamiltonian-Jacobi-Bellman (HJB) expression is extremely difficult to solve in general, therefore this paper introduces the AC method for learning approximate optimal control laws for the Kuramoto oscillator model. RL has been contemplated as one of the efficient methods to solve optimal control of non-linear systems. For a collection of non-homogeneous oscillators, the states are elucidated as phase angles, which is the modification of the model for a coupled Kuramoto oscillator. An admissible initial control policy for the Kuramoto oscillator model is designed and solved using RL giving an approximate solution of the optimal control problem. Finally, local synchronism of the coupled Kuramoto oscillator model is supported through simulations analysis.
引用
收藏
页码:391 / 396
页数:6
相关论文
共 50 条
  • [41] Actor-critic algorithm with incremental dual natural policy gradient
    Zhang P.
    Liu Q.
    Zhong S.
    Zhai J.-W.
    Qian W.-S.
    2017, Editorial Board of Journal on Communications (38): : 166 - 177
  • [42] Decentralized Multiagent Actor-Critic Algorithm Based on Message Diffusion
    Ding, Siyuan
    Li, Shengxiang
    Liu, Guangyi
    Li, Ou
    Ke, Ke
    Bai, Yijie
    Chen, Weiye
    JOURNAL OF SENSORS, 2021, 2021
  • [43] Adaptive Dynamic Programming for Optimal Synchronization of Kuramoto Oscillator
    Vrushabh, D.
    Shalini, K.
    Sonam, K.
    Wagh, S.
    Singh, N. M.
    2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 1755 - 1760
  • [44] An Experience-Guided Deep Deterministic Actor-Critic Algorithm with Multi-Actor
    Chen H.
    Liu Q.
    Yan Y.
    He B.
    Jiang Y.
    Zhang L.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (08): : 1708 - 1720
  • [45] Optimal Tracking Control for Robotic Manipulator using Actor-Critic Network
    Hu, Yong
    Cui, Lingguo
    Chai, Senchun
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 1556 - 1561
  • [46] Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning
    Zhu, Hanlin
    Rashidinejad, Paria
    Jiao, Jiantao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [47] Optimal Control of Affine Nonlinear Continuous-time Systems Using Online Actor-Critic Algorithm
    Chen Xue-song
    Yang Ming-sheng
    Liu Fu-chun
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 2891 - 2894
  • [48] Importance sampling actor-critic algorithms
    Williams, Jason L.
    Fisher, John W., III
    Willsky, Alan S.
    2006 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2006, 1-12 : 1625 - +
  • [49] Optimal fractional-order PID controller based on fractional-order actor-critic algorithm
    Shalaby, Raafat
    El-Hossainy, Mohammad
    Abo-Zalam, Belal
    Mahmoud, Tarek A.
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (03): : 2347 - 2380
  • [50] Optimal fractional-order PID controller based on fractional-order actor-critic algorithm
    Raafat Shalaby
    Mohammad El-Hossainy
    Belal Abo-Zalam
    Tarek A. Mahmoud
    Neural Computing and Applications, 2023, 35 : 2347 - 2380