Actor-Critic Algorithm for Optimal Synchronization of Kuramoto Oscillator

被引:0
|
作者
Vrushabh, D. [1 ]
Shalini, K. [1 ]
Sonam, K. [1 ]
机构
[1] Veermata Jijabai Technol Inst, EED, Mumbai, Maharashtra, India
关键词
Reinforcement learning; Hamilton-Jacobi-Bellman; Approximate Dynamic Programming; Kuramoto oscillator; Mean-field game; Order parameter; NETWORKS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper constructs a reinforcement learning (RL) based algorithm of Actor-Critic (AC) for the optimal synchronism of the Kuramoto oscillator. This is accomplished through the Ott-Antonsen ansatz framework for the dynamics of large interactive unit networks. Besides, this approach reduces the infinite-dimensional dynamics to phase space flow, i.e., low dimensional dynamics for certain systems of globally coupled phase oscillators. The resulting Hamiltonian-Jacobi-Bellman (HJB) expression is extremely difficult to solve in general, therefore this paper introduces the AC method for learning approximate optimal control laws for the Kuramoto oscillator model. RL has been contemplated as one of the efficient methods to solve optimal control of non-linear systems. For a collection of non-homogeneous oscillators, the states are elucidated as phase angles, which is the modification of the model for a coupled Kuramoto oscillator. An admissible initial control policy for the Kuramoto oscillator model is designed and solved using RL giving an approximate solution of the optimal control problem. Finally, local synchronism of the coupled Kuramoto oscillator model is supported through simulations analysis.
引用
收藏
页码:391 / 396
页数:6
相关论文
共 50 条
  • [1] A Hessian Actor-Critic Algorithm
    Wang, Jing
    Paschalidis, Ioannis Ch
    [J]. 2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 1131 - 1136
  • [2] An Actor-Critic Algorithm With Second-Order Actor and Critic
    Wang, Jing
    Paschalidis, Ioannis Ch.
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (06) : 2689 - 2703
  • [3] An Actor-Critic Algorithm for SVM Hyperparameters
    Kim, Chayoung
    Park, Jung-min
    Kim, Hye-young
    [J]. INFORMATION SCIENCE AND APPLICATIONS 2018, ICISA 2018, 2019, 514 : 653 - 661
  • [4] Application of actor-critic learning algorithm for optimal bidding problem of a Genco
    Gajjar, GR
    Khaparde, SA
    Nagaraju, P
    Soman, SA
    [J]. 2003 IEEE POWER ENGINEERING SOCIETY GENERAL MEETING, VOLS 1-4, CONFERENCE PROCEEDINGS, 2003, : 818 - 818
  • [5] Application of actor-critic learning algorithm for optimal bidding problem of a Genco
    Gajjar, GR
    Khaparde, SA
    Nagaraju, P
    Soman, SA
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2003, 18 (01) : 11 - 18
  • [6] A Finite Sample Analysis of the Actor-Critic Algorithm
    Yang, Zhuoran
    Zhang, Kaiqing
    Hong, Mingyi
    Basar, Tamer
    [J]. 2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 2759 - 2764
  • [7] Actor-Critic Algorithm with Transition Cost Estimation
    Sergey, Denisov
    Lee, Jee-Hyong
    [J]. INTERNATIONAL JOURNAL OF FUZZY LOGIC AND INTELLIGENT SYSTEMS, 2016, 16 (04) : 270 - 275
  • [8] The Effect of Discounting Actor-loss in Actor-Critic Algorithm
    Yaputra, Jordi
    Suyanto, Suyanto
    [J]. 2021 4TH INTERNATIONAL SEMINAR ON RESEARCH OF INFORMATION TECHNOLOGY AND INTELLIGENT SYSTEMS (ISRITI 2021), 2020,
  • [9] A modified actor-critic reinforcement learning algorithm
    Mustapha, SM
    Lachiver, G
    [J]. 2000 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CONFERENCE PROCEEDINGS, VOLS 1 AND 2: NAVIGATING TO A NEW ERA, 2000, : 605 - 609
  • [10] A simultaneous deterministic perturbation actor-critic algorithm with an application to optimal mortgage refinancing
    Chinthalapati, V. L. Raju
    Bhatnagar, S.
    [J]. PROCEEDINGS OF THE 45TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2006, : 4151 - 4156