Actor-Critic Algorithm for Optimal Synchronization of Kuramoto Oscillator

Cited by: 0
Authors
Vrushabh, D. [1]
Shalini, K. [1]
Sonam, K. [1]
Affiliation
[1] Veermata Jijabai Technol Inst, EED, Mumbai, Maharashtra, India
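Keywords
Reinforcement learning; Hamilton-Jacobi-Bellman; Approximate Dynamic Programming; Kuramoto oscillator; Mean-field game; Order parameter; Networks
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract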
This paper develops a reinforcement learning (RL) based Actor-Critic (AC) algorithm for optimal synchronization of the Kuramoto oscillator. The approach relies on the Ott-Antonsen ansatz, a framework for the dynamics of large networks of interacting units, which reduces the infinite-dimensional dynamics to a low-dimensional phase-space flow for certain systems of globally coupled phase oscillators. The resulting Hamilton-Jacobi-Bellman (HJB) equation is extremely difficult to solve in general, so the paper introduces the AC method to learn approximate optimal control laws for the Kuramoto oscillator model. RL is regarded as one of the efficient methods for solving optimal control problems for nonlinear systems. For a collection of non-homogeneous oscillators, the states are defined as the phase angles, which is a modification of the coupled Kuramoto oscillator model. An admissible initial control policy for the Kuramoto oscillator model is designed, and the problem is solved using RL, giving an approximate solution of the optimal control problem. Finally, local synchronization of the coupled Kuramoto oscillator model is supported by simulation analysis.
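
To make the quantities named in the abstract concrete, the sketch below simulates the standard, uncontrolled, globally coupled Kuramoto model and evaluates the order parameter r, which is near 0 for incoherent phases and near 1 for synchronized ones. It is not the paper's Actor-Critic controller or its Ott-Antonsen reduction. The function names, the Gaussian frequency distribution, and all numerical values (n, dt, steps, coupling) are assumptions chosen only for illustration.

```python
import numpy as np


def order_parameter(theta):
    """Kuramoto order parameter r = |mean(exp(i*theta))|, between 0 and 1."""
    return float(np.abs(np.mean(np.exp(1j * theta))))


def simulate_kuramoto(coupling, n=200, dt=0.01, steps=2000, seed=0):
    """Euler-integrate the uncontrolled, globally coupled Kuramoto model.

    All numerical values here are illustrative assumptions, not the paper's.
    """
    rng = np.random.default_rng(seed)
    omega = rng.normal(0.0, 0.5, size=n)           # non-identical natural frequencies
    theta = rng.uniform(0.0, 2.0 * np.pi, size=n)  # random initial phases
    for _ in range(steps):
        z = np.mean(np.exp(1j * theta))            # complex order parameter r*exp(i*psi)
        # Mean-field identity: (K/N) * sum_j sin(theta_j - theta_i)
        #                      = K * r * sin(psi - theta_i)
        drift = omega + coupling * np.abs(z) * np.sin(np.angle(z) - theta)
        theta = theta + dt * drift
    return theta


r_free = order_parameter(simulate_kuramoto(coupling=0.0))
r_sync = order_parameter(simulate_kuramoto(coupling=4.0))
print(f"r without coupling: {r_free:.3f}, r with coupling K=4: {r_sync:.3f}")
```

Comparing the two runs shows how coupling strength alone moves the population from incoherence toward synchrony; an optimal control formulation such as the one described in the abstract would add a control input to these dynamics and penalize a cost, which this sketch does not attempt.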
Pages: 391-396
Number of pages: 6