Actor-Critic Algorithm for Optimal Synchronization of Kuramoto Oscillator

Cited by: 0

Authors:

Vrushabh, D. [1]

Shalini, K. [1]

Sonam, K. [1]

Affiliations:

[1] Veermata Jijabai Technol Inst, EED, Mumbai, Maharashtra, India

Source:

2020 7TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT'20), VOL 1 | 2020

Keywords:

Reinforcement learning; Hamilton-Jacobi-Bellman; Approximate Dynamic Programming; Kuramoto oscillator; Mean-field game; Order parameter; NETWORKS;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

This paper develops a reinforcement learning (RL) based Actor-Critic (AC) algorithm for the optimal synchronization of the Kuramoto oscillator. The formulation relies on the Ott-Antonsen ansatz for the dynamics of large networks of interacting units, which reduces the infinite-dimensional dynamics to a low-dimensional phase-space flow for certain systems of globally coupled phase oscillators. Because the resulting Hamilton-Jacobi-Bellman (HJB) equation is extremely difficult to solve in general, the paper introduces the AC method to learn approximate optimal control laws for the Kuramoto oscillator model. RL is regarded as an efficient method for solving optimal control problems for nonlinear systems. For a collection of non-homogeneous oscillators, the states are taken to be the phase angles, which modifies the coupled Kuramoto oscillator model. An admissible initial control policy for the Kuramoto oscillator model is designed, and RL is then used to obtain an approximate solution of the optimal control problem. Finally, local synchronization of the coupled Kuramoto oscillator model is supported by simulation results.
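For readers unfamiliar with the keywords "Kuramoto oscillator" and "Order parameter", the following is a minimal sketch of the standard uncontrolled Kuramoto model and its order parameter r. It is not the paper's actor-critic controller, which the abstract does not specify in detail. The function names, the Gaussian frequency distribution, the coupling strength, the step size, and the forward-Euler integrator are all illustrative assumptions.

```python
import cmath
import math
import random


def order_parameter(phases):
    """Return r = |(1/N) * sum_j exp(i * theta_j)|, a value in [0, 1]."""
    return abs(sum(cmath.exp(1j * th) for th in phases) / len(phases))


def simulate_kuramoto(n=50, coupling=2.0, dt=0.01, steps=2000, seed=0):
    """Forward-Euler integration of the standard Kuramoto model (no control input).

    d(theta_i)/dt = omega_i + (K / N) * sum_j sin(theta_j - theta_i)

    All default values are assumptions chosen for illustration only.
    """
    rng = random.Random(seed)
    # Non-homogeneous natural frequencies (assumed Gaussian, std 0.5).
    omegas = [rng.gauss(0.0, 0.5) for _ in range(n)]
    # Random initial phase angles; these are the states of the system.
    phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(n)]
    history = [order_parameter(phases)]
    for _ in range(steps):
        # Mean-field identity:
        # (1/N) * sum_j sin(theta_j - theta_i) = r * sin(psi - theta_i),
        # where r * exp(i * psi) is the complex order parameter.
        z = sum(cmath.exp(1j * th) for th in phases) / n
        r, psi = abs(z), cmath.phase(z)
        phases = [
            th + dt * (w + coupling * r * math.sin(psi - th))
            for th, w in zip(phases, omegas)
        ]
        history.append(order_parameter(phases))
    return history


if __name__ == "__main__":
    hist = simulate_kuramoto()
    print(f"initial r = {hist[0]:.3f}, final r = {hist[-1]:.3f}")
```

An order parameter near 0 indicates incoherent phases and a value near 1 indicates synchronization. In this sketch, synchronization comes only from a fixed coupling strength, whereas the paper learns a control law for it.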

Pages: 391-396

Page count: 6
