Fast Online Reinforcement Learning of Distributed Optimal Controller for Large-Scale Network Systems

被引:1
|
作者
Hoshiya, Tomoki [1 ]
Sadamoto, Tomonori [1 ]
机构
[1] Univ Electrocommun, Grad Sch Informat & Engn, Dept Mech & Intelli Gent Syst Engn, 1-5-1 Chofu Gaoka, Chofu, Tokyo 1828585, Japan
关键词
MODEL-REDUCTION;
D O I
10.1109/CCTA48906.2021.9659050
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a fast real-time reinforcement learning (RL) control algorithm to design distributed controllers for large-scale network systems. When network size is large, existing RL-based methodologies can result in unacceptably long learning time, making them unsuitable for real-time control. The proposed approach overcomes this issue by aggregating states while keeping the aggregation error as small as possible. The aggregation matrix is constructed by a kind of sparse singular value decomposition of data. Next, a distributed controller is learned using the aggregated data by the RL method which is modified to promote sparsity of the controller by l(1)-regularization. Because of the structure of the aggregation matrix, the resultant controller can have a highly sparse structure. The efficiency of the proposed method is shown through a numerical simulation of a complex network system whose graph structure is described by the Barabasi-albert model.
引用
收藏
页码:1135 / 1141
页数:7
相关论文
共 50 条
  • [21] Fast and Lightweight Online Person Search for Large-Scale Surveillance Systems
    Specker, Andreas
    Moritz, Lennart
    Cormier, Mickael
    Beyerer, Juergen
    [J]. 2022 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2022), 2022, : 570 - 580
  • [22] An optimal distributed trigger counting algorithm for large-scale networked systems
    Kim, Seokhyun
    Lee, Jaeheung
    Park, Yongsu
    Cho, Yookun
    [J]. SIMULATION-TRANSACTIONS OF THE SOCIETY FOR MODELING AND SIMULATION INTERNATIONAL, 2013, 89 (07): : 846 - 859
  • [23] Deep reinforcement learning for scheduling in large-scale networked control systems
    Redder, Adrian
    Ramaswamy, Arunselvan
    Quevedo, Daniel E.
    [J]. IFAC PAPERSONLINE, 2019, 52 (20): : 333 - 338
  • [24] Optimal allocation of fast charging stations for large-scale transportation systems
    dos Santos, Caio
    Andrade, Jose C. G.
    Oliveira, Washington A.
    Lyra, Christiano
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2024, 62 (14) : 5087 - 5107
  • [25] Large-scale network intrusion detection algorithm based on distributed learning
    College of Computer Science and Technology, Jilin University, Changchun 130012, China
    不详
    [J]. Ruan Jian Xue Bao/Journal of Software, 2008, 19 (04): : 993 - 1003
  • [26] Large-scale network intrusion detection based on distributed learning algorithm
    Tian, Daxin
    Liu, Yanheng
    Xiang, Yang
    [J]. INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2009, 8 (01) : 25 - 35
  • [27] Distributed Resource Scheduling for Large-Scale MEC Systems: A Multiagent Ensemble Deep Reinforcement Learning With Imitation Acceleration
    Jiang, Feibo
    Dong, Li
    Wang, Kezhi
    Yang, Kun
    Pan, Cunhua
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (09) : 6597 - 6610
  • [28] Large-scale network intrusion detection based on distributed learning algorithm
    Daxin Tian
    Yanheng Liu
    Yang Xiang
    [J]. International Journal of Information Security, 2009, 8 : 25 - 35
  • [29] Distributed Task Offloading for Large-Scale VEC Systems: A Multi-agent Deep Reinforcement Learning Method
    Lu, Yanfei
    Han, Dengyu
    Wang, Xiaoxuan
    Gao, Qinghe
    [J]. 2022 14TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2022), 2022, : 161 - 165
  • [30] Multivariable Three-Term Optimal Controller Design for Large-Scale Systems
    Davison, Edward J.
    Davison, Daniel E.
    Lam, Simon
    [J]. PROCEEDINGS OF THE 48TH IEEE CONFERENCE ON DECISION AND CONTROL, 2009 HELD JOINTLY WITH THE 2009 28TH CHINESE CONTROL CONFERENCE (CDC/CCC 2009), 2009, : 940 - 945