An Improved Deep Deterministic Policy Gradient Pantograph Active Control Strategy for High-Speed Railways

被引:0
|
作者
Wang, Ying [1 ,2 ]
Wang, Yuting [1 ]
Chen, Xiaoqiang [1 ,2 ]
Wang, Yixuan [1 ]
Chang, Zhanning [3 ]
机构
[1] Lanzhou Jiaotong Univ, Sch Automat & Elect Engn, Lanzhou 730070, Peoples R China
[2] Lanzhou Jiaotong Univ, Key Lab Optotechnol & Intelligent Control, Minist Educ, Lanzhou 730070, Peoples R China
[3] China Railway Lanzhou Bur Grp Co Ltd, Power Supply Dept, Lanzhou 730070, Peoples R China
基金
中国国家自然科学基金;
关键词
pantograph-catenary system coupling; improved deep deterministic policy gradient; pantograph active control; CATENARY SYSTEM; ROBOTS;
D O I
10.3390/electronics13173545
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The pantograph-catenary system (PCS) is essential for trains to obtain electrical energy. As the train's operating speed increases, the vibration between the pantograph and the catenary intensifies, reducing the quality of the current collection. Active control may significantly reduce the vibration of the PCS, effectively lower the cost of line retrofitting, and enhance the quality of the current collection. This article proposes an improved deep deterministic policy gradient (IDDPG) for the pantograph active control problem, which delays updating the Actor and Target-Actor networks and adopts a reconstructed experience replay mechanism. The deep reinforcement learning (DRL) environment module was first established by creating a PCS coupling model. On this basis, the controller's DRL module is precisely designed using the IDDPG strategy. Ultimately, the control strategy is integrated with the PCS for training, and the controller's performance is validated on the PCS. Simulation experiments show that the improved strategy significantly reduces the training time, enhances the steady-state performance of the agent during later training stages, and effectively reduces the standard deviation of the pantograph-catenary contact force (PCCF) by an average of over 51.44%, effectively improving the quality of current collection.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Estimator-based multiobjective robust control strategy for an active pantograph in high-speed railways
    Lu, Xiaobing
    Liu, Zhigang
    Song, Yang
    Wang, Hongrui
    Zhang, Jing
    Wang, Yanbo
    [J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART F-JOURNAL OF RAIL AND RAPID TRANSIT, 2018, 232 (04) : 1064 - 1077
  • [2] High-speed Train Operation Adjustment Strategy Based on Deep Deterministic Policy Gradient
    Rui Luo
    Wei ShangGuan
    Rong, Dingchao
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 549 - 554
  • [3] Deep Reinforcement Learning Based Active Pantograph Control Strategy in High-Speed Railway
    Wang, Hui
    Han, Zhiwei
    Liu, Zhigang
    Wu, Yanbo
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (01) : 227 - 238
  • [4] A New Active Control Strategy for Pantograph in High-Speed Electrified Railways Based on Multi-Objective Robust Control
    Zhang, Jing
    Zhang, Hantao
    Song, Baolin
    Xie, Songlin
    Liu, Zhigang
    [J]. IEEE ACCESS, 2019, 7 : 173719 - 173730
  • [5] Active control of a high-speed pantograph
    OConnor, DN
    Eppinger, SD
    Seering, WP
    Wormley, DN
    [J]. JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 1997, 119 (01): : 1 - 4
  • [6] Offline Meta-Reinforcement Learning for Active Pantograph Control in High-Speed Railways
    Wang, Hui
    Liu, Zhigang
    Hu, Guiyang
    Wang, Xufan
    Han, Zhiwei
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (08) : 10669 - 10679
  • [7] Deep Deterministic Policy Gradient for High-Speed Train Trajectory Optimization
    Ning, Lingbin
    Zhou, Min
    Hou, Zhuopu
    Goverde, Rob M. P.
    Wang, Fei-Yue
    Dong, Hairong
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 11562 - 11574
  • [8] Active control of fuzzy for high-speed pantograph
    Yang Gang
    Dai ZHongmei
    Li Fu
    Luo ZHengzhi
    [J]. NEW TRENDS IN MECHANICAL ENGINEERING AND MATERIALS, 2013, 251 : 158 - +
  • [9] Active Pantograph Control of Deep Reinforcement Learning Based on Double Delay Depth Deterministic Strategy Gradient
    Wu, Yanbo
    Han, Zhiwei
    Wang, Hui
    Liu, Zhigang
    Zhang, Yujing
    [J]. Diangong Jishu Xuebao/Transactions of China Electrotechnical Society, 2024, 39 (14): : 4547 - 4556
  • [10] Optimizing Active Control Scheme of High-speed Pantograph
    Wu Yan
    Zheng Ji Hao
    Zheng, Trillion Q.
    [J]. 2009 IEEE 6TH INTERNATIONAL POWER ELECTRONICS AND MOTION CONTROL CONFERENCE, VOLS 1-4, 2009, : 591 - 595