An Energy Dynamic Control Algorithm Based on Reinforcement Learning for Data Centers

Cited by: 0
Authors
Xiang, Yao [1]
Yuan, Jingling [1]
Luo, Ruiqi [1]
Zhong, Xian [1]
Li, Tao [2]
Affiliations
[1] Wuhan Univ Technol, Sch Comp Sci & Technol, Wuhan 430070, Hubei, Peoples R China
[2] Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32611 USA
Funding
National Natural Science Foundation of China; U.S. National Science Foundation
Keywords
Reinforcement learning; double Q-learning; dynamic energy control; energy cost reduction; GENERATION; MANAGEMENT; POWER; COST;
DOI
10.1142/S0218001419510091
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In recent years, using renewable energy to reduce the energy cost of internet data centers (IDCs) has become an urgent problem. A growing number of solutions consider machine learning, but many existing methods rely on future information that is difficult to obtain during actual operation. In this paper, we focus on reducing the energy cost of IDCs by controlling the flow of renewable energy without any future information. We propose an efficient energy dynamic control algorithm based on reinforcement learning theory, which approximates the optimal solution by learning from the feedback of historical control decisions. To avoid overestimation and improve the convergence of the algorithm, we further optimize it with the double Q-method. Extensive experimental results show that, compared with other algorithms, our algorithm saves 18.3% of the energy cost and reduces the rate of grid intervention by 26.2% on average, and thus has good application prospects.
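The abstract names the double Q-method as the remedy for overestimation. The sketch below illustrates the general tabular double Q-learning update only; it is not the authors' algorithm. The state and action encoding, the function names `double_q_update` and `choose_action`, and the default `alpha` and `gamma` values are all assumptions made for illustration, since the record gives no details of the paper's state, action, or reward design.

```python
import random


def double_q_update(q_a, q_b, state, action, reward, next_state,
                    alpha=0.1, gamma=0.9, update_a=True):
    """One tabular double Q-learning step (generic form, hypothetical names).

    q_a, q_b: dicts mapping a state to a list of action values.
    update_a: which table to update; a caller would normally flip a coin.
    """
    # The table being updated selects the greedy next action ...
    select, evaluate = (q_a, q_b) if update_a else (q_b, q_a)
    n_actions = len(select[next_state])
    best_next = max(range(n_actions), key=lambda a: select[next_state][a])
    # ... and the other table evaluates it. Decoupling selection from
    # evaluation is what dampens the overestimation of plain Q-learning.
    target = reward + gamma * evaluate[next_state][best_next]
    select[state][action] += alpha * (target - select[state][action])
    return select[state][action]


def choose_action(q_a, q_b, state, epsilon, rng):
    """Epsilon-greedy choice over the sum of both tables."""
    n_actions = len(q_a[state])
    if rng.random() < epsilon:
        return rng.randrange(n_actions)
    return max(range(n_actions), key=lambda a: q_a[state][a] + q_b[state][a])


if __name__ == "__main__":
    # Toy tables: 2 states, 2 actions (purely illustrative values).
    q_a = {0: [0.0, 0.0], 1: [1.0, 5.0]}
    q_b = {0: [0.0, 0.0], 1: [2.0, 3.0]}
    rng = random.Random(0)
    new_value = double_q_update(q_a, q_b, state=0, action=1, reward=1.0,
                                next_state=1, alpha=0.5, gamma=0.9,
                                update_a=rng.random() < 0.5)
    print(new_value, choose_action(q_a, q_b, 1, epsilon=0.0, rng=rng))
```

In a control loop, the caller would pick `update_a` at random on each step so that both tables are trained on roughly half of the observed transitions.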
Pages: 24
Related papers
50 in total
  • [31] An Improved Reinforcement Learning Based Heuristic Dynamic Programming Algorithm for Model-Free Optimal Control
    Li, Jia
    Yuan, Zhaolin
    Ban, Xiaojuan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 282 - 294
  • [32] Cooperative Merging Control Based on Reinforcement Learning With Dynamic Waypoint
    Yang, Xiao
    Liu, Hongfei
    Xu, Miao
    Wan, Jintao
    IEEE ACCESS, 2024, 12 : 81581 - 81592
  • [33] Meta-Reinforcement Learning Algorithm Based on Reward and Dynamic Inference
    Chen, Jinhao
    Zhang, Chunhong
    Hu, Zheng
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT III, PAKDD 2024, 2024, 14647 : 223 - 234
  • [34] Simulation of pedestrian evacuation with reinforcement learning based on a dynamic scanning algorithm
    Huang, Zhongyi
    Liang, Rong
    Xiao, Yao
    Fang, Zhiming
    Li, Xiaolian
    Ye, Rui
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2023, 625
  • [35] A novel dynamic spectrum allocation algorithm based on POMDP reinforcement learning
    Tang, Lun
    Chen, Qian-Bin
    Zeng, Xiao-Ping
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2009, 32 (06): : 125 - 129
  • [36] Dynamic Sizing of Cloud-Native Telco Data Centers With Digital Twin and Reinforcement Learning
    Pentelas, Angelos
    Katsiros, Dimitris
    Paranou, Dimitra
    Doukas, George
    Chondralis, Konstantinos
    Giannopoulos, Giorgos
    Angelou, Evangelos
    Papastefanatos, George
    IEEE ACCESS, 2024, 12 : 91462 - 91479
  • [37] Reinforcement Learning for Dynamic Microfluidic Control
    Dressler, Oliver J.
    Howes, Philip D.
    Choo, Jaebum
    deMello, Andrew J.
    ACS OMEGA, 2018, 3 (08): : 10084 - 10091
  • [38] Deep reinforcement learning towards real-world dynamic thermal management of data centers
    Zhang, Qingang
    Zeng, Wei
    Lin, Qinjie
    Chng, Chin-Boon
    Chui, Chee-Kong
    Lee, Poh-Seng
    APPLIED ENERGY, 2023, 333
  • [39] Data Centers Job Scheduling with Deep Reinforcement Learning
    Liang, Sisheng
    Yang, Zhou
    Jin, Fang
    Chen, Yong
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT II, 2020, 12085 : 906 - 917
  • [40] Dynamic Algorithm Selection Using Reinforcement Learning
    Armstrong, Warren
    Christen, Peter
    McCreath, Eric
    Rendell, Alistair P.
    AIDM 2006: INTERNATIONAL WORKSHOP ON INTEGRATING AI AND DATA MINING, 2006, : 18 - +