Safe Building HVAC Control via Batch Reinforcement Learning

被引:8
|
作者
Zhang, Chi [1 ]
Kuppannagari, Sanmukh Rao [2 ]
Prasanna, Viktor K. [2 ]
机构
[1] Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90089 USA
[2] Univ Southern Calif, Dept Elect & Comp Engn, Los Angeles, CA 90089 USA
来源
基金
美国国家科学基金会;
关键词
Batch reinforcement learning; safe building HVAC control; model-based offline performance evaluation; SYSTEM;
D O I
10.1109/TSUSC.2022.3164084
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we study safe building HVAC control via batch reinforcement learning. Random exploration in building HVAC control is infeasible due to safety considerations. However, diverse states are necessary for RL algorithms to learn useful policies. To enable safety during exploration, we propose guided exploration by adding a Gaussian noise to a hand-crafted rule-based controller. Adjusting the variance of the noise provides a tradeoff between the diversity of the dataset and the safety. We apply Conservative Q Learning (CQL) to learn a policy. CQL ensures that the trained policy stays within the policy distribution used to collect the dataset, thereby guarantees safety at deployment. To select the optimal policy during the offline training, we apply model-based performance evaluation. We use the widely adopted CityLearn testbed to evaluate the performance of our proposed method. Compared with a rule-based controller, our approach obtains 12% similar to 35% reduction in ramping, 3% similar to 10% reduction in 1-load factor, 3% similar to 8% reduction in daily peak at deployment with less than 10% performance degradation during the exploration. On the contrary, the performance degradation of the state-of-the-art online reinforcement learning algorithm during exploration is around 8% 18%. It also fails to surpass the performance of the rule-based controller at deployment.
引用
收藏
页码:923 / 934
页数:12
相关论文
共 50 条
  • [1] Safe HVAC Control via Batch Reinforcement Learning
    Liu, Hsin-Yu
    Balaji, Bharathan
    Gao, Sicun
    Gupta, Rajesh
    Hong, Dezhi
    [J]. 2022 13TH ACM/IEEE INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS (ICCPS 2022), 2022, : 181 - 192
  • [2] Reinforcement Learning for Control of Building HVAC Systems
    Raman, Naren Srivaths
    Devraj, Adithya M.
    Barooah, Prabir
    Meyn, Sean P.
    [J]. 2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 2326 - 2332
  • [3] Deep Reinforcement Learning for Building HVAC Control
    Wei, Tianshu
    Wang, Yanzhi
    Zhu, Qi
    [J]. PROCEEDINGS OF THE 2017 54TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2017,
  • [4] Safe chance constrained reinforcement learning for batch process control
    Mowbray, M.
    Petsagkourakis, R.
    del Rio-Chanona, E. A.
    Zhang, D.
    [J]. COMPUTERS & CHEMICAL ENGINEERING, 2022, 157
  • [5] Reinforcement learning for whole-building HVAC control and demand response
    Azuatalam, Donald
    Lee, Wee-Lih
    de Nijs, Frits
    Liebman, Ariel
    [J]. ENERGY AND AI, 2020, 2
  • [6] Building HVAC control with reinforcement learning for reduction of energy cost and demand charge
    Jiang, Zhanhong
    Risbeck, Michael J.
    Ramamurti, Vish
    Murugesan, Sugumar
    Amores, Jaume
    Zhang, Chenlu
    Lee, Young M.
    Drees, Kirk H.
    [J]. ENERGY AND BUILDINGS, 2021, 239
  • [7] Accelerate Online Reinforcement Learning for Building HVAC Control with Heterogeneous Expert Guidances
    Xu, Shichao
    Fu, Yangyang
    Wang, Yixuan
    Yang, Zhuoran
    O'Neill, Zheng
    Wang, Zhaoran
    Zhu, Qi
    [J]. PROCEEDINGS OF THE 2022 THE 9TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, BUILDSYS 2022, 2022, : 89 - 98
  • [8] Modelling building HVAC control strategies using a deep reinforcement learning approach
    Nguyen, Anh Tuan
    Pham, Duy Hoang
    Oo, Bee Lan
    Santamouris, Mattheos
    Ahn, Yonghan
    Lim, Benson T. H.
    [J]. ENERGY AND BUILDINGS, 2024, 310
  • [9] On Normative Reinforcement Learning via Safe Reinforcement Learning
    Neufeld, Emery A.
    Bartocci, Ezio
    Ciabattoni, Agata
    [J]. PRIMA 2022: PRINCIPLES AND PRACTICE OF MULTI-AGENT SYSTEMS, 2023, 13753 : 72 - 89
  • [10] Advanced Building Control via Deep Reinforcement Learning
    Jia, Ruoxi
    Jin, Ming
    Sun, Kaiyu
    Hong, Tianzhen
    Spanos, Costas
    [J]. INNOVATIVE SOLUTIONS FOR ENERGY TRANSITIONS, 2019, 158 : 6158 - 6163