Actor-critic learning for optimal building energy management with phase change materials

Cited by: 16
Authors
Rahimpour, Zahra [1]
Verbic, Gregor [1]
Chapman, Archie C. [2]
Affiliations
[1] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW, Australia
[2] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld, Australia
Keywords
Actor-critic; Approximate dynamic programming; Deep deterministic policy gradient; Home energy management; Phase change materials;
DOI
10.1016/j.epsr.2020.106543
Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic and communication technology]
Discipline classification code
0808; 0809
Abstract
Energy management in buildings using phase change materials (PCM) to improve thermal performance is challenging due to the nonlinear thermal capacity of the PCM. To address this problem, this paper adopts a model-free actor-critic on-policy reinforcement learning method based on deep deterministic policy gradient (DDPG). The proposed approach overcomes the major weakness of model-based approaches, such as approximate dynamic programming (ADP), which require an explicit thermal model of the building under control. This requirement makes a plug-and-play implementation of the energy management algorithm in an existing smart meter difficult due to the wide variety of building design and construction types. To overcome this difficulty, we use a DDPG algorithm that can learn policies in continuous action spaces without access to the full dynamics of the building. We demonstrate the competitive performance of DDPG by benchmarking it against an ADP-based approach with access to the full thermal dynamics of the building.
Pages: 7
Related papers
50 in total
  • [1] Optimal Energy Management of Energy Internet: A Distributed Actor-Critic Reinforcement Learning Method
    Cheng, Yijun
    Peng, Jun
    Gu, Xin
    Jiang, Fu
    Li, Heng
    Liu, Weirong
    Huang, Zhiwu
    [J]. 2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 521 - 526
  • [2] Generalized Actor-Critic Learning Optimal Control in Smart Home Energy Management
    Wei, Qinglai
    Liao, Zehua
    Shi, Guang
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (10) : 6614 - 6623
  • [3] An improved Soft Actor-Critic strategy for optimal energy management
    Boato, Bruno
    Sueldo, Carolina Saavedra
    Avila, Luis
    de Paula, Mariano
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2023, 21 (09) : 958 - 965
  • [4] A soft actor-critic reinforcement learning framework for optimal energy management in electric vehicles with hybrid storage
    Mazzi, Yahia
    Ben Sassi, Hicham
    Errahimi, Fatima
    Es-Sbai, Najia
    [J]. JOURNAL OF ENERGY STORAGE, 2024, 99
  • [5] Granular computing in actor-critic learning
    Peters, James F.
    [J]. 2007 IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTATIONAL INTELLIGENCE, VOLS 1 AND 2, 2007, : 59 - 64
  • [6] Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning
    Zhu, Hanlin
    Rashidinejad, Paria
    Jiao, Jiantao
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [7] Application of actor-critic learning algorithm for optimal bidding problem of a Genco
    Gajjar, GR
    Khaparde, SA
    Nagaraju, P
    Soman, SA
    [J]. 2003 IEEE POWER ENGINEERING SOCIETY GENERAL MEETING, VOLS 1-4, CONFERENCE PROCEEDINGS, 2003, : 818 - 818
  • [8] Optimal Policy of Multiplayer Poker via Actor-Critic Reinforcement Learning
    Shi, Daming
    Guo, Xudong
    Liu, Yi
    Fan, Wenhui
    [J]. ENTROPY, 2022, 24 (06)
  • [9] Application of actor-critic learning algorithm for optimal bidding problem of a Genco
    Gajjar, GR
    Khaparde, SA
    Nagaraju, P
    Soman, SA
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2003, 18 (01) : 11 - 18
  • [10] Reward Shaping-Based Actor-Critic Deep Reinforcement Learning for Residential Energy Management
    Lu, Renzhi
    Jiang, Zhenyu
    Wu, Huaming
    Ding, Yuemin
    Wang, Dong
    Zhang, Hai-Tao
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (03) : 2662 - 2673