Actor-critic learning for optimal building energy management with phase change materials

被引:16
|
作者
Rahimpour, Zahra [1 ]
Verbic, Gregor [1 ]
Chapman, Archie C. [2 ]
机构
[1] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW, Australia
[2] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld, Australia
关键词
Actor-critic; Approximate dynamic programming; Deep deterministic policy gradient; Home energy management; Phase change materials;
D O I
10.1016/j.epsr.2020.106543
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Energy management in buildings using phase change materials (PCM) to improve thermal performance is challenging due to the nonlinear thermal capacity of the PCM. To address this problem, this paper adopts a model-free actor-critic on-policy reinforcement learning method based on deep deterministic policy gradient (DDPG). The proposed approach overcomes the major weakness of model-based approaches, such as approximate dynamic programming (ADP), which require an explicit thermal model of the building under control. This requirement makes a plug-and-play implementation of the energy management algorithm in an existing smart meter difficult due to the wide variety of building design and construction types. To overcome this difficulty, we use a DDPG algorithm that can learn policies in continuous action spaces without access to the full dynamics of the building. We demonstrate the competitive performance of DDPG by benchmarking it against an ADP-based approach with access to the full thermal dynamics of the building.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Actor-Critic Algorithm for Optimal Synchronization of Kuramoto Oscillator
    Vrushabh, D.
    Shalini, K.
    Sonam, K.
    [J]. 2020 7TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT'20), VOL 1, 2020, : 391 - 396
  • [32] Optimal scheduling strategy of electricity and thermal energy storage based on soft actor-critic reinforcement learning approach
    Zheng, Yingying
    Wang, Hui
    Wang, Jinglong
    Wang, Zichong
    Zhao, Yongning
    [J]. JOURNAL OF ENERGY STORAGE, 2024, 92
  • [33] Variational value learning in advantage actor-critic reinforcement learning
    Zhang, Yaozhong
    Han, Jiaqi
    Hu, Xiaofang
    Dan, Shihao
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 1955 - 1960
  • [34] Energy optimization management of microgrid using improved soft actor-critic algorithm
    Yu, Zhiwen
    Zheng, Wenjie
    Zeng, Kaiwen
    Zhao, Ruifeng
    Zhang, Yanxu
    Zeng, Mengdi
    [J]. INTERNATIONAL JOURNAL OF RENEWABLE ENERGY DEVELOPMENT-IJRED, 2024, 13 (02): : 329 - 339
  • [35] A bounded actor-critic reinforcement learning algorithm applied to airline revenue management
    Lawhead, Ryan J.
    Gosavi, Abhijit
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 82 : 252 - 262
  • [36] Actor-Critic Learning Based on Adaptive Importance Sampling
    Cheng Yuhu
    Feng Huanting
    Wang Xuesong
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2010, 19 (04) : 583 - 588
  • [37] DAC: The Double Actor-Critic Architecture for Learning Options
    Zhang, Shangtong
    Whiteson, Shimon
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [38] Merging with Extraction Method for Transfer Learning in Actor-Critic
    Takano, Toshiaki
    Takase, Haruhiko
    Kawanaka, Hiroharu
    Tsuruoka, Shinji
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2011, 15 (07) : 814 - 821
  • [39] Efficient Model Learning Methods for Actor-Critic Control
    Grondman, Ivo
    Vaandrager, Maarten
    Busoniu, Lucian
    Babuska, Robert
    Schuitema, Erik
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (03): : 591 - 602
  • [40] Actor-critic reinforcement learning for bidding in bilateral negotiation
    Arslan, Furkan
    Aydogan, Reyhan
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2022, 30 (05) : 1695 - 1714