Actor-critic learning for optimal building energy management with phase change materials

被引:16
|
作者
Rahimpour, Zahra [1 ]
Verbic, Gregor [1 ]
Chapman, Archie C. [2 ]
机构
[1] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW, Australia
[2] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld, Australia
关键词
Actor-critic; Approximate dynamic programming; Deep deterministic policy gradient; Home energy management; Phase change materials;
D O I
10.1016/j.epsr.2020.106543
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Energy management in buildings using phase change materials (PCM) to improve thermal performance is challenging due to the nonlinear thermal capacity of the PCM. To address this problem, this paper adopts a model-free actor-critic on-policy reinforcement learning method based on deep deterministic policy gradient (DDPG). The proposed approach overcomes the major weakness of model-based approaches, such as approximate dynamic programming (ADP), which require an explicit thermal model of the building under control. This requirement makes a plug-and-play implementation of the energy management algorithm in an existing smart meter difficult due to the wide variety of building design and construction types. To overcome this difficulty, we use a DDPG algorithm that can learn policies in continuous action spaces without access to the full dynamics of the building. We demonstrate the competitive performance of DDPG by benchmarking it against an ADP-based approach with access to the full thermal dynamics of the building.
引用
收藏
页数:7
相关论文
共 50 条
  • [21] A fuzzy Actor-Critic reinforcement learning network
    Wang, Xue-Song
    Cheng, Yu-Hu
    Yi, Jian-Qiang
    [J]. INFORMATION SCIENCES, 2007, 177 (18) : 3764 - 3781
  • [22] A modified actor-critic reinforcement learning algorithm
    Mustapha, SM
    Lachiver, G
    [J]. 2000 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CONFERENCE PROCEEDINGS, VOLS 1 AND 2: NAVIGATING TO A NEW ERA, 2000, : 605 - 609
  • [23] Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning
    Wei, Qinglai
    Wang, Lingxiao
    Liu, Yu
    Polycarpou, Marios M.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (12) : 5245 - 5256
  • [24] Research on actor-critic reinforcement learning in RoboCup
    Guo, He
    Liu, Tianying
    Wang, Yuxin
    Chen, Feng
    Fan, Jianming
    [J]. WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 205 - 205
  • [25] Actor-critic reinforcement learning to estimate the optimal operating conditions of the hydrocracking process
    Oh, Dong-Hoon
    Adams, Derrick
    Nguyen Dat Vo
    Gbadago, Dela Quarme
    Lee, Chang-Ha
    Oh, Min
    [J]. COMPUTERS & CHEMICAL ENGINEERING, 2021, 149
  • [26] An Actor-critic Reinforcement Learning Model for Optimal Bidding in Online Display Advertising
    Yuan, Congde
    Guo, Mengzhuo
    Xiang, Chaoneng
    Wang, Shuangyang
    Song, Guoqing
    Zhang, Qingpeng
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3604 - 3613
  • [27] Actor-Critic reinforcement learning for optimal design of piping support constraint combinations
    Ham, Jong-Ho
    An, Jung-Eun
    Lee, Hee-Sung
    Park, Gun-il
    Lee, Dong-Yeon
    [J]. INTERNATIONAL JOURNAL OF NAVAL ARCHITECTURE AND OCEAN ENGINEERING, 2022, 14
  • [28] The actor-critic learning is behind the matching law: Matching versus optimal behaviors
    Sakai, Yutaka
    Fukai, Tomoki
    [J]. NEURAL COMPUTATION, 2008, 20 (01) : 227 - 251
  • [29] Optimal Scheduling of Regional Integrated Energy System Based on Advantage Learning Soft Actor-critic Algorithm and Transfer Learning
    Luo, Wenjian
    Zhang, Jing
    He, Yu
    Gu, Tingyun
    Nie, Xianglun
    Fan, Luqin
    Yuan, Xufeng
    Li, Bowen
    [J]. Dianwang Jishu/Power System Technology, 2023, 47 (04): : 1601 - 1611
  • [30] Optimal Actor-Critic Policy With Optimized Training Datasets
    Banerjee, Chayan
    Chen, Zhiyong
    Noman, Nasimul
    Zamani, Mohsen
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (06): : 1324 - 1334