Real-world challenges for multi-agent reinforcement learning in grid-interactive buildings

被引:16
|
作者
Nweye, Kingsley [1 ]
Liu, Bo [2 ]
Stone, Peter [2 ]
Nagy, Zoltan [1 ]
机构
[1] Univ Texas Austin, Dept Civil Architectural & Environm Engn, Intelligent Environm Lab, 301 E Dean Keeton St Stop,ECJ 4 200, Austin, TX 78712 USA
[2] Univ Texas Austin, Dept Comp Sci, 2317 Speedway,GDC 2 302, Austin, TX 78712 USA
关键词
Grid-interactive buildings; Benchmarking; Reinforcement learning; DEMAND RESPONSE;
D O I
10.1016/j.egyai.2022.100202
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Building upon prior research that highlighted the need for standardizing environments for building control research, and inspired by recently introduced challenges for real life reinforcement learning (RL) control, here we propose a non-exhaustive set of nine real world challenges for RL control in grid-interactive buildings (GIBs). We argue that research in this area should be expressed in this framework in addition to providing a standardized environment for repeatability. Advanced controllers such as model predictive control (MPC) and RL control have both advantages and disadvantages that prevent them from being implemented in real world problems. Comparisons between the two are rare, and often biased. By focusing on the challenges, we can investigate the performance of the controllers under a variety of situations and generate a fair comparison. As a demonstration, we implement the offline learning challenge in CityLearn, an OpenAI Gym environment for the easy implementation of RL agents in a demand response setting to reshape the aggregated curve of electricity demand by controlling the energy storage of a diverse set of buildings in a district. We use CityLearn to study the impact of different levels of domain knowledge and complexity of RL algorithms and show that the sequence of operations (SOOs) utilized in a rule based controller (RBC) that provides fixed logs to RL agents during offline training affect the performance of the agents when evaluated on a set of four energy flexibility metrics. Longer offline training from an optimized RBC leads to improved performance in the long run. RL agents that train on the logs from a simplified RBC risk poorer performance as the offline training period increases. We also observe no impact on performance from information sharing amongst agents. We call for a more interdisciplinary effort of the research community to address the real world challenges, and unlock the potential of GIB controllers.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] A survey on multi-agent deep reinforcement learning: from the perspective of challenges and applications
    Du, Wei
    Ding, Shifei
    ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (05) : 3215 - 3238
  • [42] Learning-based demand response in grid-interactive buildings via Gaussian Processes
    Ospina, Ana M.
    Chen, Yue
    Bernstein, Andrey
    Dall'Anese, Emiliano
    ELECTRIC POWER SYSTEMS RESEARCH, 2022, 211
  • [43] Two-Stage Reinforcement Learning Policy Search for Grid-Interactive Building Control
    Zhang, Xiangyu
    Chen, Yue
    Bernstein, Andrey
    Chintala, Rohit
    Graf, Peter
    Jin, Xin
    Biagioni, David
    IEEE TRANSACTIONS ON SMART GRID, 2022, 13 (03) : 1976 - 1987
  • [44] TEAM POLICY LEARNING FOR MULTI-AGENT REINFORCEMENT LEARNING
    Cassano, Lucas
    Alghunaim, Sulaiman A.
    Sayed, Ali H.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3062 - 3066
  • [45] Aggregation Transfer Learning for Multi-Agent Reinforcement learning
    Xu, Dongsheng
    Qiao, Peng
    Dou, Yong
    2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 547 - 551
  • [46] Learning to Communicate with Deep Multi-Agent Reinforcement Learning
    Foerster, Jakob N.
    Assael, Yannis M.
    de Freitas, Nando
    Whiteson, Shimon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [47] Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
    Xu, Zhiwei
    Zhang, Bin
    Li, Dapeng
    Zhang, Zeren
    Zhou, Guangchong
    Chen, Hao
    Fan, Guoliang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11726 - 11734
  • [48] Learning structured communication for multi-agent reinforcement learning
    Junjie Sheng
    Xiangfeng Wang
    Bo Jin
    Junchi Yan
    Wenhao Li
    Tsung-Hui Chang
    Jun Wang
    Hongyuan Zha
    Autonomous Agents and Multi-Agent Systems, 2022, 36
  • [49] Learning structured communication for multi-agent reinforcement learning
    Sheng, Junjie
    Wang, Xiangfeng
    Jin, Bo
    Yan, Junchi
    Li, Wenhao
    Chang, Tsung-Hui
    Wang, Jun
    Zha, Hongyuan
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2022, 36 (02)
  • [50] Generalized learning automata for multi-agent reinforcement learning
    De Hauwere, Yann-Michael
    Vrancx, Peter
    Nowe, Ann
    AI COMMUNICATIONS, 2010, 23 (04) : 311 - 324