Real-world challenges for multi-agent reinforcement learning in grid-interactive buildings

被引：16

作者：

Nweye, Kingsley ^{[1
]}

Liu, Bo ^{[2
]}

Stone, Peter ^{[2
]}

Nagy, Zoltan ^{[1
]}

机构：

[1] Univ Texas Austin, Dept Civil Architectural & Environm Engn, Intelligent Environm Lab, 301 E Dean Keeton St Stop,ECJ 4 200, Austin, TX 78712 USA

[2] Univ Texas Austin, Dept Comp Sci, 2317 Speedway,GDC 2 302, Austin, TX 78712 USA

来源：

ENERGY AND AI | 2022年 / 10卷

关键词：

Grid-interactive buildings; Benchmarking; Reinforcement learning; DEMAND RESPONSE;

D O I：

10.1016/j.egyai.2022.100202

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Building upon prior research that highlighted the need for standardizing environments for building control research, and inspired by recently introduced challenges for real life reinforcement learning (RL) control, here we propose a non-exhaustive set of nine real world challenges for RL control in grid-interactive buildings (GIBs). We argue that research in this area should be expressed in this framework in addition to providing a standardized environment for repeatability. Advanced controllers such as model predictive control (MPC) and RL control have both advantages and disadvantages that prevent them from being implemented in real world problems. Comparisons between the two are rare, and often biased. By focusing on the challenges, we can investigate the performance of the controllers under a variety of situations and generate a fair comparison. As a demonstration, we implement the offline learning challenge in CityLearn, an OpenAI Gym environment for the easy implementation of RL agents in a demand response setting to reshape the aggregated curve of electricity demand by controlling the energy storage of a diverse set of buildings in a district. We use CityLearn to study the impact of different levels of domain knowledge and complexity of RL algorithms and show that the sequence of operations (SOOs) utilized in a rule based controller (RBC) that provides fixed logs to RL agents during offline training affect the performance of the agents when evaluated on a set of four energy flexibility metrics. Longer offline training from an optimized RBC leads to improved performance in the long run. RL agents that train on the logs from a simplified RBC risk poorer performance as the offline training period increases. We also observe no impact on performance from information sharing amongst agents. We call for a more interdisciplinary effort of the research community to address the real world challenges, and unlock the potential of GIB controllers.

引用

页数：10

共 50 条

[41] A survey on multi-agent deep reinforcement learning: from the perspective of challenges and applications
Du, Wei
Ding, Shifei
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (05) : 3215 - 3238
[42] Learning-based demand response in grid-interactive buildings via Gaussian Processes
Ospina, Ana M.
Chen, Yue
Bernstein, Andrey
Dall'Anese, Emiliano
ELECTRIC POWER SYSTEMS RESEARCH, 2022, 211
[43] Two-Stage Reinforcement Learning Policy Search for Grid-Interactive Building Control
Zhang, Xiangyu
Chen, Yue
Bernstein, Andrey
Chintala, Rohit
Graf, Peter
Jin, Xin
Biagioni, David
IEEE TRANSACTIONS ON SMART GRID, 2022, 13 (03) : 1976 - 1987
[44] TEAM POLICY LEARNING FOR MULTI-AGENT REINFORCEMENT LEARNING
Cassano, Lucas
Alghunaim, Sulaiman A.
Sayed, Ali H.
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3062 - 3066
[45] Aggregation Transfer Learning for Multi-Agent Reinforcement learning
Xu, Dongsheng
Qiao, Peng
Dou, Yong
2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 547 - 551
[46] Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Foerster, Jakob N.
Assael, Yannis M.
de Freitas, Nando
Whiteson, Shimon
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
[47] Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
Xu, Zhiwei
Zhang, Bin
Li, Dapeng
Zhang, Zeren
Zhou, Guangchong
Chen, Hao
Fan, Guoliang
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11726 - 11734
[48] Learning structured communication for multi-agent reinforcement learning
Junjie Sheng
Xiangfeng Wang
Bo Jin
Junchi Yan
Wenhao Li
Tsung-Hui Chang
Jun Wang
Hongyuan Zha
Autonomous Agents and Multi-Agent Systems, 2022, 36
[49] Learning structured communication for multi-agent reinforcement learning
Sheng, Junjie
Wang, Xiangfeng
Jin, Bo
Yan, Junchi
Li, Wenhao
Chang, Tsung-Hui
Wang, Jun
Zha, Hongyuan
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2022, 36 (02)
[50] Generalized learning automata for multi-agent reinforcement learning
De Hauwere, Yann-Michael
Vrancx, Peter
Nowe, Ann
AI COMMUNICATIONS, 2010, 23 (04) : 311 - 324

← 1 2 3 4 5 →