Towards optimal control of air handling units using deep reinforcement learning and recurrent neural network

Cited by: 115
Authors
Zou, Zhengbo [1]
Yu, Xinran [1]
Ergan, Semiha [1]
Affiliations
[1] NYU, Dept Civil & Urban Engn, MetroTech Ctr 15, Brooklyn, NY 11201 USA
Keywords
HVAC control; Energy consumption; Thermal comfort; Deep reinforcement learning; Long-short-term-memory network; ENERGY-CONSUMPTION; BUILDINGS; SYSTEMS; COMFORT; MODELS;
DOI
10.1016/j.buildenv.2019.106535
Chinese Library Classification (CLC) number
TU [Building Science]
Discipline classification code
0813
Abstract
Optimal control of heating, ventilation and air conditioning systems (HVACs) aims to minimize the energy consumption of equipment while maintaining the thermal comfort of occupants. Traditional rule-based control methods are not optimized for HVAC systems with continuous sensor readings and actuator controls. Recent developments in deep reinforcement learning (DRL) have enabled control of HVACs with continuous sensor inputs and actions, while eliminating the need to build complex thermodynamic models. DRL control includes an environment, which approximates real-world HVAC operations, and an agent, which aims to achieve optimal control over the HVAC. Existing DRL control frameworks use simulation tools (e.g., EnergyPlus) to build DRL training environments with HVAC systems information, but oversimplify building geometries. This study proposes a framework aiming to achieve optimal control over Air Handling Units (AHUs) by implementing long-short-term-memory (LSTM) networks to approximate real-world HVAC operations to build DRL training environments. The framework also implements state-of-the-art DRL algorithms (e.g., deep deterministic policy gradient) for optimal control over the AHUs. Three AHUs, each with two years of building automation system (BAS) data, were used as testbeds for evaluation. Our LSTM-based DRL training environments, built using the first year's BAS data, achieved an average mean square error of 0.0015 across 16 normalized AHU parameters. When deployed in the testing environments, which were built using the second year's BAS data of the same AHUs, the DRL agents achieved 27%-30% energy savings compared to the actual energy consumption, while maintaining the predicted percentage of discomfort (PPD) at 10%.
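The trade-off stated in the abstract (less energy, with PPD held near 10%) can be illustrated with a small sketch. The PPD formula below is the standard ISO 7730 relation between predicted mean vote (PMV) and PPD; whether the paper computes PPD this way is an assumption. The `reward` function, its `ppd_limit` and `comfort_weight` parameters, and the energy unit are hypothetical and are not taken from the paper.

```python
import math


def ppd_from_pmv(pmv: float) -> float:
    """PPD (%) as a function of predicted mean vote (ISO 7730 relation)."""
    return 100.0 - 95.0 * math.exp(-0.03353 * pmv**4 - 0.2179 * pmv**2)


def reward(energy_kwh: float, pmv: float,
           ppd_limit: float = 10.0, comfort_weight: float = 1.0) -> float:
    """Hypothetical reward: penalize energy use and any PPD above ppd_limit.

    The linear form and the weights are illustrative assumptions only.
    """
    discomfort = max(0.0, ppd_from_pmv(pmv) - ppd_limit)
    return -energy_kwh - comfort_weight * discomfort


if __name__ == "__main__":
    # PPD is lowest at PMV = 0 and reaches roughly 10% at |PMV| = 0.5.
    for pmv in (0.0, 0.5, 1.0):
        print(pmv, round(ppd_from_pmv(pmv), 2), round(reward(2.0, pmv), 2))
```

Under this relation, a PPD of 10% corresponds to a PMV of about plus or minus 0.5, which is why a 10% target is a common comfort bound.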
Pages: 15
Related papers
50 in total
  • [1] Optimal Control of Active Distribution Network using Deep Reinforcement Learning
    Tahir, Yameena
    Khan, Muhammad Faisal Nadeem
    Sajjad, Intisar Ali
    Martirano, Luigi
    2022 IEEE INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING AND 2022 IEEE INDUSTRIAL AND COMMERCIAL POWER SYSTEMS EUROPE (EEEIC / I&CPS EUROPE), 2022,
  • [2] A recurrent control neural network for data efficient reinforcement learning
    Schaefer, Anton Maximilian
    Udluft, Steffen
    Zimmermann, Hans-Georg
    2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007, : 151 - +
  • [3] Nonlinear Optimal Control Using Deep Reinforcement Learning
    Bucci, Michele Alessandro
    Semeraro, Onofrio
    Allauzen, Alexandre
    Cordier, Laurent
    Mathelin, Lionel
    IUTAM LAMINAR-TURBULENT TRANSITION, 2022, 38 : 279 - 290
  • [4] Emergence of Prediction by Reinforcement Learning Using a Recurrent Neural Network
    Goto, Kenta
    Shibata, Katsunari
    JOURNAL OF ROBOTICS, 2010, 2010
  • [5] Identification and optimal control of nonlinear systems using recurrent neural networks and reinforcement learning: An overview
    Perrusquia, Adolfo
    Yu, Wen
    NEUROCOMPUTING, 2021, 438 : 145 - 154
  • [6] Improvement of air handling unit control performance using reinforcement learning
    Youk, Sangjo
    Kim, Moonseong
    Kim, Yangsok
    Park, Gilcheol
    ADVANCES IN KNOWLEDGE ACQUISITION AND MANAGEMENT, 2006, 4303 : 168 - +
  • [7] Category learning in a recurrent neural network with reinforcement learning
    Zhang, Ying
    Pan, Xiaochuan
    Wang, Yihong
    FRONTIERS IN PSYCHIATRY, 2022, 13
  • [8] Learning Control for Air Hockey Striking using Deep Reinforcement Learning
    Taitler, Ayal
    Shimkin, Nahum
    2017 INTERNATIONAL CONFERENCE ON CONTROL, ARTIFICIAL INTELLIGENCE, ROBOTICS & OPTIMIZATION (ICCAIRO), 2017, : 22 - 27
  • [9] Voltage Optimal Control of Distribution Network Based on Deep Reinforcement Learning
    Quan H.
    Peng X.
    Liu H.
    Zhou P.
    Wu Z.
    Su H.
    Dianwang Jishu/Power System Technology, 2023, 47 (05): 2029 - 2038
  • [10] Robust optimal control using recurrent dynamic neural network
    Karam, M
    Zohdy, MA
    Farinwata, SS
    PROCEEDINGS OF THE 2001 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL (ISIC'01), 2001, : 331 - 336