Optimal control towards sustainable wastewater treatment plants based on multi-agent reinforcement learning

被引:53
|
作者
Chen, Kehua [1 ,5 ]
Wang, Hongcheng [1 ,2 ,3 ]
Valverde-Perez, Borja [4 ]
Zhai, Siyuan [1 ]
Vezzaro, Luca [4 ]
Wang, Aijie [1 ,3 ]
机构
[1] Chinese Acad Sci, Res Ctr Ecoenvironm Sci, Key Lab Environm Biotechnol, 18 Shuangqing Rd, Beijing 100085, Peoples R China
[2] Harbin Inst Technol Shenzhen, Sch Civil & Environm Engn, Shenzhen 518055, Peoples R China
[3] Harbin Inst Technol, State Key Lab Urban Water Resource & Environm, Harbin 150001, Peoples R China
[4] Tech Univ Denmark, DTU Environm, Bldg 115, DK-2800 Lyngby, Denmark
[5] Univ Chinese Acad Sci, Sino Danish Ctr Educ & Res, Beijing, Peoples R China
关键词
Wastewater treatment; Reinforcement learning; Multi-objective optimization; Sustainability; GREENHOUSE-GAS EMISSIONS; MULTIOBJECTIVE OPTIMIZATION; ENERGY-CONSUMPTION; DISSOLVED-OXYGEN; NEURAL-NETWORKS; MODEL; BENCHMARKING; PHOSPHORUS; EFFICIENCY; REMOVAL;
D O I
10.1016/j.chemosphere.2021.130498
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Wastewater treatment plants (WWTPs) are designed to eliminate pollutants and alleviate environmental pollution resulting from human activities. However, the construction and operation of WWTPs consume resources, emit greenhouse gases (GHGs) and produce residual sludge, thus require further optimization. WWTPs are complex to control and optimize because of high non-linearity and variation. This study used a novel technique, multi-agent deep reinforcement learning (MADRL), to simultaneously optimize dissolved oxygen (DO) and chemical dosage in a WWTP. The reward function was specially designed from life cycle perspective to achieve sustainable optimization. Five scenarios were considered: baseline, three different effluent quality and cost-oriented scenarios. The result shows that optimization based on LCA has lower environmental impacts compared to baseline scenario, as cost, energy consumption and greenhouse gas emissions reduce to 0.890 CNY/m(3)-ww, 0.530 kWh/m(3)-ww, 2.491 kg CO2-eq/m(3)-ww respectively. The cost-oriented control strategy exhibits comparable overall performance to the LCA-driven strategy since it sacrifices environmental benefits but has lower cost as 0.873 CNY/m(3)-ww. It is worth mentioning that the retrofitting of WWTPs based on resources should be implemented with the consideration of impact transfer. Specifically, LCA-SW scenario decreases 10 kg PO4-eq in eutrophication potential compared to the baseline within 10 days, while significantly increases other indicators. The major contributors of each indicator are identified for future study and improvement. Last, the authors discussed that novel dynamic control strategies required advanced sensors or a large amount of data, so the selection of control strategies should also consider economic and ecological conditions. In a nutshell, there are still limitations of this work and future studies are required. (C) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Optimal control method of HVAC based on multi-agent deep reinforcement learning
    Fu, Qiming
    Chen, Xiyao
    Ma, Shuai
    Fang, Nengwei
    Xing, Bin
    Chen, Jianping
    [J]. ENERGY AND BUILDINGS, 2022, 270
  • [2] A Nearly Optimal Multi-agent Formation Control with Reinforcement Learning
    Peng, Jiangwen
    Mu, Chaoxu
    Wang, Ke
    [J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 5315 - 5320
  • [3] Optimal control in microgrid using multi-agent reinforcement learning
    Li, Fu-Dong
    Wu, Min
    He, Yong
    Chen, Xin
    [J]. ISA TRANSACTIONS, 2012, 51 (06) : 743 - 751
  • [4] Multi-Agent Optimal Control for Central Chiller Plants Using Reinforcement Learning and Game Theory
    Qiu, Shunian
    Li, Zhenhai
    Pang, Zhihong
    Li, Zhengwei
    Tao, Yinying
    [J]. SYSTEMS, 2023, 11 (03):
  • [5] Optimal robust formation control for heterogeneous multi-agent systems based on reinforcement learning
    Yan, Bing
    Shi, Peng
    Lim, Cheng-Chew
    Shi, Zhiyuan
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (05) : 2683 - 2704
  • [6] Multi-Agent Reinforcement Learning Based Optimal PV-ESS Control In Grid
    Park, Jaemin
    Kwon, Taehyeon
    Kim, Bongseok
    Hwang, Yujeong
    Sim, Min Kyu
    [J]. 2023 IEEE IAS GLOBAL CONFERENCE ON RENEWABLE ENERGY AND HYDROGEN TECHNOLOGIES, GLOBCONHT, 2023,
  • [7] Distributed optimal control of nonlinear multi-agent systems based on integral reinforcement learning
    Xu, Ying
    Li, Kewen
    Li, Yongming
    [J]. OPTIMAL CONTROL APPLICATIONS & METHODS, 2024,
  • [8] Towards Designing Optimal Reward Functions in Multi-Agent Reinforcement Learning Problems
    Grunitzki, Ricardo
    da Silva, Bruno C.
    Bazzan, Ana L. C.
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [9] Formation Control of Multi-agent Based on Deep Reinforcement Learning
    Pan, Chao
    Nian, Xiaohong
    Dai, Xunhua
    Wang, Haibo
    Xiong, Hongyun
    [J]. PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 1149 - 1159
  • [10] Multi-agent reinforcement learning for character control
    Li, Cheng
    Fussell, Levi
    Komura, Taku
    [J]. VISUAL COMPUTER, 2021, 37 (12): : 3115 - 3123