Optimal control towards sustainable wastewater treatment plants based on multi-agent reinforcement learning

被引:53
|
作者
Chen, Kehua [1 ,5 ]
Wang, Hongcheng [1 ,2 ,3 ]
Valverde-Perez, Borja [4 ]
Zhai, Siyuan [1 ]
Vezzaro, Luca [4 ]
Wang, Aijie [1 ,3 ]
机构
[1] Chinese Acad Sci, Res Ctr Ecoenvironm Sci, Key Lab Environm Biotechnol, 18 Shuangqing Rd, Beijing 100085, Peoples R China
[2] Harbin Inst Technol Shenzhen, Sch Civil & Environm Engn, Shenzhen 518055, Peoples R China
[3] Harbin Inst Technol, State Key Lab Urban Water Resource & Environm, Harbin 150001, Peoples R China
[4] Tech Univ Denmark, DTU Environm, Bldg 115, DK-2800 Lyngby, Denmark
[5] Univ Chinese Acad Sci, Sino Danish Ctr Educ & Res, Beijing, Peoples R China
关键词
Wastewater treatment; Reinforcement learning; Multi-objective optimization; Sustainability; GREENHOUSE-GAS EMISSIONS; MULTIOBJECTIVE OPTIMIZATION; ENERGY-CONSUMPTION; DISSOLVED-OXYGEN; NEURAL-NETWORKS; MODEL; BENCHMARKING; PHOSPHORUS; EFFICIENCY; REMOVAL;
D O I
10.1016/j.chemosphere.2021.130498
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Wastewater treatment plants (WWTPs) are designed to eliminate pollutants and alleviate environmental pollution resulting from human activities. However, the construction and operation of WWTPs consume resources, emit greenhouse gases (GHGs) and produce residual sludge, thus require further optimization. WWTPs are complex to control and optimize because of high non-linearity and variation. This study used a novel technique, multi-agent deep reinforcement learning (MADRL), to simultaneously optimize dissolved oxygen (DO) and chemical dosage in a WWTP. The reward function was specially designed from life cycle perspective to achieve sustainable optimization. Five scenarios were considered: baseline, three different effluent quality and cost-oriented scenarios. The result shows that optimization based on LCA has lower environmental impacts compared to baseline scenario, as cost, energy consumption and greenhouse gas emissions reduce to 0.890 CNY/m(3)-ww, 0.530 kWh/m(3)-ww, 2.491 kg CO2-eq/m(3)-ww respectively. The cost-oriented control strategy exhibits comparable overall performance to the LCA-driven strategy since it sacrifices environmental benefits but has lower cost as 0.873 CNY/m(3)-ww. It is worth mentioning that the retrofitting of WWTPs based on resources should be implemented with the consideration of impact transfer. Specifically, LCA-SW scenario decreases 10 kg PO4-eq in eutrophication potential compared to the baseline within 10 days, while significantly increases other indicators. The major contributors of each indicator are identified for future study and improvement. Last, the authors discussed that novel dynamic control strategies required advanced sensors or a large amount of data, so the selection of control strategies should also consider economic and ecological conditions. In a nutshell, there are still limitations of this work and future studies are required. (C) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Multi-agent Deep Reinforcement Learning Based Optimal Dispatch of Distributed Generators
    Zhang, Jinyuan
    Pu, Tianjiao
    Li, Ye
    Wang, Xinying
    Zhou, Xiang
    [J]. Dianwang Jishu/Power System Technology, 2022, 46 (09): : 3496 - 3503
  • [22] Multi-Agent Deep Reinforcement Learning-Based Distributed Optimal Generation Control of DC Microgrids
    Fan, Zhen
    Zhang, Wei
    Liu, Wenxin
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2023, 14 (05) : 3337 - 3351
  • [23] Multi-Agent Reinforcement Learning
    Stankovic, Milos
    [J]. 2016 13TH SYMPOSIUM ON NEURAL NETWORKS AND APPLICATIONS (NEUREL), 2016, : 43 - 43
  • [24] Multi-Agent Reinforcement Learning Control for Ramp Metering
    Fares, Ahmed
    Gomaa, Walid
    [J]. PROGRESS IN SYSTEMS ENGINEERING, 2015, 366 : 167 - 173
  • [25] Scalable Reinforcement Learning Policies for Multi-Agent Control
    Hsu, Christopher D.
    Jeong, Heejin
    Pappas, George J.
    Chaudhari, Pratik
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 4785 - 4791
  • [26] MARLYC: Multi-Agent Reinforcement Learning Yaw Control
    Kadoche, Elie
    Gourvenec, Sebastien
    Pallud, Maxime
    Levent, Tanguy
    [J]. RENEWABLE ENERGY, 2023, 217
  • [27] Dynamic Multi-Agent Reinforcement Learning for Control Optimization
    Fagan, Derek
    Meier, Rene
    [J]. PROCEEDINGS FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION, 2014, : 99 - 104
  • [28] Multi-agent Reinforcement Learning for Traffic Signal Control
    Prabuchandran, K. J.
    Kumar, Hemanth A. N.
    Bhatnagar, Shalabh
    [J]. 2014 IEEE 17TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2014, : 2529 - 2534
  • [29] Cranes control using multi-agent reinforcement learning
    Arai, S
    Miyazaki, K
    Kobayashi, S
    [J]. INTELLIGENT AUTONOMOUS SYSTEMS: IAS-5, 1998, : 335 - 342
  • [30] Multi-Agent Reinforcement Learning for Coordinating Communication and Control
    Mason, Federico
    Chiariotti, Federico
    Zanella, Andrea
    Popovski, Petar
    [J]. IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2024, 10 (04) : 1566 - 1581