Tackling the start-up of a reinforcement learning agent for the control of wastewater treatment plants

被引:24
|
作者
Hernandez-del-Olma, Felix [1 ]
Gaudioso, Elena [1 ]
Dormido, Raquel [2 ]
Duro, Natividad [2 ]
机构
[1] Natl Distance Educ Univ UNED, Dept Artificial Intelligence, Madrid 28040, Spain
[2] Natl Distance Educ Univ UNED, Dept Comp Sci & Automat Control, Madrid 28040, Spain
关键词
Reinforcement learning; Wastewater systems; Intelligent agent; Adaptive control; OPTIMIZATION; SYSTEMS;
D O I
10.1016/j.knosys.2017.12.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning problems involve learning by doing. Therefore, a reinforcement learning agent will have to fail sometimes (while doing) in order to learn. Nevertheless, even with this starting error, introduced at least during the non-optimal learning stage, reinforcement learning can be affordable in some domains like the control of a wastewater treatment plant. However, in wastewater treatment plants, trying to solve the day-to-day problems, plant operators will usually not risk to leave their plant in the hands of an inexperienced and untrained reinforcement learning agent. In fact, it is somewhat obvious that plant operators will require firstly to check that the agent has been trained and that it works as it should at their particular plant In this paper, we present a solution to this problem by giving a previous instruction to the reinforcement learning agent before we let it act on the plant. In fact, this previous instruction is the key point of the paper. In addition, this instruction is given effortlessly by the plant operator. As we will see, this solution does not just solve the starting up problem of leaving the plant in the hands of an untrained agent, but it also improves the future performance of the agent. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:9 / 15
页数:7
相关论文
共 50 条
  • [1] START-UP OF NEW WASTEWATER TREATMENT PLANTS
    CAVERLY, DS
    [J]. JOURNAL WATER POLLUTION CONTROL FEDERATION, 1968, 40 (04): : 571 - &
  • [2] Reinforcement Learning Techniques for the Control of WasteWater Treatment Plants
    Hernandez-del-Olmo, Felix
    Gaudioso, Elena
    [J]. NEW CHALLENGES ON BIOINSPIRED APPLICATIONS: 4TH INTERNATIONAL WORK-CONFERENCE ON THE INTERPLAY BETWEEN NATURAL AND ARTIFICIAL COMPUTATION, IWINAC 2011, PART II, 2011, 6687 : 215 - 222
  • [3] START-UP OF ANAEROBIC REACTORS FOR SLAUGHTERHOUSE WASTEWATER TREATMENT
    Fia, Ronaldo
    Pereira, Erlon L.
    Fia, Fatima R. L.
    Emboaba, Debora G.
    Gomes, Emanuel M.
    [J]. ENGENHARIA AGRICOLA, 2015, 35 (02): : 331 - 339
  • [4] Start-up and operation of an AnMBR for winery wastewater treatment
    Basset, Nuria
    Santos, Eric
    Dosta, Joan
    Mata-Alvarez, Joan
    [J]. ECOLOGICAL ENGINEERING, 2016, 86 : 279 - 289
  • [5] Start-up of a trickling photobioreactor for the treatment of domestic wastewater
    Katam, Keerthi
    Tiwari, Yashendra
    Shimizu, Toshiyuki
    Soda, Satoshi
    Bhattacharyya, Debraj
    [J]. WATER ENVIRONMENT RESEARCH, 2021, 93 (09) : 1690 - 1699
  • [6] Optimal control towards sustainable wastewater treatment plants based on multi-agent reinforcement learning
    Chen, Kehua
    Wang, Hongcheng
    Valverde-Perez, Borja
    Zhai, Siyuan
    Vezzaro, Luca
    Wang, Aijie
    [J]. CHEMOSPHERE, 2021, 279
  • [7] START-UP OF METALLURGICAL PLANTS
    TAYLOR, JC
    [J]. JOURNAL OF METALS, 1985, 37 (11): : A124 - A125
  • [8] Fast start-up strategies of MBBR for mariculture wastewater treatment
    Li, Changwei
    Liang, Jiawei
    Lin, Xiaochang
    Xu, Hong
    Tadda, Musa Abubakar
    Lan, Lihua
    Liu, Dezhao
    [J]. JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2019, 248
  • [9] Start-up strategies of UASB reactor for treatment of pharmaceutical wastewater
    Zheng, P
    Hu, BL
    [J]. JOURNAL OF ENVIRONMENTAL SCIENCES, 2002, 14 (02) : 250 - 254
  • [10] Start-up of EGSB reactor for treatment of wastewater containing pentachlorophenol
    Zhou, HB
    Chen, J
    [J]. JOURNAL OF CHEMICAL ENGINEERING OF JAPAN, 2003, 36 (10) : 1152 - 1155