Taming Lagrangian chaos with multi-objective reinforcement learning

被引:7
|
作者
Calascibetta, Chiara [1 ]
Biferale, Luca [1 ]
Borra, Francesco [2 ]
Celani, Antonio [3 ]
Cencini, Massimo [4 ,5 ]
机构
[1] Univ Roma Tor Vergata, Dept Phys & INFN, Via Ric Scientif 1, I-00133 Rome, Italy
[2] Lab Phys Ecole Normale Super, 24 RueLhomond, F-75005 Paris, France
[3] Abdus Salaam Int Ctr Theoret Phys, Abdus Salam Int Ctr Theoret Phys, Quantitat Life Sci, I-34151 Trieste, Italy
[4] CNR, Ist Sistemi Complessi, Via Taurini 19, I-00185 Rome, Italy
[5] INFN Tor Vergata, Rome, Italy
来源
EUROPEAN PHYSICAL JOURNAL E | 2023年 / 46卷 / 03期
基金
欧洲研究理事会;
关键词
PURSUIT;
D O I
10.1140/epje/s10189-023-00271-0
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
We consider the problem of two active particles in 2D complex flows with the multi-objective goals of minimizing both the dispersion rate and the control activation cost of the pair. We approach the problem by means of multi-objective reinforcement learning (MORL), combining scalarization techniques together with a Q-learning algorithm, for Lagrangian drifters that have variable swimming velocity. We show that MORL is able to find a set of trade-off solutions forming an optimal Pareto frontier. As a benchmark, we show that a set of heuristic strategies are dominated by the MORL solutions. We consider the situation in which the agents cannot update their control variables continuously, but only after a discrete (decision) time, tau. We show that there is a range of decision times, in between the Lyapunov time and the continuous updating limit, where reinforcement learning finds strategies that significantly improve over heuristics. In particular, we discuss how large decision times require enhanced knowledge of the flow, whereas for smaller tau all a priori heuristic strategies become Pareto optimal.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Hypervolume-Based Multi-Objective Reinforcement Learning
    Van Moffaert, Kristof
    Drugan, Madalina M.
    Nowe, Ann
    EVOLUTIONARY MULTI-CRITERION OPTIMIZATION, EMO 2013, 2013, 7811 : 352 - 366
  • [32] A practical guide to multi-objective reinforcement learning and planning
    Conor F. Hayes
    Roxana Rădulescu
    Eugenio Bargiacchi
    Johan Källström
    Matthew Macfarlane
    Mathieu Reymond
    Timothy Verstraeten
    Luisa M. Zintgraf
    Richard Dazeley
    Fredrik Heintz
    Enda Howley
    Athirai A. Irissappane
    Patrick Mannion
    Ann Nowé
    Gabriel Ramos
    Marcello Restelli
    Peter Vamplew
    Diederik M. Roijers
    Autonomous Agents and Multi-Agent Systems, 2022, 36
  • [33] Incremental reinforcement learning for multi-objective robotic tasks
    Garcia, Javier
    Iglesias, Roberto
    Rodriguez, Miguel A.
    Regueiro, Carlos V.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 51 (03) : 911 - 940
  • [34] Incremental reinforcement learning for multi-objective robotic tasks
    Javier García
    Roberto Iglesias
    Miguel A. Rodríguez
    Carlos V. Regueiro
    Knowledge and Information Systems, 2017, 51 : 911 - 940
  • [35] Adaptive Objective Selection for Correlated Objectives in Multi-Objective Reinforcement Learning
    Brys, Tim
    Van Moffaert, Kristof
    Nowe, Ann
    Taylor, Matthew E.
    AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 1349 - 1350
  • [36] Prediction Guided Meta-Learning for Multi-Objective Reinforcement Learning
    Liu, Fei-Yu
    Qian, Chao
    2021 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC 2021), 2021, : 2171 - 2178
  • [37] Learning adversarial attack policies through multi-objective reinforcement learning
    Garcia, Javier
    Majadas, Ruben
    Fernandez, Fernando
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 96
  • [38] Nondominated Policy-Guided Learning in Multi-Objective Reinforcement Learning
    Kim, Man-Je
    Park, Hyunsoo
    Ahn, Chang Wook
    ELECTRONICS, 2022, 11 (07)
  • [39] LEARNING MULTI-OBJECTIVE DECEPTION IN A TWO-PLAYER DIFFERENTIAL GAME USING REINFORCEMENT LEARNING AND MULTI-OBJECTIVE GENETIC ALGORITHM
    Asgharnia A.
    Schwartz H.
    Atia M.
    International Journal of Innovative Computing, Information and Control, 2022, 18 (06): : 1667 - 1688
  • [40] Multi-Objective Reinforcement Learning Based on Decomposition: A Taxonomy and Framework
    Felten, Florian
    Talbi, El-Ghazali
    Danoy, Gregoire
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2024, 79 : 679 - 723