Fitted Q-Iteration via Max-Plus-Linear Approximation

被引:0
|
作者
Liu, Yichen [1 ]
Kolarijani, Mohamad Amin Sharifi [1 ]
机构
[1] Delft Univ Technol, Delft Ctr Syst & Control, NL-2628 CD Delft, Netherlands
来源
关键词
Approximation algorithms; Convergence; Vectors; Standards; Optimal control; Complexity theory; Algebra; Real-time systems; Neural networks; Medical services; Reinforcement learning; stochastic optimal control; computational methods;
D O I
10.1109/LCSYS.2024.3520060
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this letter, we consider the application of max-plus-linear approximators for Q-function in offline reinforcement learning of discounted Markov decision processes. In particular, we incorporate these approximators to propose novel fitted Q-iteration (FQI) algorithms with provable convergence. Exploiting the compatibility of the Bellman operator with max-plus operations, we show that the max-plus-linear regression within each iteration of the proposed FQI algorithm reduces to simple max-plus matrix-vector multiplications. We also consider the variational implementation of the proposed algorithm which leads to a per-iteration complexity that is independent of the number of samples.
引用
收藏
页码:3201 / 3206
页数:6
相关论文
共 50 条
  • [1] Boosted Fitted Q-Iteration
    Tosatto, Samuele
    Pirotta, Matteo
    D'Eramo, Carlo
    Restelli, Marcello
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [2] CFQI: Fitted Q-Iteration with Complex Returns
    Wright, Robert
    Qiao, Xingye
    Yu, Lei
    Loscalzo, Steven
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 163 - 170
  • [3] Regularized Fitted Q-Iteration: Application to Planning
    Farahmand, Amir Massoud
    Ghavamzadeh, Mohammad
    Szepesvari, Csaba
    Mannor, Shie
    RECENT ADVANCES IN REINFORCEMENT LEARNING, 2008, 5323 : 55 - +
  • [4] Logically-Constrained Neural Fitted Q-iteration
    Hasanbeig, Mohammadhosein
    Abate, Alessandro
    Kroening, Daniel
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2012 - 2014
  • [5] Fitted Q-iteration by Functional Networks for control problems
    Gaeta, Matteo
    Loia, Vincenzo
    Miranda, Sergio
    Tomasiello, Stefania
    APPLIED MATHEMATICAL MODELLING, 2016, 40 (21-22) : 9183 - 9196
  • [6] Approximate Model-Assisted Neural Fitted Q-Iteration
    Lampe, Thomas
    Riedmiller, Martin
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 2698 - 2704
  • [7] Fitted Q-iteration and functional networks for ubiquitous recommender systems
    Gaeta, Matteo
    Orciuoli, Francesco
    Rarita, Luigi
    Tomasiello, Stefania
    SOFT COMPUTING, 2017, 21 (23) : 7067 - 7075
  • [8] Fitted Q-iteration and functional networks for ubiquitous recommender systems
    Matteo Gaeta
    Francesco Orciuoli
    Luigi Rarità
    Stefania Tomasiello
    Soft Computing, 2017, 21 : 7067 - 7075
  • [9] Electric Water Heater Control Through Informed Fitted Q-Iteration
    Patyn, Christophe
    Deconinck, Geert
    PROCEEDINGS OF 2019 IEEE PES INNOVATIVE SMART GRID TECHNOLOGIES EUROPE (ISGT-EUROPE), 2019,
  • [10] Learning to Control Partial Differential Equations: Regularized Fitted Q-Iteration Approach
    Farahmand, Amir-massoud
    Nabi, Saleh
    Grover, Piyush
    Nikovski, Daniel N.
    2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 4578 - 4585