Customized Learning Algorithms for Episodic Tasks with Acyclic State Spaces

Cited by: 0
Authors
Bountourelis, Theologos [1]
Reveliotis, Spyros [1]
Affiliations
[1] Georgia Inst Technol, Sch Ind & Syst Engn, Atlanta, GA 30332 USA
Keywords
TIME;
DOI
10.1109/COASE.2009.5234189
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
This paper presents a practical, customized learning algorithm for reinforcement learning tasks that evolve episodically over acyclic state spaces. The results are motivated by the Optimal Disassembly Planning (ODP) problem described in [14], and they complement and enhance earlier developments on this problem presented in [15]. In particular, the proposed algorithm is shown to be a substantial improvement over the original algorithm developed in [15], in terms of both the computational effort involved and the performance attained, where the latter is measured by the accumulated reward. The new algorithm also yields a robust performance gain over typical Q-learning implementations in the considered problem context.
Pages: 627-634
Page count: 8
Related papers
50 in total
  • [1] Efficient learning algorithms for episodic tasks with acyclic state spaces
    Reveliotis, Spyros
    Bountourelis, Theologos
    2006 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING, VOLS 1 AND 2, 2006, : 411 - +
  • [2] Efficient PAC learning for episodic tasks with acyclic state spaces
    Reveliotis, Spyros
    Bountourelis, Theologos
    DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2007, 17 (03): : 307 - 327
  • [3] Efficient PAC Learning for Episodic Tasks with Acyclic State Spaces
    Spyros Reveliotis
    Theologos Bountourelis
    Discrete Event Dynamic Systems, 2007, 17 : 307 - 327
  • [4] Maximizing the average reward in episodic reinforcement learning tasks
    Reinke, Chris
    Uchibe, Eiji
    Doya, Kenji
    2015 INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATICS AND BIOMEDICAL SCIENCES (ICIIBMS), 2015, : 420 - 421
  • [5] A hybrid learning agent for episodic learning tasks with unknown target distance
    Oliver Sefrin
    Sabine Wölk
    Quantum Machine Intelligence, 2025, 7 (1)
  • [6] Scheduling of Customized Tasks in Cloud Manufacturing with Deep Reinforcement Learning
    Lv, Ming
    Cao, Yu
    Qiu, Xingbo
    Liu, Yongkui
    Zhang, Lin
    INTELLIGENT NETWORKED THINGS, CINT 2024, PT II, 2024, 2139 : 241 - 252
  • [7] Quantum machine learning with glow for episodic tasks and decision games
    Clausen, Jens
    Briegel, Hans J.
    PHYSICAL REVIEW A, 2018, 97 (02)
  • [8] Adaptive Discretization for Episodic Reinforcement Learning in Metric Spaces
    Sinclair, Sean R.
    Banerjee, Siddhartha
    Yu, Christina Lee
    PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2019, 3 (03)
  • [9] Adaptive Discretization for Episodic Reinforcement Learning in Metric Spaces
    Sinclair S.R.
    Banerjee S.
    Lee Yu C.
    Performance Evaluation Review, 2020, 48 (01): : 17 - 18
  • [10] Improved Corruption Robust Algorithms for Episodic Reinforcement Learning
    Chen, Yifang
    Du, Simon S.
    Jamieson, Kevin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139