Customized Learning Algorithms for Episodic Tasks with Acyclic State Spaces

Cited by: 0

Authors:

Bountourelis, Theologos ^{[1]}

Reveliotis, Spyros ^{[1]}

Affiliation:

[1] Georgia Inst Technol, Sch Ind & Syst Engn, Atlanta, GA 30332 USA

Source:

2009 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING | 2009

Keywords:

TIME;

DOI:

10.1109/COASE.2009.5234189

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Subject classification code:

0812 ;

Abstract:

This paper presents a practical, customized learning algorithm for reinforcement learning tasks that evolve episodically over acyclic state spaces. The results are motivated by the Optimal Disassembly Planning (ODP) problem described in [14], and they complement and enhance earlier developments on this problem presented in [15]. In particular, the proposed algorithm is shown to improve substantially on the original algorithm developed in [15] in terms of both the computational effort involved and the attained performance, where the latter is measured by the accumulated reward. The new algorithm also yields a robust performance gain over typical Q-learning implementations in the considered problem context.

Pages: 627-634

Page count: 8

50 items in total

[1] Efficient learning algorithms for episodic tasks with acyclic state spaces
Reveliotis, Spyros
Bountourelis, Theologos
2006 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING, VOLS 1 AND 2, 2006, : 411 - +
[2] Efficient PAC learning for episodic tasks with acyclic state spaces
Reveliotis, Spyros
Bountourelis, Theologos
DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2007, 17 (03): : 307 - 327
[3] Efficient PAC Learning for Episodic Tasks with Acyclic State Spaces
Spyros Reveliotis
Theologos Bountourelis
Discrete Event Dynamic Systems, 2007, 17 : 307 - 327
[4] Maximizing the average reward in episodic reinforcement learning tasks
Reinke, Chris
Uchibe, Eiji
Doya, Kenji
2015 INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATICS AND BIOMEDICAL SCIENCES (ICIIBMS), 2015, : 420 - 421
[5] A hybrid learning agent for episodic learning tasks with unknown target distance
Oliver Sefrin
Sabine Wölk
Quantum Machine Intelligence, 2025, 7 (1)
[6] Scheduling of Customized Tasks in Cloud Manufacturing with Deep Reinforcement Learning
Lv, Ming
Cao, Yu
Qiu, Xingbo
Liu, Yongkui
Zhang, Lin
INTELLIGENT NETWORKED THINGS, CINT 2024, PT II, 2024, 2139 : 241 - 252
[7] Quantum machine learning with glow for episodic tasks and decision games
Clausen, Jens
Briegel, Hans J.
PHYSICAL REVIEW A, 2018, 97 (02)
[8] Adaptive Discretization for Episodic Reinforcement Learning in Metric Spaces
Sinclair, Sean R.
Banerjee, Siddhartha
Yu, Christina Lee
PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2019, 3 (03)
[9] Adaptive Discretization for Episodic Reinforcement Learning in Metric Spaces
Sinclair S.R.
Banerjee S.
Lee Yu C.
Performance Evaluation Review, 2020, 48 (01): : 17 - 18
[10] Improved Corruption Robust Algorithms for Episodic Reinforcement Learning
Chen, Yifang
Du, Simon S.
Jamieson, Kevin
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
