Markov Decision Process Framework for Control-Based Reinforcement Learning

Cited by: 0

Authors:

Lu Y. [1]

Squillante M.S. [1]

Wah Wu C. [1]

Affiliation:

[1] IBM Research, Mathematical Sciences Department, Thomas J. Watson Research Center, Yorktown Heights, NY 10598

Source:

Performance Evaluation Review | 2023 / Vol. 51 / No. 2

Keywords:

Compendex;

DOI:

10.1145/3626570.3626585

Abstract:

[No abstract available]

Pages: 39-41

Page count: 2

50 items in total

[41] Optimal Electric Vehicle Charging Strategy With Markov Decision Process and Reinforcement Learning Technique
Ding, Tao
Zeng, Ziyu
Bai, Jiawen
Qin, Boyu
Yang, Yongheng
Shahidehpour, Mohammad
IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2020, 56 (05) : 5811 - 5823
[42] Joint Manufacturing and Onsite Microgrid System Control Using Markov Decision Process and Neural Network Integrated Reinforcement Learning
Hu, Wenqing
Sun, Zeyi
Zhang, Yunchao
Li, Yu
25TH INTERNATIONAL CONFERENCE ON PRODUCTION RESEARCH MANUFACTURING INNOVATION: CYBER PHYSICAL MANUFACTURING, 2019, 39 : 1242 - 1249
[43] A statistical property of multiagent learning based on Markov decision process
Iwata, Kazunori
Ikeda, Kazushi
Sakai, Hideaki
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2006, 17 (04) : 829 - 842
[44] Activity Support Framework for People with Dementia Based on Markov Decision Process
Sarni, Tomi
Pulli, Petri
2015 INTERNATIONAL CONFERENCE ON INTELLIGENT ENVIRONMENTS IE 2015, 2015, : 25 - 32
[45] On the convergence of projective-simulation-based reinforcement learning in Markov decision processes
Boyajian, W. L.
Clausen, J.
Trenkwalder, L. M.
Dunjko, V
Briegel, H. J.
QUANTUM MACHINE INTELLIGENCE, 2020, 2 (02)
[46] On the convergence of projective-simulation–based reinforcement learning in Markov decision processes
W. L. Boyajian
J. Clausen
L. M. Trenkwalder
V. Dunjko
H. J. Briegel
Quantum Machine Intelligence, 2020, 2
[47] A sensitivity view of Markov decision processes and reinforcement learning
Cao, XR
MODELING, CONTROL AND OPTIMIZATION OF COMPLEX SYSTEMS: IN HONOR OF PROFESSOR YU-CHI HO, 2003, 14 : 261 - 283
[48] Quality Control for Express Items Based on Markov Decision Process
Han, Xu
Li, Yisong
2016 INTERNATIONAL CONFERENCE ON LOGISTICS, INFORMATICS AND SERVICE SCIENCES (LISS' 2016), 2016,
[49] Filtered Probabilistic Model Predictive Control-Based Reinforcement Learning for Unmanned Surface Vehicles
Cui, Yunduan
Peng, Lei
Li, Huiyun
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (10) : 6950 - 6961
[50] GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
Zhou, Zhehua
Xie, Xuan
Song, Jiayang
Shu, Zhan
Ma, Lei
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
