Markov Decision Process Framework for Control-Based Reinforcement Learning

被引:0
|
作者
Lu Y. [1 ]
Squillante M.S. [1 ]
Wah Wu C. [1 ]
机构
[1] IBM Research, Mathematical Sciences Department, Thomas J. Watson Research Center, Yorktown Heights, 10598, NY
来源
Performance Evaluation Review | 2023年 / 51卷 / 02期
关键词
Compendex;
D O I
10.1145/3626570.3626585
中图分类号
学科分类号
摘要
[No abstract available]
引用
收藏
页码:39 / 41
页数:2
相关论文
共 50 条
  • [41] Optimal Electric Vehicle Charging Strategy With Markov Decision Process and Reinforcement Learning Technique
    Ding, Tao
    Zeng, Ziyu
    Bai, Jiawen
    Qin, Boyu
    Yang, Yongheng
    Shahidehpour, Mohammad
    IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2020, 56 (05) : 5811 - 5823
  • [42] Joint Manufacturing and Onsite Microgrid System Control Using Markov Decision Process and Neural Network Integrated Reinforcement Learning
    Hu, Wenqing
    Sun, Zeyi
    Zhang, Yunchao
    Li, Yu
    25TH INTERNATIONAL CONFERENCE ON PRODUCTION RESEARCH MANUFACTURING INNOVATION: CYBER PHYSICAL MANUFACTURING, 2019, 39 : 1242 - 1249
  • [43] A statistical property of multiagent learning based on Markov decision process
    Iwata, Kazunori
    Ikeda, Kazushi
    Sakai, Hideaki
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2006, 17 (04): : 829 - 842
  • [44] Activity Support Framework for People with Dementia Based on Markov Decision Process
    Sarni, Tomi
    Pulli, Petri
    2015 INTERNATIONAL CONFERENCE ON INTELLIGENT ENVIRONMENTS IE 2015, 2015, : 25 - 32
  • [45] On the convergence of projective-simulation-based reinforcement learning in Markov decision processes
    Boyajian, W. L.
    Clausen, J.
    Trenkwalder, L. M.
    Dunjko, V
    Briegel, H. J.
    QUANTUM MACHINE INTELLIGENCE, 2020, 2 (02)
  • [46] On the convergence of projective-simulation–based reinforcement learning in Markov decision processes
    W. L. Boyajian
    J. Clausen
    L. M. Trenkwalder
    V. Dunjko
    H. J. Briegel
    Quantum Machine Intelligence, 2020, 2
  • [47] A sensitivity view of Markov decision processes and reinforcement learning
    Cao, XR
    MODELING, CONTROL AND OPTIMIZATION OF COMPLEX SYSTEMS: IN HONOR OF PROFESSOR YU-CHI HO, 2003, 14 : 261 - 283
  • [48] Quality Control for Express Items Based on Markov Decision Process
    Han, Xu
    Li, Yisong
    2016 INTERNATIONAL CONFERENCE ON LOGISTICS, INFORMATICS AND SERVICE SCIENCES (LISS' 2016), 2016,
  • [49] Filtered Probabilistic Model Predictive Control-Based Reinforcement Learning for Unmanned Surface Vehicles
    Cui, Yunduan
    Peng, Lei
    Li, Huiyun
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (10) : 6950 - 6961
  • [50] GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
    Zhou, Zhehua
    Xie, Xuan
    Song, Jiayang
    Shu, Zhan
    Ma, Lei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,