Metrics for finite Markov decision processes

被引:0
|
作者
Ferns, N [1 ]
Panangaden, P [1 ]
Precup, D [1 ]
机构
[1] McGill Univ, Sch Comp Sci, Montreal, PQ H3A 2A7, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
引用
收藏
页码:950 / 951
页数:2
相关论文
共 50 条
  • [1] BISIMULATION METRICS FOR CONTINUOUS MARKOV DECISION PROCESSES
    Ferns, Norm
    Panangaden, Prakash
    Precup, Doina
    [J]. SIAM JOURNAL ON COMPUTING, 2011, 40 (06) : 1662 - 1714
  • [2] Computing Game Metrics on Markov Decision Processes
    Fu, Hongfei
    [J]. AUTOMATA, LANGUAGES, AND PROGRAMMING, ICALP 2012, PT II, 2012, 7392 : 227 - 238
  • [3] A taxonomy for similarity metrics between Markov decision processes
    Javier García
    Álvaro Visús
    Fernando Fernández
    [J]. Machine Learning, 2022, 111 : 4217 - 4247
  • [4] A taxonomy for similarity metrics between Markov decision processes
    Garcia, Javier
    Visus, Alvaro
    Fernandez, Fernando
    [J]. MACHINE LEARNING, 2022, 111 (11) : 4217 - 4247
  • [5] MARKOV DECISION PROCESSES WITH FINITE STATE AND DECISION SPACES
    RYKOV, VV
    [J]. THEORY OF PROBILITY AND ITS APPLICATIONS,USSR, 1966, 11 (02): : 302 - &
  • [6] A Remark on Finite Horizon Markov Decision processes
    XikUi Wang (University of Saskatchewan
    [J]. 经济数学, 1989, (05) : 76 - 80
  • [7] Safe Exploration in Finite Markov Decision Processes with Gaussian Processes
    Turchetta, Matteo
    Berkenkamp, Felix
    Krause, Andreas
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [8] On optimality gaps for fuzzification in finite Markov decision processes
    Kageyama, Masayuki
    [J]. JOURNAL OF INTERDISCIPLINARY MATHEMATICS, 2008, 11 (01) : 77 - 88
  • [9] TURNPIKES IN FINITE MARKOV DECISION PROCESSES AND RANDOM WALK*
    Piunovskiy, A. B.
    [J]. THEORY OF PROBABILITY AND ITS APPLICATIONS, 2023, 68 (01) : 123 - 149
  • [10] Measuring the Distance Between Finite Markov Decision Processes
    Song, Jinhua
    Gao, Yang
    Wang, Hao
    An, Bo
    [J]. AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 468 - 476