A Modified Value Iteration Algorithm for Discounted Markov Decision Processes

被引:1
|
作者
Chafik, Sanaa [1 ]
Daoui, Cherki [1 ]
机构
[1] Univ Sultan Moulay Slimane, Lab Informat Proc & Decis Support, Beni Mellal, Morocco
关键词
Discounted Reward Criterion; Markov Decision Processes; Open MP; Parallelizing; Value Iteration Algorithm;
D O I
10.4018/JECO.2015070104
中图分类号
F [经济];
学科分类号
02 ;
摘要
As many real applications need a large amount of states, the classical methods are intractable for solving large Markov Decision Processes. The decomposition technique basing on the topology of each state in the associated graph and the parallelization technique are very useful methods to cope with this problem. In this paper, the authors propose a Modified Value Iteration algorithm, adding the parallelism technique. They test their implementation on artificial data using an Open MP that offers a significant speed-up.
引用
收藏
页码:47 / 57
页数:11
相关论文
共 50 条
  • [41] CRITERIA FOR SELECTING THE RELAXATION FACTOR OF THE VALUE-ITERATION ALGORITHM FOR UNDISCOUNTED MARKOV AND SEMI-MARKOV DECISION-PROCESSES
    HERZBERG, M
    YECHIALI, U
    [J]. OPERATIONS RESEARCH LETTERS, 1991, 10 (04) : 193 - 202
  • [42] A unified algorithm framework for mean-variance optimization in discounted Markov decision processes
    Ma, Shuai
    Ma, Xiaoteng
    Xia, Li
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2023, 311 (03) : 1057 - 1067
  • [43] Discounted Markov Decision Processes for Small Noise Intensities
    Cruz-Suarez, Hugo
    Ilhuicatzi-Roldan, Rocio
    [J]. RECENT ADVANCES IN APPLIED MATHEMATICS, 2009, : 245 - +
  • [44] Hierarchical algorithms for discounted and weighted Markov decision processes
    Abbad, M
    Daoui, C
    [J]. MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2003, 58 (02) : 237 - 245
  • [45] A pause control approach to the value iteration scheme in average Markov decision processes
    Cavazos-Cadena, Rolando
    [J]. Systems and Control Letters, 1998, 33 (04): : 209 - 219
  • [46] A method for speeding up value iteration in partially observable Markov decision processes
    Zhang, NL
    Lee, SS
    Zhang, WH
    [J]. UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1999, : 696 - 703
  • [47] A pause control approach to the value iteration scheme in average Markov decision processes
    Cavazos-Cadena, R
    [J]. SYSTEMS & CONTROL LETTERS, 1998, 33 (04) : 209 - 219
  • [48] Variance reduced value iteration and faster algorithms for solving Markov decision processes
    Sidford, Aaron
    Wang, Mengdi
    Wu, Xian
    Ye, Yinyu
    [J]. NAVAL RESEARCH LOGISTICS, 2023, 70 (05) : 423 - 442
  • [49] A Note on Generalized Second-Order Value Iteration in Markov Decision Processes
    Vijesh, Villavarayan Antony
    Rudresha, Shreyas Sumithra
    Abdulla, Mohammed Shahid
    [J]. JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2023, 199 (03) : 1022 - 1049
  • [50] Sketched Newton Value Iteration for Large-Scale Markov Decision Processes
    Liu, Jinsong
    Xie, Chenghan
    Deng, Qi
    Ge, Dongdong
    Ye, Yinyu
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 12, 2024, : 13936 - 13944