A new complexity result on solving the Markov decision problem

被引:33
|
作者
Ye, YY [1 ]
机构
[1] Stanford Univ, Dept Management Sci & Engn, Stanford, CA 94305 USA
关键词
Markov decision problem; linear programming; complexity;
D O I
10.1287/moor.1050.0149
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
We present a new complexity result on solving the Markov decision problem (MDP) with n states and a number of actions for each state, a special class of real-number linear programs with the Leontief matrix structure. We prove that when the discount factor theta is strictly less than 1, the problem can be solved in at most O(n(1.5)(log1/(1 - theta) + log n)) classical interior-point method iterations and O(n(4)(log 1/(1 - theta) + log n)) arithmetic operations. Our method is a combinatorial interior-point method related to the work of Ye (1990. A "build-down" scheme for linear programming. Math. Programming 46 61-72) and Vavasis and Ye (1996. A primal-dual interior-point method whose running time depends only on the constraint matrix. Math. Programming 74 79-120). To our knowledge, this is the first strongly polynomial-time algorithm for solving the MDP when the discount factor is a constant less than 1.
引用
收藏
页码:733 / 749
页数:17
相关论文
共 50 条
  • [1] Markov interval chain (MIC) for solving a decision problem
    Semati, Salah Eddine
    Gasmi, Abdelkader
    OPSEARCH, 2023, 60 (02) : 802 - 811
  • [2] Markov interval chain (MIC) for solving a decision problem
    Salah eddine Semati
    Abdelkader Gasmi
    OPSEARCH, 2023, 60 : 802 - 811
  • [3] An efficient algorithm and complexity result for solving the sum of general affine ratios problem
    Jiao, Hongwei
    Ma, Junqiao
    CHAOS SOLITONS & FRACTALS, 2022, 164
  • [4] A Cooperative Distributed Problem Solving Technique for Large Markov Decision Processes
    Mouaddib, Abdel-Illah
    Le Gloannec, Simon
    ECAI 2006, PROCEEDINGS, 2006, 141 : 843 - +
  • [5] Solving concurrent Markov decision processes
    Weld, M
    Weld, DS
    PROCEEDING OF THE NINETEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE SIXTEENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2004, : 716 - 722
  • [6] Solving hybrid Markov decision processes
    Reyes, Alberto
    Sucar, L. Enrique
    Morales, Eduardo F.
    Ibarguengoytia, Pablo H.
    MICAI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4293 : 227 - +
  • [7] Complexity issues in Markov decision processes
    Goldsmith, J
    Mundhenk, M
    THIRTEENTH ANNUAL IEEE CONFERENCE ON COMPUTATIONAL COMPLEXITY - PROCEEDINGS, 1998, : 272 - 280
  • [8] The complexity of synchronizing Markov decision processes
    Doyen, Laurent
    Massart, Thierry
    Shirmohammadi, Mahsa
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2019, 100 : 96 - 129
  • [9] THE COMPLEXITY OF MARKOV DECISION-PROCESSES
    PAPADIMITRIOU, CH
    TSITSIKLIS, JN
    MATHEMATICS OF OPERATIONS RESEARCH, 1987, 12 (03) : 441 - 450
  • [10] On the complexity of hierarchical problem solving
    de Jong, Edwin D.
    Watson, Richard A.
    Thierens, Dirk
    GECCO 2005: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOLS 1 AND 2, 2005, : 1201 - 1208