A new complexity result on solving the Markov decision problem

Cited by: 33

Authors:

Ye, YY [1]

Affiliations:

[1] Stanford Univ, Dept Management Sci & Engn, Stanford, CA 94305 USA

Source:

MATHEMATICS OF OPERATIONS RESEARCH | 2005 / Vol. 30 / No. 03

Keywords:

Markov decision problem; linear programming; complexity;

DOI:

10.1287/moor.1050.0149

Chinese Library Classification (CLC) number:

C93 [Management Science]; O22 [Operations Research];

Discipline classification codes:

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

Abstract:

We present a new complexity result on solving the Markov decision problem (MDP) with $n$ states and a number of actions for each state, a special class of real-number linear programs with the Leontief matrix structure. We prove that when the discount factor $\theta$ is strictly less than 1, the problem can be solved in at most $O(n^{1.5}(\log\frac{1}{1-\theta} + \log n))$ classical interior-point method iterations and $O(n^{4}(\log\frac{1}{1-\theta} + \log n))$ arithmetic operations. Our method is a combinatorial interior-point method related to the work of Ye (1990. A "build-down" scheme for linear programming. Math. Programming 46 61-72) and Vavasis and Ye (1996. A primal-dual interior-point method whose running time depends only on the constraint matrix. Math. Programming 74 79-120). To our knowledge, this is the first strongly polynomial-time algorithm for solving the MDP when the discount factor is a constant less than 1.
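To make the link between a discounted MDP and a linear program more concrete, here is a minimal Python sketch. It is not the interior-point method of the paper: it uses plain value iteration as a stand-in solver, then checks that the resulting values satisfy the constraints of the standard cost-minimizing LP form, y_i <= c_ia + theta * sum_j p_ij^a * y_j for every state i and action a. The two-state instance, the names `ACTIONS`, `q_value`, `value_iteration`, and `max_lp_violation`, and the cost-minimizing convention are all assumptions made for illustration.

```python
# Hypothetical 2-state instance; every number here is made up for illustration.
THETA = 0.9  # discount factor, strictly less than 1

# ACTIONS[i] lists (cost, transition_probabilities) pairs available in state i.
ACTIONS = [
    [(1.0, [1.0, 0.0]), (2.0, [0.0, 1.0])],  # state 0: stay, or pay to move to state 1
    [(0.5, [0.0, 1.0]), (3.0, [1.0, 0.0])],  # state 1: stay, or pay to move to state 0
]


def q_value(cost, probs, y, theta):
    """Right-hand side of one LP constraint: c + theta * sum_j p_j * y_j."""
    return cost + theta * sum(p * v for p, v in zip(probs, y))


def value_iteration(actions, theta, tol=1e-12, max_iter=100_000):
    """Stand-in solver (not the paper's algorithm): iterate the Bellman operator."""
    y = [0.0] * len(actions)
    for _ in range(max_iter):
        new_y = [min(q_value(c, p, y, theta) for c, p in acts) for acts in actions]
        if max(abs(a - b) for a, b in zip(new_y, y)) < tol:
            return new_y
        y = new_y
    return y


def max_lp_violation(actions, theta, y):
    """Largest value of y_i - (c_ia + theta * sum_j p_ij^a y_j) over all (i, a)."""
    return max(
        y[i] - q_value(c, p, y, theta)
        for i, acts in enumerate(actions)
        for c, p in acts
    )


if __name__ == "__main__":
    y = value_iteration(ACTIONS, THETA)
    print("values:", y)
    print("max LP constraint violation:", max_lp_violation(ACTIONS, THETA, y))
```

In this toy instance the best policy stays in state 1 (value 0.5 / (1 - 0.9) = 5) and moves from state 0 to state 1 (value 2 + 0.9 * 5 = 6.5). Each state has one tight constraint, which is the kind of structure the LP view exposes.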


Pages: 733 - 749

Page count: 17


[1] Markov interval chain (MIC) for solving a decision problem
Semati, Salah Eddine
Gasmi, Abdelkader
OPSEARCH, 2023, 60 (02) : 802 - 811
[2] Markov interval chain (MIC) for solving a decision problem
Salah eddine Semati
Abdelkader Gasmi
OPSEARCH, 2023, 60 : 802 - 811
[3] An efficient algorithm and complexity result for solving the sum of general affine ratios problem
Jiao, Hongwei
Ma, Junqiao
CHAOS SOLITONS & FRACTALS, 2022, 164
[4] A Cooperative Distributed Problem Solving Technique for Large Markov Decision Processes
Mouaddib, Abdel-Illah
Le Gloannec, Simon
ECAI 2006, PROCEEDINGS, 2006, 141 : 843 - +
[5] Solving concurrent Markov decision processes
Weld, M
Weld, DS
PROCEEDING OF THE NINETEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE SIXTEENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2004, : 716 - 722
[6] Solving hybrid Markov decision processes
Reyes, Alberto
Sucar, L. Enrique
Morales, Eduardo F.
Ibarguengoytia, Pablo H.
MICAI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4293 : 227 - +
[7] Complexity issues in Markov decision processes
Goldsmith, J
Mundhenk, M
THIRTEENTH ANNUAL IEEE CONFERENCE ON COMPUTATIONAL COMPLEXITY - PROCEEDINGS, 1998, : 272 - 280
[8] The complexity of synchronizing Markov decision processes
Doyen, Laurent
Massart, Thierry
Shirmohammadi, Mahsa
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2019, 100 : 96 - 129
[9] THE COMPLEXITY OF MARKOV DECISION-PROCESSES
PAPADIMITRIOU, CH
TSITSIKLIS, JN
MATHEMATICS OF OPERATIONS RESEARCH, 1987, 12 (03) : 441 - 450
[10] On the complexity of hierarchical problem solving
de Jong, Edwin D.
Watson, Richard A.
Thierens, Dirk
GECCO 2005: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOLS 1 AND 2, 2005, : 1201 - 1208
