Computing Policies and Performance Bounds for Deterministic Dynamic Programs Using Mixed Integer Programming

被引：0

作者：

Cogill, Randy ^{[1
]}

Hindi, Haitham ^{[2
]}

机构：

[1] Univ Virginia, Dept Syst & Informat Engn, Charlottesville, VA 22903 USA

[2] Palo Alto Res Ctr, Intelligent Syst Lab, Palo Alto, CA 94304 USA

来源：

2011 AMERICAN CONTROL CONFERENCE | 2011年

关键词：

APPROXIMATIONS;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper we present a mixed integer programming approach to deterministic dynamic programming. We consider the problem of computing a policy that maximizes the total discounted reward earned over an infinite time horizon. While problems of this form are difficult in general, suboptimal solutions and performance bounds can be computed by approximating the dynamic programming value function. Here we provide a linear programming-based method for approximating the value function, and show how suboptimal policies can be computed through repeated solution of mixed integer programs that directly utilize this approximation. We have applied this approach to problems with states described by binary vectors with dimension as large as several hundred. Although the number of distinct states associated with such a problem is extremely large, we are able to obtain suboptimal policies with surprisingly tight performance guarantees. We illustrate the application of this method on a class of infinite horizon job shop scheduling problems.

引用

页数：8

共 50 条

[31] Lower Bounds on the Complexity of Mixed-Integer Programs for Stable Set and Knapsack
Schade, Jamico
Sinha, Makrand
Weltge, Stefan
INTEGER PROGRAMMING AND COMBINATORIAL OPTIMIZATION, IPCO 2024, 2024, 14679 : 379 - 392
[32] IMPROVED INTEGER PROGRAMMING BOUNDS USING INTERSECTIONS OF CORNER POLYHEDRA
BELL, DE
FISHER, ML
MATHEMATICAL PROGRAMMING, 1975, 8 (03) : 345 - 368
[33] Lower Bounds on the Complexity of Mixed-Integer Programs for Stable Set and Knapsack
Schade, Jamico
Sinha, Makrand
Weltge, Stefan
arXiv, 2023,
[34] A dynamic convexized method for nonconvex mixed integer nonlinear programming
Zhu, Wenxing
Lin, Geng
COMPUTERS & OPERATIONS RESEARCH, 2011, 38 (12) : 1792 - 1804
[35] A mixed integer linear programming model for dynamic route guidance
Kaufman, DE
Nonis, J
Smith, RL
TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 1998, 32 (06) : 431 - 440
[36] Improving Bounds on the Football Pool Problem by Integer Programming and High-Throughput Computing
Linderoth, Jeff
Margot, Francois
Thain, Greg
INFORMS JOURNAL ON COMPUTING, 2009, 21 (03) : 445 - 457
[37] Convex mixed-integer nonlinear programs derived from generalized disjunctive programming using cones
David E. Bernal Neira
Ignacio E. Grossmann
Computational Optimization and Applications, 2024, 88 : 251 - 312
[38] Convex mixed-integer nonlinear programs derived from generalized disjunctive programming using cones
Neira, David E. Bernal
Grossmann, Ignacio E.
COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2024, 88 (01) : 251 - 312
[39] A mixed-integer programming model for identifying intuitive ambulance dispatching policies
Albert, Laura A.
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2023, 74 (11) : 2300 - 2311
[40] Using mixed integer programming to design employee rosters
Beaumont, N
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1997, 48 (06) : 585 - 590

← 1 2 3 4 5 →