Computing Policies and Performance Bounds for Deterministic Dynamic Programs Using Mixed Integer Programming

被引:0
|
作者
Cogill, Randy [1 ]
Hindi, Haitham [2 ]
机构
[1] Univ Virginia, Dept Syst & Informat Engn, Charlottesville, VA 22903 USA
[2] Palo Alto Res Ctr, Intelligent Syst Lab, Palo Alto, CA 94304 USA
关键词
APPROXIMATIONS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we present a mixed integer programming approach to deterministic dynamic programming. We consider the problem of computing a policy that maximizes the total discounted reward earned over an infinite time horizon. While problems of this form are difficult in general, suboptimal solutions and performance bounds can be computed by approximating the dynamic programming value function. Here we provide a linear programming-based method for approximating the value function, and show how suboptimal policies can be computed through repeated solution of mixed integer programs that directly utilize this approximation. We have applied this approach to problems with states described by binary vectors with dimension as large as several hundred. Although the number of distinct states associated with such a problem is extremely large, we are able to obtain suboptimal policies with surprisingly tight performance guarantees. We illustrate the application of this method on a class of infinite horizon job shop scheduling problems.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Computing non-stationary (s, S) policies using mixed integer linear programming
    Xiang, Mengyuan
    Rossi, Roberto
    Martin-Barragan, Belen
    Tarim, S. Armagan
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2018, 271 (02) : 490 - 500
  • [2] Parametric Integer Programming Algorithm for Bilevel Mixed Integer Programs
    Koeppe, M.
    Queyranne, M.
    Ryan, C. T.
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2010, 146 (01) : 137 - 150
  • [3] Parametric Integer Programming Algorithm for Bilevel Mixed Integer Programs
    M. Köppe
    M. Queyranne
    C. T. Ryan
    Journal of Optimization Theory and Applications, 2010, 146 : 137 - 150
  • [4] A NOTE ON SOLVING QUADRATIC PROGRAMS USING MIXED-INTEGER PROGRAMMING
    RAGSDALE, C
    STAM, A
    COMPUTERS & OPERATIONS RESEARCH, 1989, 16 (04) : 393 - 395
  • [5] COMPUTING EFFICIENT STEADY-STATE POLICIES FOR DETERMINISTIC DYNAMIC PROGRAMS .1.
    FLYNN, J
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1992, 166 (01) : 175 - 191
  • [6] COMPUTING EFFICIENT STEADY-STATE POLICIES FOR DETERMINISTIC DYNAMIC PROGRAMS .2.
    FLYNN, J
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1993, 177 (01) : 75 - 99
  • [7] Lifting for mixed integer programs with variable upper bounds
    Shebalov, Sergey
    Park, Young Woong
    Klabjan, Diego
    DISCRETE APPLIED MATHEMATICS, 2015, 186 : 226 - 250
  • [8] A hierarchy of bounds for stochastic mixed-integer programs
    Burhaneddin Sandıkçı
    Nan Kong
    Andrew J. Schaefer
    Mathematical Programming, 2013, 138 : 253 - 272
  • [9] A hierarchy of bounds for stochastic mixed-integer programs
    Sandikci, Burhaneddin
    Kong, Nan
    Schaefer, Andrew J.
    MATHEMATICAL PROGRAMMING, 2013, 138 (1-2) : 253 - 272
  • [10] Valid Linear Programming Bounds for Exact Mixed-Integer Programming
    Steffy, Daniel E.
    Wolter, Kati
    INFORMS JOURNAL ON COMPUTING, 2013, 25 (02) : 271 - 284