A solving method of an MDP with a constraint by genetic algorithms

被引:4
|
作者
Hirayama, K [1 ]
Kawai, H
机构
[1] Tottori Univ, Course Engn Social Dev, Tottori 680, Japan
[2] Tottori Univ, Dept Social Syst Engn, Tottori 680, Japan
关键词
Markov decision processes; genetic algorithms; reward constraints; linear programming; pure and mixed policies;
D O I
10.1016/S0895-7177(00)00084-4
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We consider a discrete time Markov decision process (MDP) with a finite state space, a finite action space, and two kinds of immediate rewards. The problem is to maximize the time average reward generated by one reward stream, subject to the other reward not being smaller than a prescribed value. An MDP with a reward constraint can be solved by linear programming in the range of mixed policies. On the other hand, when we restrict ourselves to pure policies, the problem is a combinatorial problem, for which a solution has not been discovered. In this paper, we propose an approach by Genetic Algorithms (GAs) in order to obtain an effective search process and to obtain a near optimal, possibly optimal pure stationary policy. A numerical example is given to examine the efficiency of the approach proposed. (C) 2000 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:165 / 173
页数:9
相关论文
共 50 条
  • [41] Geometric constraint solving: The witness configuration method
    Michelucci, D
    Foufou, S
    COMPUTER-AIDED DESIGN, 2006, 38 (04) : 284 - 299
  • [42] A Constraint Solving Method for Collaborative Product Development
    Liu, Tingting
    Zhang, Zhaoying
    Wang, Huifen
    COOPERATIVE DESIGN, VISUALIZATION, AND ENGINEERING, 2010, 6240 : 239 - 246
  • [43] Solving ordinary differential equations using genetic algorithms and the Taylor series matrix method
    Gutierrez-Navarro, Daniel
    Lopez-Aguayo, Servando
    JOURNAL OF PHYSICS COMMUNICATIONS, 2018, 2 (11):
  • [44] A combined genetic algorithms-shooting method approach to solving optimal control problems
    Sim, YC
    Leng, SB
    Subramaniam, V
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2000, 31 (01) : 83 - 89
  • [45] Genetic algorithms in solving graph partitioning problem
    Shazely, S
    Baraka, H
    Abdel-Wahab, A
    Kamal, H
    MULTIPLE APPROACHES TO INTELLIGENT SYSTEMS, PROCEEDINGS, 1999, 1611 : 155 - 164
  • [46] Application of genetic algorithms for solving transport problems
    Polkovnikova, Natalia A.
    Polkovnikov, Anatoly K.
    MARINE INTELLECTUAL TECHNOLOGIES, 2022, (03): : 265 - 273
  • [47] Solving traveling salesman problems by genetic algorithms
    LEE Heow Pueh
    LIM Siak Piang
    LEE Kwok Hong
    ProgressinNaturalScience, 2003, (02) : 57 - 63
  • [48] ON SOLVING TRAVELING SALESMAN PROBLEMS BY GENETIC ALGORITHMS
    BRAUN, H
    LECTURE NOTES IN COMPUTER SCIENCE, 1991, 496 : 129 - 133
  • [49] Solving QoS multicast routing with genetic algorithms
    Tran, HT
    Harris, RJ
    ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 1944 - 1948
  • [50] Solving traveling salesman problems by genetic algorithms
    Liang, YC
    Ge, HW
    Zhou, CG
    Lee, HP
    Lin, WZ
    Lim, SP
    Lee, KH
    PROGRESS IN NATURAL SCIENCE-MATERIALS INTERNATIONAL, 2003, 13 (02) : 135 - 141