A solving method of an MDP with a constraint by genetic algorithms

Cited by: 4
Authors
Hirayama, K [1]
Kawai, H
Affiliations
[1] Tottori Univ, Course Engn Social Dev, Tottori 680, Japan
[2] Tottori Univ, Dept Social Syst Engn, Tottori 680, Japan
Keywords
Markov decision processes; genetic algorithms; reward constraints; linear programming; pure and mixed policies
DOI
10.1016/S0895-7177(00)00084-4
Chinese Library Classification number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
We consider a discrete time Markov decision process (MDP) with a finite state space, a finite action space, and two kinds of immediate rewards. The problem is to maximize the time average reward generated by one reward stream, subject to the other reward not being smaller than a prescribed value. An MDP with a reward constraint can be solved by linear programming in the range of mixed policies. On the other hand, when we restrict ourselves to pure policies, the problem is a combinatorial problem, for which a solution has not been discovered. In this paper, we propose an approach by Genetic Algorithms (GAs) in order to obtain an effective search process and to obtain a near optimal, possibly optimal pure stationary policy. A numerical example is given to examine the efficiency of the approach proposed. (C) 2000 Elsevier Science Ltd. All rights reserved.
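The pure-policy search described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the penalty-based fitness, tournament selection, one-point crossover, elitism, and all names and parameters (`stationary`, `evaluate`, `fitness`, `genetic_search`, `penalty`, `mutation_rate`) are assumptions made for illustration. It also assumes every pure policy induces a chain with a single recurrent class, so the time-average rewards do not depend on the start state. Here `p[s][a]` is the next-state distribution, `r[s][a]` the reward to maximize, `c[s][a]` the constrained reward, and `alpha` the prescribed lower bound.

```python
import random


def stationary(P, iters=1000):
    """Approximate the stationary distribution of transition matrix P.

    Iterates the lazy chain 0.5*I + 0.5*P, which has the same stationary
    distribution as P but cannot be periodic.
    """
    n = len(P)
    d = [1.0 / n] * n
    for _ in range(iters):
        step = [sum(d[s] * P[s][j] for s in range(n)) for j in range(n)]
        d = [0.5 * d[j] + 0.5 * step[j] for j in range(n)]
    return d


def evaluate(policy, p, r, c):
    """Return the time-average values of both reward streams for a pure policy."""
    n = len(policy)
    P = [p[s][policy[s]] for s in range(n)]
    d = stationary(P)
    g_r = sum(d[s] * r[s][policy[s]] for s in range(n))
    g_c = sum(d[s] * c[s][policy[s]] for s in range(n))
    return g_r, g_c


def fitness(policy, p, r, c, alpha, penalty=100.0):
    """Average reward minus a penalty for violating g_c >= alpha (an assumption)."""
    g_r, g_c = evaluate(policy, p, r, c)
    return g_r - penalty * max(0.0, alpha - g_c)


def genetic_search(p, r, c, alpha, pop_size=20, generations=30,
                   mutation_rate=0.1, seed=0):
    """Search pure stationary policies, encoded as one action index per state."""
    rng = random.Random(seed)
    n_states, n_actions = len(p), len(p[0])
    cache = {}

    def score(pol):
        # Memoize fitness so each distinct policy is evaluated once.
        if pol not in cache:
            cache[pol] = fitness(pol, p, r, c, alpha)
        return cache[pol]

    pop = [tuple(rng.randrange(n_actions) for _ in range(n_states))
           for _ in range(pop_size)]
    for _ in range(generations):
        children = [max(pop, key=score)]  # elitism: keep the current best
        while len(children) < pop_size:
            # Binary tournament selection of two parents.
            a = max(rng.sample(pop, 2), key=score)
            b = max(rng.sample(pop, 2), key=score)
            # One-point crossover on the action string.
            cut = rng.randrange(1, n_states) if n_states > 1 else 0
            child = list(a[:cut] + b[cut:])
            # Mutation: resample an action with small probability.
            for s in range(n_states):
                if rng.random() < mutation_rate:
                    child[s] = rng.randrange(n_actions)
            children.append(tuple(child))
        pop = children
    return max(pop, key=score)
```

Calling `genetic_search(p, r, c, alpha)` returns a tuple of action indices, one per state. A result whose fitness still carries a penalty means no feasible pure policy was found, which is possible even when a feasible mixed policy exists.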
Pages: 165-173
Page count: 9