Offline and Online Time in Sequential Decision-Making Problems

被引:0
|
作者
Soni, Aman [1 ]
Lewis, Peter R. [1 ]
Ekart, Aniko [1 ]
机构
[1] Aston Univ, Sch Engn & Appl Sci, ALICE, Birmingham B4 7ET, W Midlands, England
关键词
EVOLUTIONARY ALGORITHMS; DYNAMIC OPTIMIZATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A connection has recently been drawn between Dynamic Optimization Problems (DOPs) and Reinforcement Learning Problems (RLPs) where they can be seen as subsets of a broader class of Sequential Decision-Making Problems (SDMPs). SDMPs require new decisions on an ongoing basis. Typically the underlying environment changes between decisions. The SDMP view is useful as it allows the unified space to be explored. Solutions can be designed for characteristics of problem instances using algorithms from either community. Little has been done on comparing algorithm performance across these communities, particularly under real-world resource constraints. In this paper we lay the theoretical foundations for the concept of offline and online time in SDMPs. We implement a method, based on the theoretical formulations, to limit offline time on representative algorithms. We investigate the online performance on a Conceptual Moving Peaks Benchmark (CMPB). Our results show that the performance of an Evolutionary Dynamic Optimisation (EDO) algorithm depends on the offline time constraint while the performance of an EDO-hybrid is noticeably impacted only past a lower bound on the size of the state-action space. Our method evaluates the effects of resource constraints on online algorithm performance and is a promising start to a rigorous method of algorithm selection for real-world problems.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Subjective optimality in finite sequential decision-making
    Sin, Yeonju J.
    Seon, HeeYoung
    Shin, Yun Kyoung J.
    Kwon, Oh-Sang
    Chung, Dongil J.
    PLOS COMPUTATIONAL BIOLOGY, 2021, 17 (12)
  • [42] SEQUENTIAL MULTI-CRITERION DECISION-MAKING
    KORNBLUTH, JSH
    OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 1985, 13 (06): : 569 - 574
  • [43] THE STRATEGIC VALUE OF FLEXIBILITY IN SEQUENTIAL DECISION-MAKING
    BENJAAFAR, S
    MORIN, TL
    TALAVAGE, JJ
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1995, 82 (03) : 438 - 457
  • [44] Model-Free Online Learning in Unknown Sequential Decision Making Problems and Games
    Farina, Gabriele
    Sandholm, Tuomas
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 5381 - 5390
  • [45] Structure Learning in Human Sequential Decision-Making
    Acuna, Daniel E.
    Schrater, Paul
    PLOS COMPUTATIONAL BIOLOGY, 2010, 6 (12)
  • [46] MODELS OF OPTIMAL STRATEGIES IN SEQUENTIAL DECISION-MAKING
    SAZYKIN, BV
    SOVIET JOURNAL OF COMPUTER AND SYSTEMS SCIENCES, 1989, 27 (06): : 99 - 105
  • [47] Decision-making in sequential projects: expected time-to-build and probability of failure
    Moells, Sascha H.
    Schild, Karl-Heinz
    REVIEW OF QUANTITATIVE FINANCE AND ACCOUNTING, 2012, 39 (01) : 1 - 25
  • [48] Reproductive decision-making in the context of hereditary cancer: the effects of an online decision aid on informed decision-making
    Reumkens, Kelly
    Tummers, Marly H. E.
    Severijns, Yil
    Gietel-Habets, Joyce J. G.
    van Kuijk, Sander M. J.
    Aalfs, Cora M.
    van Asperen, Christi J.
    Ausems, Margreet G. E. M.
    Collee, Margriet
    Dommering, Charlotte J.
    Kets, Marleen
    van der Kolk, Lizet E.
    Oosterwijk, Jan C.
    Tjan-Heijnen, Vivianne C. G.
    van der Weijden, Trudy
    de Die-Smulders, Christine E. M.
    van Osch, Liesbeth A. D. M.
    JOURNAL OF COMMUNITY GENETICS, 2021, 12 (01) : 101 - 110
  • [49] Reproductive decision-making in the context of hereditary cancer: the effects of an online decision aid on informed decision-making
    Kelly Reumkens
    Marly H. E. Tummers
    Yil Severijns
    Joyce J. G. Gietel-Habets
    Sander M. J. van Kuijk
    Cora M. Aalfs
    Christi J. van Asperen
    Margreet G. E. M. Ausems
    Margriet Collée
    Charlotte J. Dommering
    Marleen Kets
    Lizet E. van der Kolk
    Jan C. Oosterwijk
    Vivianne C. G. Tjan-Heijnen
    Trudy van der Weijden
    Christine E. M. de Die-Smulders
    Liesbeth A. D. M. van Osch
    Journal of Community Genetics, 2021, 12 : 101 - 110
  • [50] Logic-Based Sequential Decision-Making
    Lyu, Daoming
    Yang, Fangkai
    Liu, Bo
    Yoon, Daesub
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9995 - 9996