Offline and Online Time in Sequential Decision-Making Problems

Authors
Soni, Aman [1]
Lewis, Peter R. [1]
Ekart, Aniko [1]
Affiliations
[1] Aston Univ, Sch Engn & Appl Sci, ALICE, Birmingham B4 7ET, W Midlands, England
Keywords
EVOLUTIONARY ALGORITHMS; DYNAMIC OPTIMIZATION
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A connection has recently been drawn between Dynamic Optimization Problems (DOPs) and Reinforcement Learning Problems (RLPs), in which both can be seen as subsets of a broader class of Sequential Decision-Making Problems (SDMPs). SDMPs require new decisions on an ongoing basis, and typically the underlying environment changes between decisions. The SDMP view is useful because it allows the unified space to be explored: solutions can be designed for the characteristics of problem instances using algorithms from either community. Little has been done to compare algorithm performance across these communities, particularly under real-world resource constraints. In this paper we lay the theoretical foundations for the concept of offline and online time in SDMPs. We implement a method, based on the theoretical formulations, to limit offline time on representative algorithms. We investigate the online performance on a Conceptual Moving Peaks Benchmark (CMPB). Our results show that the performance of an Evolutionary Dynamic Optimisation (EDO) algorithm depends on the offline time constraint, while the performance of an EDO-hybrid is noticeably impacted only past a lower bound on the size of the state-action space. Our method evaluates the effects of resource constraints on online algorithm performance and is a promising start to a rigorous method of algorithm selection for real-world problems.
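The idea of limiting offline time between decisions can be illustrated with a minimal sketch. This is not the authors' method or the CMPB: it assumes a toy one-dimensional landscape with a single drifting peak, a simple (1+1)-style hill climber, and offline time modeled as a count of fitness evaluations rather than wall-clock time. All names (peak_fitness, run_episode, offline_budget) are hypothetical.

```python
import random


def peak_fitness(x, peak):
    """Toy single-peak landscape (assumption): maximum of 0 at x == peak."""
    return -abs(x - peak)


def run_episode(offline_budget, steps=50, seed=0):
    """Mean online fitness when each decision gets `offline_budget` evaluations."""
    env_rng = random.Random(seed)         # drives the environment only
    search_rng = random.Random(seed + 1)  # drives the search only
    peak = 0.0   # hidden optimum, drifts between decisions
    x = 5.0      # current decision, carried over from step to step
    total = 0.0
    for _ in range(steps):
        # Offline phase: hill climbing, capped by the evaluation budget.
        best = peak_fitness(x, peak)
        for _ in range(offline_budget):
            candidate = x + search_rng.gauss(0.0, 0.5)
            score = peak_fitness(candidate, peak)
            if score >= best:
                x, best = candidate, score
        # Online phase: the decision is committed and scored.
        total += peak_fitness(x, peak)
        # The environment changes before the next decision.
        peak += env_rng.uniform(-1.0, 1.0)
    return total / steps


if __name__ == "__main__":
    for budget in (0, 5, 50):
        print(budget, run_episode(budget))
```

Using separate random generators for the environment and the search keeps the peak trajectory identical across budgets, so any difference in the mean online fitness comes from the offline budget alone.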
Pages: 8