Offline and Online Time in Sequential Decision-Making Problems

Cited by: 0
Authors
Soni, Aman [1]
Lewis, Peter R. [1]
Ekart, Aniko [1]
Affiliations
[1] Aston Univ, Sch Engn & Appl Sci, ALICE, Birmingham B4 7ET, W Midlands, England
Keywords
EVOLUTIONARY ALGORITHMS; DYNAMIC OPTIMIZATION
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A connection has recently been drawn between Dynamic Optimization Problems (DOPs) and Reinforcement Learning Problems (RLPs) where they can be seen as subsets of a broader class of Sequential Decision-Making Problems (SDMPs). SDMPs require new decisions on an ongoing basis. Typically the underlying environment changes between decisions. The SDMP view is useful as it allows the unified space to be explored. Solutions can be designed for characteristics of problem instances using algorithms from either community. Little has been done on comparing algorithm performance across these communities, particularly under real-world resource constraints. In this paper we lay the theoretical foundations for the concept of offline and online time in SDMPs. We implement a method, based on the theoretical formulations, to limit offline time on representative algorithms. We investigate the online performance on a Conceptual Moving Peaks Benchmark (CMPB). Our results show that the performance of an Evolutionary Dynamic Optimisation (EDO) algorithm depends on the offline time constraint while the performance of an EDO-hybrid is noticeably impacted only past a lower bound on the size of the state-action space. Our method evaluates the effects of resource constraints on online algorithm performance and is a promising start to a rigorous method of algorithm selection for real-world problems.
Pages: 8