Offline and Online Time in Sequential Decision-Making Problems

被引：0

作者：

Soni, Aman ^{[1
]}

Lewis, Peter R. ^{[1
]}

Ekart, Aniko ^{[1
]}

机构：

[1] Aston Univ, Sch Engn & Appl Sci, ALICE, Birmingham B4 7ET, W Midlands, England

来源：

PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI) | 2016年

关键词：

EVOLUTIONARY ALGORITHMS; DYNAMIC OPTIMIZATION;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A connection has recently been drawn between Dynamic Optimization Problems (DOPs) and Reinforcement Learning Problems (RLPs) where they can be seen as subsets of a broader class of Sequential Decision-Making Problems (SDMPs). SDMPs require new decisions on an ongoing basis. Typically the underlying environment changes between decisions. The SDMP view is useful as it allows the unified space to be explored. Solutions can be designed for characteristics of problem instances using algorithms from either community. Little has been done on comparing algorithm performance across these communities, particularly under real-world resource constraints. In this paper we lay the theoretical foundations for the concept of offline and online time in SDMPs. We implement a method, based on the theoretical formulations, to limit offline time on representative algorithms. We investigate the online performance on a Conceptual Moving Peaks Benchmark (CMPB). Our results show that the performance of an Evolutionary Dynamic Optimisation (EDO) algorithm depends on the offline time constraint while the performance of an EDO-hybrid is noticeably impacted only past a lower bound on the size of the state-action space. Our method evaluates the effects of resource constraints on online algorithm performance and is a promising start to a rigorous method of algorithm selection for real-world problems.

引用

页数：8

共 50 条

[1] Unifying offline and online simulation for online decision-making
Liu, Haitao
Liang, Jinpeng
Lee, Loo Hay
Chew, Ek Peng
IISE TRANSACTIONS, 2022, 54 (10) : 923 - 935
[2] A Real-Time Computational Learning Model for Sequential Decision-Making Problems Under Uncertainty
Malikopoulos, Andreas A.
Papalambros, Panos Y.
Assanis, Dennis N.
JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 2009, 131 (04): : 1 - 8
[3] FALLIBILITY AND SEQUENTIAL DECISION-MAKING
KOH, WTH
JOURNAL OF INSTITUTIONAL AND THEORETICAL ECONOMICS-ZEITSCHRIFT FUR DIE GESAMTE STAATSWISSENSCHAFT, 1994, 150 (02): : 362 - 374
[4] SEQUENTIAL DECISION-MAKING - MODEL
DECKARD, BS
PUBLIC CHOICE, 1976, 26 : 89 - 103
[5] Creative Self-efficacy, Political Decision-making, and Offline and Online Political Participation
Kushin, Matthew J.
Dalisay, Francis
Kim, Jinhee
Forbes, Amy
David, Clarissa C.
Somera, Lilnabeth P.
JOURNAL OF CREATIVE COMMUNICATIONS, 2022, 17 (03) : 270 - 287
[6] STOCHASTIC DECISION-MAKING TREES - SOLVING SEQUENTIAL DECISION-MAKING PROBLEMS UNDER CONDITIONS OF RISK - GERMAN - KLAUSMANN,HS
REICHARDT, H
JAHRBUCHER FUR NATIONALOKONOMIE UND STATISTIK, 1977, 192 (3-4): : 366 - 367
[7] Representation Matters: Offline Pretraining for Sequential Decision Making
Yang, Mengjiao
Nachum, Ofir
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[8] Online decision aids: the role of decision-making styles and decision-making stages
Virdi, Preeti
Kalro, Arti D.
Sharma, Dinesh
INTERNATIONAL JOURNAL OF RETAIL & DISTRIBUTION MANAGEMENT, 2020, 48 (06) : 555 - 574
[9] Triangular Neutrosophic Cognitive Map for Multistage Sequential Decision-Making Problems
Salah Hasan Al-subhi
Elpiniki I. Papageorgiou
Pedro Piñero Pérez
Gaafar Sadeq S. Mahdi
Luis Alvarado Acuña
International Journal of Fuzzy Systems, 2021, 23 : 657 - 679
[10] Triangular Neutrosophic Cognitive Map for Multistage Sequential Decision-Making Problems
Al-subhi, Salah Hasan
Papageorgiou, Elpiniki I.
Perez, Pedro Pinero
Mahdi, Gaafar Sadeq S.
Acuna, Luis Alvarado
INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2021, 23 (03) : 657 - 679

← 1 2 3 4 5 →