Learning Factored Markov Decision Processes with Unawareness

被引：0

作者：

Innes, Craig ^{[1
]}

Lascarides, Alex ^{[1
]}

机构：

[1] Univ Edinburgh, Sch Informat, Edinburgh EH8 9AB, Midlothian, Scotland

来源：

35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019) | 2020年 / 115卷

基金：

英国工程与自然科学研究理事会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Methods for learning and planning in sequential decision problems often assume the learner is aware of all possible states and actions in advance. This assumption is sometimes untenable. In this paper, we give a method to learn factored markov decision problems from both domain exploration and expert assistance, which guarantees convergence to near-optimal behaviour, even when the agent begins unaware of factors critical to success. Our experiments show our agent learns optimal behaviour on small and large problems, and that conserving information on discovering new possibilities results in faster convergence.

引用

页码：123 / 133

页数：11

共 50 条

[21] Building Optimal Operation Policies for Dam Management Using Factored Markov Decision Processes
Reyes, Alberto
Ibargueengoytia, Pablo H.
Romero, Ines
Pech, David
Borunda, Monica
[J]. ADVANCES IN ARTIFICIAL INTELLIGENCE AND ITS APPLICATIONS, MICAI 2015, PT II, 2015, 9414 : 475 - 484
[22] Online Learning in Kernelized Markov Decision Processes
Chowdhury, Sayak Ray
Gopalan, Aditya
[J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
[23] Blackwell Online Learning for Markov Decision Processes
Li, Tao
Peng, Guanze
Zhu, Quanyan
[J]. 2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021,
[24] Bayesian Learning of Noisy Markov Decision Processes
Singh, Sumeetpal S.
Chopin, Nicolas
Whiteley, Nick
[J]. ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2013, 23 (01):
[25] HIERARCHICAL REPRESENTATION LEARNING FOR MARKOV DECISION PROCESSES
Steccanella, Lorenzo
Jonsson, Anders
Totaro, Simone
[J]. CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 232, 2023, 232 : 568 - 585
[26] Episodic task learning in Markov decision processes
Yong Lin
Fillia Makedon
Yurong Xu
[J]. Artificial Intelligence Review, 2011, 36 : 87 - 98
[27] Learning Markov Decision Processes for Model Checking
Mao, Hua
Chen, Yingke
Jaeger, Manfred
Nielsen, Thomas D.
Larsen, Kim G.
Nielsen, Brian
[J]. ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2012, (103): : 49 - 63
[28] Robust Anytime Learning of Markov Decision Processes
Suilen, Marnix
Simao, Thiago D.
Parker, David
Jansen, Nils
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[29] Episodic task learning in Markov decision processes
Lin, Yong
Makedon, Fillia
Xu, Yurong
[J]. ARTIFICIAL INTELLIGENCE REVIEW, 2011, 36 (02) : 87 - 98
[30] LEARNING ALGORITHMS FOR MARKOV DECISION-PROCESSES
KURANO, M
[J]. JOURNAL OF APPLIED PROBABILITY, 1987, 24 (01) : 270 - 276

← 1 2 3 4 5 →