Learning Factored Markov Decision Processes with Unawareness

被引:0
|
作者
Innes, Craig [1 ]
Lascarides, Alex [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh EH8 9AB, Midlothian, Scotland
基金
英国工程与自然科学研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Methods for learning and planning in sequential decision problems often assume the learner is aware of all possible states and actions in advance. This assumption is sometimes untenable. In this paper, we give a method to learn factored markov decision problems from both domain exploration and expert assistance, which guarantees convergence to near-optimal behaviour, even when the agent begins unaware of factors critical to success. Our experiments show our agent learns optimal behaviour on small and large problems, and that conserving information on discovering new possibilities results in faster convergence.
引用
收藏
页码:123 / 133
页数:11
相关论文
共 50 条
  • [21] Building Optimal Operation Policies for Dam Management Using Factored Markov Decision Processes
    Reyes, Alberto
    Ibargueengoytia, Pablo H.
    Romero, Ines
    Pech, David
    Borunda, Monica
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE AND ITS APPLICATIONS, MICAI 2015, PT II, 2015, 9414 : 475 - 484
  • [22] Online Learning in Kernelized Markov Decision Processes
    Chowdhury, Sayak Ray
    Gopalan, Aditya
    [J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [23] Blackwell Online Learning for Markov Decision Processes
    Li, Tao
    Peng, Guanze
    Zhu, Quanyan
    [J]. 2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021,
  • [24] Bayesian Learning of Noisy Markov Decision Processes
    Singh, Sumeetpal S.
    Chopin, Nicolas
    Whiteley, Nick
    [J]. ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2013, 23 (01):
  • [25] HIERARCHICAL REPRESENTATION LEARNING FOR MARKOV DECISION PROCESSES
    Steccanella, Lorenzo
    Jonsson, Anders
    Totaro, Simone
    [J]. CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 232, 2023, 232 : 568 - 585
  • [26] Episodic task learning in Markov decision processes
    Yong Lin
    Fillia Makedon
    Yurong Xu
    [J]. Artificial Intelligence Review, 2011, 36 : 87 - 98
  • [27] Learning Markov Decision Processes for Model Checking
    Mao, Hua
    Chen, Yingke
    Jaeger, Manfred
    Nielsen, Thomas D.
    Larsen, Kim G.
    Nielsen, Brian
    [J]. ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2012, (103): : 49 - 63
  • [28] Robust Anytime Learning of Markov Decision Processes
    Suilen, Marnix
    Simao, Thiago D.
    Parker, David
    Jansen, Nils
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [29] Episodic task learning in Markov decision processes
    Lin, Yong
    Makedon, Fillia
    Xu, Yurong
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2011, 36 (02) : 87 - 98
  • [30] LEARNING ALGORITHMS FOR MARKOV DECISION-PROCESSES
    KURANO, M
    [J]. JOURNAL OF APPLIED PROBABILITY, 1987, 24 (01) : 270 - 276