Comparative criteria for partially observable contingent planning

被引:0
|
作者
Dorin Shmaryahu
Guy Shani
Jörg Hoffmann
机构
[1] Ben Gurion University of the Negev,
[2] Saarland University,undefined
关键词
Planning; Contingent planning; Comparative Criteria; Plan tree; Partial observability;
D O I
暂无
中图分类号
学科分类号
摘要
In contingent planning under partial observability with sensing actions, agents actively use sensing to discover meaningful facts about the world. The solution can be represented as a plan tree or graph, branching on various possible observations. Typically in contingent planning one seeks a satisfying plan leading to a goal state at each leaf. In many applications, however, one may prefer some satisfying plans to others, such as plans that lead to the goal with a lower average cost. However, methods such as average cost make an implicit assumption concerning the probabilities of outcomes, which may not apply when the stochastic dynamics of the environment are unknown. We focus on the problem of providing valid comparative criteria for contingent plan trees and graphs, allowing us to compare two plans and decide which one is preferable. We suggest a set of such comparison criteria—plan simplicity, dominance, and best and worst plan costs.We also argue that in some cases certain branches of the plan correspond to an unlikely combination of mishaps, and can be ignored, and provide methods for pruning such unlikely branches before comparing the plan graphs. We explain these criteria, and discuss their validity, correlations, and application to real world problems. We also suggest efficient algorithms for computing the comparative criteria where needed. We provide experimental results, showing that existing contingent planners provide diverse plans, that can be compared using these criteria.
引用
收藏
页码:481 / 517
页数:36
相关论文
共 50 条
  • [31] Feature extraction for decision-theoretic planning in partially observable environments
    Fujita, Hajime
    Nakamura, Yutaka
    Ishii, Shin
    ARTIFICIAL NEURAL NETWORKS - ICANN 2006, PT 1, 2006, 4131 : 820 - 829
  • [32] Planning in Partially Observable Domains with Fuzzy Epistemic States and Probabilistic Dynamics
    Drougard, Nicolas
    Dubois, Didier
    Farges, Jean-Loup
    Teichteil-Koenigsbuch, Florent
    SCALABLE UNCERTAINTY MANAGEMENT (SUM 2015), 2015, 9310 : 220 - 233
  • [33] Information-guided Planning: An Online Approach for Partially Observable Problems
    do Carmo Alves, Matheus Aparecido
    Varma, Amokh
    Elkhatib, Yehia
    Marcolino, Leandro Soriano
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [34] Policy Reuse for Learning and Planning in Partially Observable Markov Decision Processes
    Wu, Bo
    Feng, Yanpeng
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2017, : 549 - 552
  • [35] A Bayesian Approach for Learning and Planning in Partially Observable Markov Decision Processes
    Ross, Stephane
    Pineau, Joelle
    Chaib-draa, Brahim
    Kreitmann, Pierre
    JOURNAL OF MACHINE LEARNING RESEARCH, 2011, 12 : 1729 - 1770
  • [36] Planning treatment of ischemic heart disease with partially observable Markov decision processes
    Hauskrecht, M
    Fraser, H
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2000, 18 (03) : 221 - 244
  • [37] Influence of State-Variable Constraints on Partially Observable Monte Carlo Planning
    Castellini, Alberto
    Chalkiadakis, Georgios
    Farinelli, Alessandro
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 5540 - 5546
  • [38] PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES WITH PARTIALLY OBSERVABLE RANDOM DISCOUNT FACTORS
    Martinez-Garcia, E. Everardo
    Minjarez-Sosa, J. Adolfo
    Vega-Amaya, Oscar
    KYBERNETIKA, 2022, 58 (06) : 960 - 983
  • [39] Traffic Route Planning in Partially Observable Environment Using Actions Group Representation
    Luo, Minzhong
    Yu, Shan
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2021, PT II, 2021, 12816 : 101 - 113
  • [40] Risk-aware shielding of Partially Observable Monte Carlo Planning policies 
    Mazzi, Giulio
    Castellini, Alberto
    Farinelli, Alessandro
    ARTIFICIAL INTELLIGENCE, 2023, 324