Using Learned Policies in Heuristic-Search Planning

被引:0
|
作者
Yoon, SungWook [1 ]
Fern, Alan [1 ]
Givan, Robert [1 ]
机构
[1] Arizona State Univ, Tempe, AZ 85281 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many current state-of-the-art planners rely on forward heuristic search. The success of such search typically depends on heuristic distance-to-the-goal estimates derived from the plangraph. Such estimates are effective in guiding search for many domains, but there remain many other domains where current heuristics are inadequate to guide forward search effectively. In some of these domains, it is possible to learn reactive policies from example plans that solve many problems. However, due to the inductive nature of these learning techniques, the policies are often faulty, and fail to achieve high success rates. In this work, we consider how to effectively integrate imperfect learned policies with imperfect heuristics in order to improve over each alone. We propose a simple approach that uses the policy to augment the states expanded during each search step. In particular, during each search node expansion, we add not only its neighbors, but all the nodes along the trajectory followed by the policy from the node until some horizon. Empirical results show that our proposed approach benefits both of the leveraged automated techniques, learning and heuristic search, outperforming the state-of-the-art in most benchmark planning domains.
引用
收藏
页码:2047 / 2052
页数:6
相关论文
共 50 条
  • [31] A PARALLEL HEURISTIC-SEARCH TECHNIQUE FOR STRING COMPARISON
    HADLOCK, F
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE RESEARCH, VOL 1, 1989, 1 : 187 - 205
  • [32] PRA - MASSIVELY-PARALLEL HEURISTIC-SEARCH
    EVETT, M
    HENDLER, J
    MAHANTI, A
    NAU, D
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1995, 25 (02) : 133 - 143
  • [33] UTILITY OF PATHMAX IN PARTIAL ORDER HEURISTIC-SEARCH
    DASGUPTA, P
    CHAKRABARTI, PP
    DESARKAR, SC
    [J]. INFORMATION PROCESSING LETTERS, 1995, 55 (06) : 317 - 322
  • [34] A NEW HEURISTIC-SEARCH TECHNIQUE - ALGORITHM SA
    ZHANG, B
    ZHANG, L
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1985, 7 (01) : 103 - 107
  • [35] DISTRIBUTION-SYSTEM SERVICE RESTORATION USING A HEURISTIC-SEARCH APPROACH
    HSU, YY
    HUANG, HM
    KUO, HC
    PENG, SK
    CHANG, CW
    CHANG, KJ
    YU, HS
    CHOW, CE
    KUO, RT
    [J]. IEEE TRANSACTIONS ON POWER DELIVERY, 1992, 7 (02) : 734 - 740
  • [36] A BIBLIOGRAPHY OF HEURISTIC-SEARCH RESEARCH THROUGH 1992
    STEWART, BS
    LIAW, CF
    WHITE, CC
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1994, 24 (02): : 268 - 293
  • [37] IMPLEMENTATION OF HEURISTIC-SEARCH STRATEGIES FOR DISTRIBUTION FEEDER RECONFIGURATION
    TAYLOR, T
    LUBKEMAN, D
    [J]. IEEE TRANSACTIONS ON POWER DELIVERY, 1990, 5 (01) : 239 - 246
  • [38] PROBABILISTIC ANALYSIS OF OUTPUT COST OF A HEURISTIC-SEARCH ALGORITHM
    SRIMANI, PK
    [J]. INFORMATION SCIENCES, 1989, 47 (01) : 53 - 62
  • [39] REGION-BASED FRACTAL IMAGE COMPRESSION USING HEURISTIC-SEARCH
    THOMAS, L
    DERAVI, F
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 1995, 4 (06) : 832 - 838
  • [40] SCHEDULING FLEXIBLE MANUFACTURING SYSTEMS USING PETRI NETS AND HEURISTIC-SEARCH
    LEE, DY
    DICESARE, F
    [J]. IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1994, 10 (02): : 123 - 132