Using Learned Policies in Heuristic-Search Planning

被引:0
|
作者
Yoon, SungWook [1 ]
Fern, Alan [1 ]
Givan, Robert [1 ]
机构
[1] Arizona State Univ, Tempe, AZ 85281 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many current state-of-the-art planners rely on forward heuristic search. The success of such search typically depends on heuristic distance-to-the-goal estimates derived from the plangraph. Such estimates are effective in guiding search for many domains, but there remain many other domains where current heuristics are inadequate to guide forward search effectively. In some of these domains, it is possible to learn reactive policies from example plans that solve many problems. However, due to the inductive nature of these learning techniques, the policies are often faulty, and fail to achieve high success rates. In this work, we consider how to effectively integrate imperfect learned policies with imperfect heuristics in order to improve over each alone. We propose a simple approach that uses the policy to augment the states expanded during each search step. In particular, during each search node expansion, we add not only its neighbors, but all the nodes along the trajectory followed by the policy from the node until some horizon. Empirical results show that our proposed approach benefits both of the leveraged automated techniques, learning and heuristic search, outperforming the state-of-the-art in most benchmark planning domains.
引用
收藏
页码:2047 / 2052
页数:6
相关论文
共 50 条
  • [1] PERSONNEL SCHEDULING USING HEURISTIC-SEARCH AND CONSTRAINTS
    POLLACK, RB
    [J]. EXPERT SYSTEMS AND THE LEADING EDGE IN PRODUCTION AND OPERATIONS MANAGEMENT: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE, 1989, : 567 - 575
  • [2] STRATEGIES IN HEURISTIC-SEARCH
    GEORGEFF, MP
    [J]. ARTIFICIAL INTELLIGENCE, 1983, 20 (04) : 393 - 425
  • [3] HEURISTIC-SEARCH THROUGH ISLANDS
    CHAKRABARTI, PP
    GHOSE, S
    DESARKAR, SC
    [J]. ARTIFICIAL INTELLIGENCE, 1986, 29 (03) : 339 - 347
  • [4] AND OR GRAPH HEURISTIC-SEARCH METHODS
    MAHANTI, A
    BAGCHI, A
    [J]. JOURNAL OF THE ACM, 1985, 32 (01) : 28 - 51
  • [5] MULTIOBJECTIVE HEURISTIC-SEARCH IN AND/OR GRAPHS
    LIAW, CF
    STEWART, BS
    WHITE, CC
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1995, 25 (11): : 1513 - 1521
  • [6] DISTRIBUTED CONSTRAINED HEURISTIC-SEARCH
    SYCARA, K
    ROTH, S
    SADEH, N
    FOX, M
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1991, 21 (06): : 1446 - 1461
  • [7] A UNIFYING APPROACH TO HEURISTIC-SEARCH
    EIBEN, AE
    AARTS, EHL
    VANHEE, KM
    NUIJTEN, WPM
    [J]. ANNALS OF OPERATIONS RESEARCH, 1995, 55 : 81 - 99
  • [8] HEURISTIC-SEARCH IN RESTRICTED MEMORY
    CHAKRABARTI, PP
    GHOSE, S
    ACHARYA, A
    DESARKAR, SC
    [J]. ARTIFICIAL INTELLIGENCE, 1989, 41 (02) : 197 - 221
  • [9] ADMISSIBLE HEURISTIC-SEARCH IN AND OR GRAPHS
    BAGCHI, A
    MAHANTI, A
    [J]. THEORETICAL COMPUTER SCIENCE, 1983, 24 (02) : 207 - 219
  • [10] EFFICIENT IMPLEMENTATION OF HEURISTIC-SEARCH
    CREMELIE, N
    MARTENS, JP
    [J]. ELECTRONICS LETTERS, 1994, 30 (21) : 1748 - 1749