Improving Exploration in UCT Using Local Manifolds

被引:0
|
作者
Srinivasan, Sriram [1 ]
Talvitie, Erik [2 ]
Bowling, Michael [1 ]
机构
[1] Univ Alberta, Edmonton, AB, Canada
[2] Franklin & Marshall Coll, Lancaster, PA 17604 USA
关键词
SEARCH;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monte Carlo planning has been proven successful in many sequential decision-making settings, but it suffers from poor exploration when the rewards are sparse. In this paper, we improve exploration in UCT by generalizing across similar states using a given distance metric. When the state space does not have a natural distance metric, we show how we can learn a local manifold from the transition graph of states in the near future. to obtain a distance metric. On domains inspired by video games, empirical evidence shows that our algorithm is more sample efficient than UCT, particularly when rewards are sparse.
引用
收藏
页码:3386 / 3392
页数:7
相关论文
共 50 条
  • [21] LOCAL PSEUDOCONVEXITY IN KAHLER MANIFOLDS
    ELENCWAJG, G
    ANNALES DE L INSTITUT FOURIER, 1975, 25 (02) : 295 - 314
  • [22] LOCAL SECTIONS OF FLOWS ON MANIFOLDS
    CHEWNING, WC
    OWEN, RS
    PROCEEDINGS OF THE AMERICAN MATHEMATICAL SOCIETY, 1975, 49 (01) : 71 - 77
  • [23] Local midpoints on smooth manifolds
    Kim, Sejong
    Lawson, Jimmie
    DIFFERENTIAL GEOMETRY AND ITS APPLICATIONS, 2015, 39 : 129 - 146
  • [24] Local Convexity on Smooth Manifolds
    T. Rapcsák
    Journal of Optimization Theory and Applications, 2005, 127 : 165 - 176
  • [25] Local convexity on smooth manifolds
    Rapcsák, T
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2005, 127 (01) : 165 - 176
  • [26] REDUCIBLE MANIFOLDS AND LOCAL PRODUCTS
    SHAPIRO, YL
    DOKLADY AKADEMII NAUK SSSR, 1972, 206 (06): : 1305 - &
  • [27] Exploration Without Global Consistency Using Local Volume Consolidation
    Cieslewski, Titus
    Ziegler, Andreas
    Scaramuzza, Davide
    ROBOTICS RESEARCH: THE 19TH INTERNATIONAL SYMPOSIUM ISRR, 2022, 20 : 559 - 574
  • [28] Exhaustive local chemical space exploration using a transformer model
    Tibo, Alessandro
    He, Jiazhen
    Janet, Jon Paul
    Nittinger, Eva
    Engkvist, Ola
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [29] Improving local Wiener filtering using matched filter
    Kazubek, M.
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2006, 153 (04): : 501 - 506
  • [30] Improving image clarity using local feature dimension
    Lowe, Thomas
    IET IMAGE PROCESSING, 2015, 9 (07) : 553 - 559