Layered State Discovery for Incremental Autonomous Exploration

被引:0
|
作者
Chen, Liyu [1 ]
Tirinzoni, Andrea [2 ]
Lazaric, Alessandro [2 ]
Pirotta, Matteo [2 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
[2] Meta, Menlo Pk, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the autonomous exploration (AX) problem proposed by Lim & Auer (2012). In this setting, the objective is to discover a set of epsilon-optimal policies reaching a set S-L(->) of incrementally L-controllable states. We introduce a novel layered decomposition of the set of incrementally L-controllable states that is based on the iterative application of a state-expansion operator. We leverage these results to design Layered Autonomous Exploration (LAE), a novel algorithm for AX that attains a sample complexity of (O) over tilde (LSL(1+epsilon)->Gamma(L(1+epsilon))A log(12)(S-L(1+epsilon)(->))/epsilon(2)), where S-L(1+epsilon)(->) is the number of states that are incrementally L(1 + epsilon)-controllable, A is the number of actions, and Gamma((1+epsilon)) is the branching factor of the transitions over such states. LAE improves over the algorithm of Tarbouriech et al. (2020b) by a factor of L-2 and it is the first algorithm for AX that works in a countably-infinite state space. Moreover, we show that, under a certain identifiability assumption, LAE achieves minimax-optimal sample complexity of (O) over tilde (LSL -> A log(12) (S-L(->))/epsilon(2)), outperforming existing algorithms and matching for the first time the lower bound proved by Cai et al. (2022) up to logarithmic factors.
引用
收藏
页数:49
相关论文
共 50 条
  • [1] Improved Sample Complexity for Incremental Autonomous Exploration in MDPs
    Tarbouriech, Jean
    Pirotta, Matteo
    Valko, Michal
    Lazaric, Alessandro
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [2] Autonomous Robotic Exploration by Incremental Road Map Construction
    Wang, Chaoqun
    Chi, Wenzheng
    Sun, Yuxiang
    Meng, Max Q. -H.
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2019, 16 (04) : 1720 - 1731
  • [3] Online Incremental Learning of the Terrain Traversal Cost in Autonomous Exploration
    Pragr, Milos
    Cizek, Petr
    Bayer, Jan
    Faigl, Jan
    [J]. ROBOTICS: SCIENCE AND SYSTEMS XV, 2019,
  • [4] Autonomous UAV Exploration of Dynamic Environments Via Incremental Sampling and Probabilistic Roadmap
    Xu, Zhefan
    Deng, Di
    Shimada, Kenji
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 2729 - 2736
  • [5] UMAT Initial Exploration of Incremental Disturbed State Model of soil
    Liu Pinghui
    Fang Houguo
    Li Yuqin
    [J]. 2011 INTERNATIONAL CONFERENCE ON FUTURE COMPUTER SCIENCE AND APPLICATION (FCSA 2011), VOL 4, 2011, : 6 - 8
  • [6] Multi-state autonomous drilling for lunar exploration
    Chen Chongbin
    Quan Qiquan
    Shi Xiaomeng
    Deng Zongquan
    Tang Dewei
    Jiang Shengyuan
    [J]. Chinese Journal of Aeronautics, 2016, (05) : 1397 - 1404
  • [7] Multi-state autonomous drilling for lunar exploration
    Chen Chongbin
    Quan Qiquan
    Shi Xiaomeng
    Deng Zongquan
    Tang Dewei
    Jiang Shengyuan
    [J]. Chinese Journal of Aeronautics., 2016, 29 (05) - 1404
  • [8] Multi-state autonomous drilling for lunar exploration
    Chen Chongbin
    Quan Qiquan
    Shi Xiaomeng
    Deng Zongquan
    Tang Dewei
    Jiang Shengyuan
    [J]. CHINESE JOURNAL OF AERONAUTICS, 2016, 29 (05) : 1397 - 1404
  • [9] Multi-state autonomous drilling for lunar exploration
    [J]. Quan, Qiquan (quanqiquan@hit.edu.cn), 1600, Chinese Journal of Aeronautics (29):
  • [10] Autonomous Discovery of Subgoals Using Acyclic State Trajectories
    Jin, Zhao
    Jin, Jian
    Liu, WeiYi
    [J]. INFORMATION COMPUTING AND APPLICATIONS, 2010, 6377 : 49 - +