Learning Landmark Selection Policies for Mapping Unknown Environments

被引:0
|
作者
Strasdat, Hauke [1 ]
Stachniss, Cyrill [2 ]
Burgard, Wolfram [2 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Dept Comp, 180 Queens Gate, London SW7 2AZ, England
[2] Univ Freiburg, Dept Comp Sci, D-79110 Freiburg, Germany
来源
ROBOTICS RESEARCH | 2011年 / 70卷
关键词
D O I
暂无
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In general, a mobile robot that operates in unknown environments has to maintain a map and has to determine its own location given the map. This introduces significant computational and memory constraints for most autonomous systems, especially for lightweight robots such as humanoids or flying vehicles. In this paper, we present a universal approach for learning a landmark selection policy that allows a robot to discard landmarks that are not valuable for its current navigation task. This enables the robot to reduce the computational burden and to carry out its task more efficiently by maintaining only the important landmarks. Our approach applies an unscented Kalman filter for addressing the simultaneous localization and mapping problem and uses Monte-Carlo reinforcement learning to obtain the selection policy. In addition to that, we present a technique to compress learned policies without introducing a performance loss. In this way, our approach becomes applicable on systems with constrained memory resources. Based on real world and simulation experiments, we show that the learned policies allow for efficient robot navigation and outperform handcrafted strategies. We furthermore demonstrate that the learned policies are not only usable in a specific scenario but can also be generalized towards environments with varying properties.
引用
收藏
页码:483 / +
页数:3
相关论文
共 50 条
  • [41] Exploration of Unknown Environments Using Deep Reinforcement Learning
    McCalmon, Joseph
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15970 - 15971
  • [42] A Reinforcement Learning Framework for Space Missions in Unknown Environments
    Tavallali, Peyman
    Karumanchi, Sisir
    Bowkett, Joseph
    Reid, William
    Kennedy, Brett
    2020 IEEE AEROSPACE CONFERENCE (AEROCONF 2020), 2020,
  • [43] Learning and strategy selection in probabilistic environments
    Gaissmaier, W
    PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON COGNITIVE MODELING, 2004, : 406 - 407
  • [44] Learning sensor data characteristics in unknown environments.
    Bokareva, Tatiana
    Bulusu, Nirupama
    Jha, Sanjay
    2006 THIRD ANNUAL INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS SYSTEMS: NETWORKING & SERVICES, 2006, : 11 - +
  • [45] Max Weight Learning Algorithms for Scheduling in Unknown Environments
    Neely, Michael J.
    Rager, Scott T.
    La Porta, Thomas F.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2012, 57 (05) : 1179 - 1191
  • [46] MetaNet: Automated Dynamic Selection of Scheduling Policies in Cloud Environments
    Tuli, Shreshth
    Casale, Giuliano
    Jennings, Nicholas R.
    2022 IEEE 15TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2022), 2022, : 331 - 341
  • [47] Policies for Assisted Virtual Machine Selection in Cloud Computing Environments
    Teixeira, Mario Meireles
    Bestavros, Azer
    2015 XXXIII BRAZILIAN SYMPOSIUM ON COMPUTER NETWORKS AND DISTRIBUTED SYSTEMS, 2015, : 228 - 236
  • [48] Mapping Unknown Environments through Passive Deformation of Soft, Growing Robots
    Fuentes, Francesco
    Blumenschein, Laura H.
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 2522 - 2527
  • [49] Mapping of Unknown Environments using Minimal Sensing from a Stochastic Swarm
    Dirafzoon, Alireza
    Betthauser, Joseph
    Schornick, Jeff
    Benavides, Daniel
    Lobaton, Edgar
    2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014), 2014, : 3842 - 3849
  • [50] Exploration and Mapping of Unknown Polygonal Environments Based on Uncertain Range Data
    Dakulovic, Marija
    Iles, Sandor
    Petrovic, Ivan
    AUTOMATIKA, 2011, 52 (02) : 118 - 131