Planning Large-scale Object Rearrangement Using Deep Reinforcement Learning

被引:1
|
作者
Ghosh, Sourav [1 ]
Das, Dipanjan [1 ]
Chakraborty, Abhishek [1 ]
Agarwal, Marichi [1 ]
Bhowmick, Brojeshwar [1 ]
机构
[1] TCS Res, Robot & Autonomous Syst, Chennai, Tamil Nadu, India
关键词
D O I
10.1109/IJCNN55064.2022.9889793
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object rearrangement is about moving a set of objects from an initial state to a goal state through task and motion planning. Existing methods either show poor scalability in number of objects they can handle, or do not generalize well across situations, or need explicit running buffers to avoid collisions during placements. In this paper, we propose a deep-RL based task planning method to solve large-scale object rearrangement problems. Given the source and target state of objects in the form of images, our method determines a collision-free object movement plan. Our method produces a feasible plan in discrete-continuous action space where picking the selected objects are discrete actions followed by a set of continuous actions to place the object. We propose a novel hierarchical dense reward structure to train our deep-RL network to make our method more sample efficient using the AI2Thor simulator. We show that our method works well on unseen publicly available datasets and on a publicly available simulation environment such as Pybullet thereby demonstrating the superiority of our method in terms of generalizability. To the best of our knowledge, our method is the first one that demonstrates the rearrangement across different scenarios from 2D surfaces such as tabletops to 3D rooms with a large number of objects and without any explicit need of buffer space.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Cooperative Deep Reinforcement Learning for Large-Scale Traffic Grid Signal Control
    Tan, Tian
    Bao, Feng
    Deng, Yue
    Jin, Alex
    Dai, Qionghai
    Wang, Jie
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (06) : 2687 - 2700
  • [22] Distributed Hierarchical Deep Reinforcement Learning for Large-Scale Grid Emergency Control
    Chen, Yixi
    Zhu, Jizhong
    Liu, Yun
    Zhang, Le
    Zhou, Jialin
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2024, 39 (02) : 4446 - 4458
  • [23] Large-Scale Traffic Signal Control Using Constrained Network Partition and Adaptive Deep Reinforcement Learning
    Gu, Hankang
    Wang, Shangbo
    Ma, Xiaoguang
    Jia, Dongyao
    Mao, Guoqiang
    Lim, Eng Gee
    Wong, Cheuk Pong Ryan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (07) : 7619 - 7632
  • [24] Large-scale Deep Learning at Baidu
    Yu, Kai
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 2211 - 2211
  • [25] Large-Scale Mobile App Identification Using Deep Learning
    Rezaei, Shahbaz
    Kroencke, Bryce
    Liu, Xin
    IEEE ACCESS, 2020, 8 : 348 - 362
  • [26] Rich Punctuations Prediction Using Large-scale Deep Learning
    Wu, Xueyang
    Zhu, Su
    Wu, Yue
    Yu, Kai
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [27] Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control
    Chu, Tianshu
    Wang, Jie
    Codeca, Lara
    Li, Zhaojian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1086 - 1095
  • [28] Rearrangement with Nonprehensile Manipulation Using Deep Reinforcement Learning
    Yuan, Weihao
    Stork, Johannes A.
    Kragic, Danica
    Wang, Michael Y.
    Hang, Kaiyu
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 270 - 277
  • [29] AUTONOMOUS NAVIGATION OF UAV IN LARGE-SCALE UNKNOWN COMPLEX ENVIRONMENT WITH DEEP REINFORCEMENT LEARNING
    Wang, Chao
    Wang, Jian
    Zhang, Xudong
    Zhang, Xiao
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 858 - 862
  • [30] A DEEP REINFORCEMENT LEARNING APPROACH TO FLOCKING AND NAVIGATION OF UAVS IN LARGE-SCALE COMPLEX ENVIRONMENTS
    Wang, Chao
    Wang, Jian
    Zhang, Xudong
    2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 1228 - 1232