Robot Manipulation of Dynamic Object with Vision-based Reinforcement Learning

被引:0
|
作者
Liu, Chenchen [1 ]
Zhang, Zhengshen [1 ]
Zhou, Lei [1 ]
Liu, Zhiyang [1 ]
Ang, Marcelo H., Jr. [2 ]
Lu, Wenfeng [2 ]
Tay, Francis E. H. [2 ]
机构
[1] Natl Univ Singapore, Adv Robot Ctr, Singapore, Singapore
[2] Natl Univ Singapore, Dept Mech Engn, Singapore, Singapore
关键词
Robot Manipulation; Reinforcement Learning; Deep Learning; Computer Vision; MOVING OBJECT; TRACKING;
D O I
10.1109/ICCRE61448.2024.10589748
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the advanced development of AI, Deep Reinforcement Learning (DRL) is one way to let robots learn how to complete tasks independently. Based on DRL algorithms, robot manipulation in the tabletop scene is regarded as a typical environment for testing and validation. Within this context, we designed a vision-based scene to explore the performance of DRL algorithms in manipulating a dynamic object. Contrary to previous work with simple moving trajectories, the complexity of our approach was increased by applying a randomized vibrating trajectory. To better capture visual cues in motion, we also incorporated a Long Short-term Memory (LSTM) module to process the temporal information between the serial frames. By comparing with setups that only used basic CNN encoders, we found the addition of LSTM can drastically increase the learning efficiency under model-free RL algorithms like SAC. The learning inflection point of applying LSTM appeared at 300k time steps while the others were delayed to 500k time steps or even later, which underscores the necessity of extracting temporal features in dynamic scenes. Furthermore, we observed that by integrating LSTM, there is no need to explicitly append kinematic information, which also supports the idea that temporal information inherently encodes kinematics data.
引用
收藏
页码:21 / 26
页数:6
相关论文
共 50 条
  • [1] Vision-based reinforcement learning for robot navigation
    Zhu, WY
    Levinson, S
    [J]. IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 1025 - 1030
  • [2] Object manipulation by learning stereo vision-based robots
    Nguyen, MC
    Graefe, V
    [J]. IROS 2001: PROCEEDINGS OF THE 2001 IEEE/RJS INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4: EXPANDING THE SOCIETAL ROLE OF ROBOTICS IN THE NEXT MILLENNIUM, 2001, : 146 - 151
  • [3] Reinforcement Learning for Vision-based Object Manipulation with Non-parametric Policy and Action Primitives
    Son, Dongwon
    Kim, Myungsin
    Sim, Jaecheol
    Shin, Wonsik
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 5756 - 5763
  • [4] Vision-based reinforcement learning control of soft robot manipulators
    Li, Jinzhou
    Ma, Jie
    Hu, Yujie
    Zhang, Li
    Liu, Zhijie
    Sun, Shiying
    [J]. ROBOTIC INTELLIGENCE AND AUTOMATION, 2024, : 783 - 790
  • [5] Learning object-specific vision-based manipulation in virtual environments
    Matsikis, A
    Zoumpoulidis, T
    Broicher, FH
    Kraiss, KF
    [J]. IEEE ROMAN 2002, PROCEEDINGS, 2002, : 204 - 210
  • [6] Vision-based Belt Manipulation by Humanoid Robot
    Qin, Yili
    Escande, Adrien
    Tanguy, Arnaud
    Yoshida, Eiichi
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 3547 - 3552
  • [7] How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation
    Lee, Alex X.
    Devin, Coline
    Springenberg, Jost Tobias
    Zhou, Yuxiang
    Lampe, Thomas
    Abdolmaleki, Abbas
    Bousmalis, Konstantinos
    [J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 2468 - 2475
  • [8] Purposive behavior acquisition for a real robot by vision-based reinforcement learning
    Asada, M
    Noda, S
    Tawaratsumida, S
    Hosoda, K
    [J]. MACHINE LEARNING, 1996, 23 (2-3) : 279 - 303
  • [9] Vision-based manipulation with the humanoid robot Romeo
    Claudio, Giovanni
    Spindler, Fabien
    Chaumette, Francois
    [J]. 2016 IEEE-RAS 16TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2016, : 286 - 293
  • [10] Learning vision-based robotic manipulation tasks sequentially in offline reinforcement learning settings
    Yadav, Sudhir Pratap
    Nagar, Rajendra
    Shah, Suril V.
    [J]. ROBOTICA, 2024, 42 (06) : 1715 - 1730