Robot Manipulation of Dynamic Object with Vision-based Reinforcement Learning

被引：0

作者：

Liu, Chenchen ^{[1
]}

Zhang, Zhengshen ^{[1
]}

Zhou, Lei ^{[1
]}

Liu, Zhiyang ^{[1
]}

Ang, Marcelo H., Jr. ^{[2
]}

Lu, Wenfeng ^{[2
]}

Tay, Francis E. H. ^{[2
]}

机构：

[1] Natl Univ Singapore, Adv Robot Ctr, Singapore, Singapore

[2] Natl Univ Singapore, Dept Mech Engn, Singapore, Singapore

来源：

2024 9TH INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTICS ENGINEERING, ICCRE 2024 | 2024年

关键词：

Robot Manipulation; Reinforcement Learning; Deep Learning; Computer Vision; MOVING OBJECT; TRACKING;

D O I：

10.1109/ICCRE61448.2024.10589748

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the advanced development of AI, Deep Reinforcement Learning (DRL) is one way to let robots learn how to complete tasks independently. Based on DRL algorithms, robot manipulation in the tabletop scene is regarded as a typical environment for testing and validation. Within this context, we designed a vision-based scene to explore the performance of DRL algorithms in manipulating a dynamic object. Contrary to previous work with simple moving trajectories, the complexity of our approach was increased by applying a randomized vibrating trajectory. To better capture visual cues in motion, we also incorporated a Long Short-term Memory (LSTM) module to process the temporal information between the serial frames. By comparing with setups that only used basic CNN encoders, we found the addition of LSTM can drastically increase the learning efficiency under model-free RL algorithms like SAC. The learning inflection point of applying LSTM appeared at 300k time steps while the others were delayed to 500k time steps or even later, which underscores the necessity of extracting temporal features in dynamic scenes. Furthermore, we observed that by integrating LSTM, there is no need to explicitly append kinematic information, which also supports the idea that temporal information inherently encodes kinematics data.

引用

页码：21 / 26

页数：6

共 50 条

[1] Vision-based reinforcement learning for robot navigation
Zhu, WY
Levinson, S
[J]. IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 1025 - 1030
[2] Object manipulation by learning stereo vision-based robots
Nguyen, MC
Graefe, V
[J]. IROS 2001: PROCEEDINGS OF THE 2001 IEEE/RJS INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4: EXPANDING THE SOCIETAL ROLE OF ROBOTICS IN THE NEXT MILLENNIUM, 2001, : 146 - 151
[3] Reinforcement Learning for Vision-based Object Manipulation with Non-parametric Policy and Action Primitives
Son, Dongwon
Kim, Myungsin
Sim, Jaecheol
Shin, Wonsik
[J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 5756 - 5763
[4] Vision-based reinforcement learning control of soft robot manipulators
Li, Jinzhou
Ma, Jie
Hu, Yujie
Zhang, Li
Liu, Zhijie
Sun, Shiying
[J]. ROBOTIC INTELLIGENCE AND AUTOMATION, 2024, : 783 - 790
[5] Learning object-specific vision-based manipulation in virtual environments
Matsikis, A
Zoumpoulidis, T
Broicher, FH
Kraiss, KF
[J]. IEEE ROMAN 2002, PROCEEDINGS, 2002, : 204 - 210
[6] Vision-based Belt Manipulation by Humanoid Robot
Qin, Yili
Escande, Adrien
Tanguy, Arnaud
Yoshida, Eiichi
[J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 3547 - 3552
[7] How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation
Lee, Alex X.
Devin, Coline
Springenberg, Jost Tobias
Zhou, Yuxiang
Lampe, Thomas
Abdolmaleki, Abbas
Bousmalis, Konstantinos
[J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 2468 - 2475
[8] Purposive behavior acquisition for a real robot by vision-based reinforcement learning
Asada, M
Noda, S
Tawaratsumida, S
Hosoda, K
[J]. MACHINE LEARNING, 1996, 23 (2-3) : 279 - 303
[9] Vision-based manipulation with the humanoid robot Romeo
Claudio, Giovanni
Spindler, Fabien
Chaumette, Francois
[J]. 2016 IEEE-RAS 16TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2016, : 286 - 293
[10] Learning vision-based robotic manipulation tasks sequentially in offline reinforcement learning settings
Yadav, Sudhir Pratap
Nagar, Rajendra
Shah, Suril V.
[J]. ROBOTICA, 2024, 42 (06) : 1715 - 1730

← 1 2 3 4 5 →