Learning Sliding Policy of Flat Multi-target Objects in Clutter Scenes

Authors
Wu, Liangdong [1]
Wu, Jiaxi [2]
Li, Zhengwei [3]
Chen, Yurou [2]
Liu, Zhiyong [1, 2, 4]
Affiliations
[1] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
[4] Chinese Acad Sci, Cloud Comp Ctr, Dongguan, Guangdong, Peoples R China
Source
INFORMATION TECHNOLOGY AND CONTROL | 2024 / Vol. 53 / No. 1
Keywords
Deep Learning in Manipulation; Reinforcement Learning; Robot Control; Intelligent System; Sliding Policy
DOI
10.5755/j01.itc.53.1.34708
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In cluttered scenes, a robot often has to retrieve one or several targets, which is a hard manipulation task. It is especially challenging when the targets are flat objects such as books or plates, because of the limitations of common robot end-effectors. A pre-grasp operation such as sliding makes it feasible to rearrange objects and shift a target toward the table edge, where the robot can grasp it from the side. This paper formulates the task as a Parameterized Action Markov Decision Process and solves it with deep reinforcement learning. Mask images are used as one of the observations fed to the network, to avoid the impact of noise in the original image. To improve data utilization, a weight-sharing policy network predicts the parameters of the sliding primitive for each object, and a Q-network then selects the optimal object on which to execute the slide. An extra reward mechanism is adopted to improve the efficiency of task actions when there are multiple targets. In addition, an adaptive policy scaling algorithm is proposed to improve the speed and adaptability of policy training. Both in simulation and on a real system, the method achieves a higher task success rate and requires fewer actions to complete the flat multi-target sliding manipulation task in cluttered scenes, which verifies its effectiveness.
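The two-stage action selection described in the abstract (a shared policy proposes slide parameters for every object, then a Q-function picks which object to act on) can be illustrated with a minimal sketch. This is not the paper's implementation: the linear "networks", the feature size `FEAT_DIM`, the two-dimensional slide parameters, and all function names are assumptions made for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)

FEAT_DIM = 8   # hypothetical per-object feature size (e.g. an encoded mask image)
PARAM_DIM = 2  # hypothetical slide parameters, e.g. direction and distance

# One set of weights is reused for every object: this is the weight sharing.
# Random linear maps stand in for the trained networks (assumption).
W_policy = rng.normal(size=(FEAT_DIM, PARAM_DIM))
W_q = rng.normal(size=(FEAT_DIM + PARAM_DIM,))


def policy(features):
    """Map per-object features (N, FEAT_DIM) to slide parameters in [-1, 1]."""
    return np.tanh(features @ W_policy)


def q_values(features, params):
    """Score each (object, parameters) pair with a shared linear Q-function."""
    return np.concatenate([features, params], axis=1) @ W_q


def select_action(features):
    """Return the index of the object to slide and its continuous parameters."""
    params = policy(features)             # continuous part of the action
    scores = q_values(features, params)   # one score per candidate object
    best = int(np.argmax(scores))         # discrete part of the action
    return best, params[best]


object_features = rng.normal(size=(5, FEAT_DIM))  # five objects in the scene
idx, slide_params = select_action(object_features)
print(idx, slide_params)
```

Because the same weights are applied to each row, the sketch handles any number of objects without changing the model, which is one plausible reading of why weight sharing helps data utilization.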
Pages: 5-18
Number of pages: 14