Learning Sliding Policy of Flat Multi-target Objects in Clutter Scenes

被引:0
|
作者
Wu, Liangdong [1 ]
Wu, Jiaxi [2 ]
Li, Zhengwei [3 ]
Chen, Yurou [2 ]
Liu, Zhiyong [1 ,2 ,4 ]
机构
[1] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
[4] Chinese Acad Sci, Cloud Comp Ctr, Dongguan, Guangdong, Peoples R China
来源
INFORMATION TECHNOLOGY AND CONTROL | 2024年 / 53卷 / 01期
关键词
Deep Learning in Manipulation; Reinforcement Learning; Robot Control; Intelligent system; sliding policy;
D O I
10.5755/j01.itc.53.1.34708
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In clutter scenes, one or several targets need to be obtained, which is hard for robot manipulation task. Especially, when the targets are flat objects like book, plates, due to limitation of common robot end-effectors, it will be more challenging. By employing pre-grasp operation like sliding, it becomes feasible to rearrange objects and shift the target towards table edge, enabling the robot to grasp it from a lateral perspective. In this paper, the proposed method transfers the task into a Parameterized Action Markov Decision Process to solve the problem, which is based on deep reinforcement learning. The mask images are taken as one of observations to the network for avoiding the impact of noise of original image. In order to improve data utilization, the policy network predicts the parameters for the sliding primitive of each object, which is weight-sharing, and then the Q-network selects the optimal execution target. Meanwhile, extra reward mechanism is adopted for improving the efficiency of task actions to cope with multiple targets. In addition, an adaptive policy scaling algorithm is proposed to improve the speed and adaptability of policy training. In both simulation and real system, our method achieves a higher task success rate and requires fewer actions to accomplish the flat multi-target sliding manipulation task within clutter scene, which verifies the effectiveness of ours.
引用
收藏
页码:5 / 18
页数:14
相关论文
共 50 条
  • [31] Offline Estimation and Online Update Algorithm of Clutter Intensity in Multi-target Tracking
    Gong, Yang
    Cui, Chen
    2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2020), 2020, : 848 - 853
  • [32] Ballistic missile tracking in the presence of clutter using multi-target tracking algorithm
    Asad, Muhammad
    Khan, Sumair
    Arif, Muhammad
    Mehmood, Zahid
    Durrani, Sajjad
    Khan, Uzair
    PROCEEDINGS OF 2017 14TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2017, : 357 - 360
  • [33] Rauch-Tung-Striebel Smoothing Linear Multi-Target Tracking in Clutter
    Memon, Sufyan Ali
    Kim, Wan-Gu
    Park, Min-Seuk
    Attique, Muhammad
    IEEE ACCESS, 2022, 10 : 3007 - 3016
  • [34] Multi-Target Tracking Based on Multi-Bernoulli Filter with Amplitude for Unknown Clutter Rate
    Yuan, Changshun
    Wang, Jun
    Lei, Peng
    Bi, Yanxian
    Sun, Zhongsheng
    SENSORS, 2015, 15 (12) : 30385 - 30402
  • [35] Adaptive Estimation of Spatial Clutter Measurement Density Using Clutter Measurement Probability for Enhanced Multi-Target Tracking
    Park, Seung Hyo
    Chong, Sa Yong
    Kim, Hyung June
    Song, Taek Lyul
    SENSORS, 2020, 20 (01)
  • [36] Spatial Clutter Measurement Density Estimation with the Clutter Probability for Improving Multi-target Tracking Performance in Cluttered Environments
    Park, Seung Hyo
    Xie, Yifan
    Han, Du Hee
    Song, Taek Lyul
    2018 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (ICCAIS), 2018, : 90 - 95
  • [37] Learning Moving Objects in a Multi-Target Tracking Scenario for Mobile Robots that use Laser Range Measurements
    Kondaxakis, Polychronis
    Baltzakis, Hans
    Trahanias, Panos
    2009 IEEE-RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2009, : 1667 - 1672
  • [38] Research on Multi-Target Assignment Method for Clusters Based on Deep Deterministic Policy Gradient Learning
    Li, Qiaoyi
    Wang, Zhengjie
    Zhang, Xiaoning
    Cheng, Qiyuan
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2024, 44 (10): : 1051 - 1057
  • [39] A new multi-target tracking algorithm for a large number of orbiting objects
    Delande, E.
    Houssineau, J.
    Franco, J.
    Frueh, C.
    Clark, D.
    Jah, M.
    ADVANCES IN SPACE RESEARCH, 2019, 64 (03) : 645 - 667
  • [40] Waveform Design Based Multi-Target Hypothesis Testing Under Unknown Clutter Parameters
    Zhu, Bingqi
    Gao, Yesheng
    Sheng, Hui
    Wang, Kaizhi
    Liu, Xingzhao
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 6617 - 6620