Learning Sliding Policy of Flat Multi-target Objects in Clutter Scenes

被引：0

作者：

Wu, Liangdong ^{[1
]}

Wu, Jiaxi ^{[2
]}

Li, Zhengwei ^{[3
]}

Chen, Yurou ^{[2
]}

Liu, Zhiyong ^{[1
,2
,4
]}

机构：

[1] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China

[2] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China

[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China

[4] Chinese Acad Sci, Cloud Comp Ctr, Dongguan, Guangdong, Peoples R China

来源：

INFORMATION TECHNOLOGY AND CONTROL | 2024年 / 53卷 / 01期

关键词：

Deep Learning in Manipulation; Reinforcement Learning; Robot Control; Intelligent system; sliding policy;

D O I：

10.5755/j01.itc.53.1.34708

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In clutter scenes, one or several targets need to be obtained, which is hard for robot manipulation task. Especially, when the targets are flat objects like book, plates, due to limitation of common robot end-effectors, it will be more challenging. By employing pre-grasp operation like sliding, it becomes feasible to rearrange objects and shift the target towards table edge, enabling the robot to grasp it from a lateral perspective. In this paper, the proposed method transfers the task into a Parameterized Action Markov Decision Process to solve the problem, which is based on deep reinforcement learning. The mask images are taken as one of observations to the network for avoiding the impact of noise of original image. In order to improve data utilization, the policy network predicts the parameters for the sliding primitive of each object, which is weight-sharing, and then the Q-network selects the optimal execution target. Meanwhile, extra reward mechanism is adopted for improving the efficiency of task actions to cope with multiple targets. In addition, an adaptive policy scaling algorithm is proposed to improve the speed and adaptability of policy training. In both simulation and real system, our method achieves a higher task success rate and requires fewer actions to accomplish the flat multi-target sliding manipulation task within clutter scene, which verifies the effectiveness of ours.

引用

页码：5 / 18

页数：14

共 50 条

[31] Offline Estimation and Online Update Algorithm of Clutter Intensity in Multi-target Tracking
Gong, Yang
Cui, Chen
2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2020), 2020, : 848 - 853
[32] Ballistic missile tracking in the presence of clutter using multi-target tracking algorithm
Asad, Muhammad
Khan, Sumair
Arif, Muhammad
Mehmood, Zahid
Durrani, Sajjad
Khan, Uzair
PROCEEDINGS OF 2017 14TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2017, : 357 - 360
[33] Rauch-Tung-Striebel Smoothing Linear Multi-Target Tracking in Clutter
Memon, Sufyan Ali
Kim, Wan-Gu
Park, Min-Seuk
Attique, Muhammad
IEEE ACCESS, 2022, 10 : 3007 - 3016
[34] Multi-Target Tracking Based on Multi-Bernoulli Filter with Amplitude for Unknown Clutter Rate
Yuan, Changshun
Wang, Jun
Lei, Peng
Bi, Yanxian
Sun, Zhongsheng
SENSORS, 2015, 15 (12) : 30385 - 30402
[35] Adaptive Estimation of Spatial Clutter Measurement Density Using Clutter Measurement Probability for Enhanced Multi-Target Tracking
Park, Seung Hyo
Chong, Sa Yong
Kim, Hyung June
Song, Taek Lyul
SENSORS, 2020, 20 (01)
[36] Spatial Clutter Measurement Density Estimation with the Clutter Probability for Improving Multi-target Tracking Performance in Cluttered Environments
Park, Seung Hyo
Xie, Yifan
Han, Du Hee
Song, Taek Lyul
2018 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (ICCAIS), 2018, : 90 - 95
[37] Learning Moving Objects in a Multi-Target Tracking Scenario for Mobile Robots that use Laser Range Measurements
Kondaxakis, Polychronis
Baltzakis, Hans
Trahanias, Panos
2009 IEEE-RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2009, : 1667 - 1672
[38] Research on Multi-Target Assignment Method for Clusters Based on Deep Deterministic Policy Gradient Learning
Li, Qiaoyi
Wang, Zhengjie
Zhang, Xiaoning
Cheng, Qiyuan
Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2024, 44 (10): : 1051 - 1057
[39] A new multi-target tracking algorithm for a large number of orbiting objects
Delande, E.
Houssineau, J.
Franco, J.
Frueh, C.
Clark, D.
Jah, M.
ADVANCES IN SPACE RESEARCH, 2019, 64 (03) : 645 - 667
[40] Waveform Design Based Multi-Target Hypothesis Testing Under Unknown Clutter Parameters
Zhu, Bingqi
Gao, Yesheng
Sheng, Hui
Wang, Kaizhi
Liu, Xingzhao
2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 6617 - 6620

← 1 2 3 4 5 →