RGBManip: Monocular Image-based Robotic Manipulation through Active Object Pose Estimation

被引:0
|
作者
An, Boshi [1 ,2 ]
Geng, Yiran [1 ,2 ]
Chen, Kai [4 ]
Li, Xiaoqi [1 ,2 ,3 ]
Dou, Qi [4 ]
Dong, Hao [1 ,2 ]
机构
[1] Peking Univ, Sch CS, Hyperplane Lab, Beijing, Peoples R China
[2] Natl Key Lab Multimedia Informat Proc, Beijing, Peoples R China
[3] Beijing Acad Artificial Intelligence BAAI, Beijing, Peoples R China
[4] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICRA57147.2024.10610690
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Robotic manipulation requires accurate perception of the environment, which poses a significant challenge due to its inherent complexity and constantly changing nature. In this context, RGB image and point-cloud observations are two commonly used modalities in visual-based robotic manipulation, but each of these modalities have their own limitations. Commercial point-cloud observations often suffer from issues like sparse sampling and noisy output due to the limits of the emission-reception imaging principle. On the other hand, RGB images, while rich in texture information, lack essential depth and 3D information crucial for robotic manipulation. To mitigate these challenges, we propose an image-only robotic manipulation framework that leverages an eye-on-hand monocular camera installed on the robot's parallel gripper. By moving with the robot gripper, this camera gains the ability to actively perceive the object from multiple perspectives during the manipulation process. This enables the estimation of 6D object poses, which can be utilized for manipulation. While, obtaining images from more and diverse viewpoints typically improves pose estimation, it also increases the manipulation time. To address this trade-off, we employ a reinforcement learning policy to synchronize the manipulation strategy with active perception, achieving a balance between 6D pose accuracy and manipulation efficiency. Our experimental results in both simulated and real-world environments showcase the state-of-the-art effectiveness of our approach. We believe that our method will inspire further research on real-world-oriented robotic manipulation. See https://rgbmanip.github.io/for more details.
引用
收藏
页码:7748 / 7755
页数:8
相关论文
共 50 条
  • [1] OBJECT POSE ESTIMATION IN MONOCULAR IMAGE USING MODIFIED FDCM
    Dabbour, Abd Alrazzak
    Habib, Rabie
    Saii, Mariam
    COMPUTER SCIENCE-AGH, 2020, 21 (01): : 97 - 112
  • [2] A Survey of Robotic Monocular Pose Estimation
    Zhang, Kun
    Song, Guozheng
    Ai, Qinglin
    SENSORS, 2025, 25 (05)
  • [3] 6D Pose Estimation of Transparent Object From Single RGB Image for Robotic Manipulation
    Byambaa, Munkhtulga
    Koutaki, Gou
    Choimaa, Lodoiravsal
    IEEE ACCESS, 2022, 10 : 114897 - 114906
  • [4] Image-Based Pose Estimation of an Endoscopic Instrument
    Reilink, Rob
    Stramigioli, Stefano
    Misra, Sarthak
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012, : 3555 - 3556
  • [5] A Novel Image-Based Object Orientation Estimation Algorithm for Robotic Manipulator Applications
    Tsai, Chi-Yi
    Wong, Ching-Chang
    Liu, Tsung-Yen
    Tsao, An-Hung
    IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS SYSTEMS (ISPACS 2012), 2012,
  • [6] Exploring Tactile Temporal Features for Object Pose Estimation during Robotic Manipulation
    Galaiya, Viral Rasik
    Asfour, Mohammed
    de Oliveira, Thiago Eustaquio Alves
    Jiang, Xianta
    da Fonseca, Vinicius Prado
    SENSORS, 2023, 23 (09)
  • [7] Object Recognition and Full Pose Registration from a Single Image for Robotic Manipulation
    Collet, Alvaro
    Berenson, Dmitry
    Srinivasa, Siddhartha S.
    Ferguson, Dave
    ICRA: 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-7, 2009, : 3534 - +
  • [8] Object recognition and pose estimation for robotic manipulation using color cooccurrence histograms
    Ekvall, S
    Hoffmann, F
    Kragic, D
    IROS 2003: PROCEEDINGS OF THE 2003 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2003, : 1284 - 1289
  • [9] Object manipulation system based on image-based reinforcement learning
    Kim, Sunin
    Jo, HyunJun
    Song, Jae-Bok
    INTELLIGENT SERVICE ROBOTICS, 2022, 15 (02) : 171 - 177
  • [10] Object manipulation system based on image-based reinforcement learning
    Sunin Kim
    HyunJun Jo
    Jae-Bok Song
    Intelligent Service Robotics, 2022, 15 : 171 - 177