6-D Object Pose Estimation Using Multiscale Point Cloud Transformer

被引:12
|
作者
Zhou, Guangliang [1 ]
Wang, Deming [1 ]
Yan, Yi [1 ]
Liu, Chengju [1 ]
Chen, Qijun [1 ]
机构
[1] Tongji Univ, Coll Elect & Informat Engn, Shanghai 201804, Peoples R China
关键词
3-D deep learning; object pose estimation; point cloud; self-attention; transformer;
D O I
10.1109/TIM.2022.3222467
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Predicting 6-D object pose is an essential task in vision measurement for robotic manipulation. RGB-based methods have a natural disadvantage due to the lack of 3-D information, thus leading to inferior results. Therefore, exploiting the geometry information in depth images is crucial to achieve accurate predictions. To this end, we propose a multiscale point cloud transformer (MSPCT) to better learn point cloud feature representations. MSPCT mainly consists of three types of modules: local transformer (LT), DownSampling (DS) module, and global transformer (GT). Specifically, the LT is designed to dynamically divide a local region centered on each point and further extract point-level feature with local context awareness. The DS module is utilized to decrease the resolution and enlarge the receptive field. GT is employed to capture global-range dependencies between extracted local features. Based on the proposed transformer blocks, we design a network architecture for object pose estimation, where we further obtain multiscale features by fusing the local features from LT and global features from GT to predict object's pose. Extensive experiments verify the effectiveness of LT and GT, and our pose estimation pipeline achieves promising results on three benchmark datasets.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] 6-D Object Pose Estimation Using Multiscale Point Cloud Transformer
    Zhou, Guangliang
    Wang, Deming
    Yan, Yi
    Liu, Chengju
    Chen, Qijun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [2] MODELING AND INTERPRETING 6-D OBJECT POSE ESTIMATION
    Soler, Diego
    Hirata, Roberto, Jr.
    Espadoto, Mateus
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2325 - 2329
  • [3] Challenges for Monocular 6-D Object Pose Estimation in Robotics
    Thalhammer, Stefan
    Bauer, Dominik
    Hoenig, Peter
    Weibel, Jean-Baptiste
    Garcia-Rodriguez, Jose
    Vincze, Markus
    IEEE TRANSACTIONS ON ROBOTICS, 2024, 40 : 4065 - 4084
  • [4] 6-D Object Pose Estimation Based on Point Pair Matching for Robotic Grasp Detection
    Yu, Sheng
    Zhai, Di-Hua
    Zhan, Yufeng
    Wang, Wencai
    Guan, Yuyin
    Xia, Yuanqing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [5] Compression of Plenoptic Point Cloud Attributes Using 6-D Point Clouds and 6-D Transforms
    Krivokuca, Maja
    Miandji, Ehsan
    Guillemot, Christine
    Chou, Philip A. A.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 593 - 607
  • [6] Pose Estimation of Rigid Object in Point Cloud
    Liu Zongming
    Liu Guodong
    Li Jianxun
    Ye Dong
    2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016, : 708 - 713
  • [7] Efficient MSPSO Sampling for Object Detection and 6-D Pose Estimation in 3-D Scenes
    Xing, Xuejun
    Guo, Jianwei
    Nan, Liangliang
    Gu, Qingyi
    Zhang, Xiaopeng
    Yan, Dong-Ming
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (10) : 10281 - 10291
  • [8] Center-Based Decoupled Point Cloud Registration for 6D Object Pose Estimation
    Jiang, Haobo
    Dang, Zheng
    Gu, Shuo
    Xie, Jin
    Salzmann, Mathieu
    Yang, Jian
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 3404 - 3414
  • [9] PointPoseNet: Point Pose Network for Robust 6D Object Pose Estimation
    Chen, Wei
    Duan, Jinming
    Basevi, Hector
    Chang, Hyung Jin
    Leonardis, Ales
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2813 - 2822
  • [10] Efficient Center Voting for Object Detection and 6D Pose Estimation in 3D Point Cloud
    Guo, Jianwei
    Xing, Xuejun
    Quan, Weize
    Yan, Dong-Ming
    Gu, Qingyi
    Liu, Yang
    Zhang, Xiaopeng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 5072 - 5084