6-D Object Pose Estimation Using Multiscale Point Cloud Transformer

Cited by: 12
Authors
Zhou, Guangliang [1]
Wang, Deming [1]
Yan, Yi [1]
Liu, Chengju [1]
Chen, Qijun [1]
Affiliations
[1] Tongji Univ, Coll Elect & Informat Engn, Shanghai 201804, Peoples R China
Keywords
3-D deep learning; object pose estimation; point cloud; self-attention; transformer
DOI
10.1109/TIM.2022.3222467
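Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract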
Predicting 6-D object pose is an essential task in vision measurement for robotic manipulation. RGB-based methods lack 3-D information, which puts them at a natural disadvantage and leads to inferior results. Exploiting the geometric information in depth images is therefore crucial for accurate predictions. To this end, we propose a multiscale point cloud transformer (MSPCT) to better learn point cloud feature representations. MSPCT consists mainly of three types of modules: a local transformer (LT), a downsampling (DS) module, and a global transformer (GT). Specifically, the LT dynamically divides a local region centered on each point and extracts point-level features with local context awareness. The DS module decreases the resolution and enlarges the receptive field. The GT captures global-range dependencies between the extracted local features. Based on the proposed transformer blocks, we design a network architecture for object pose estimation in which multiscale features, obtained by fusing the local features from the LT with the global features from the GT, are used to predict the object's pose. Extensive experiments verify the effectiveness of the LT and GT, and our pose estimation pipeline achieves promising results on three benchmark datasets.
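The LT, DS, and GT stages described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not say how local regions are formed, how points are downsampled, or how features are fused, so the sketch assumes k-nearest-neighbor grouping for the LT, farthest point sampling for the DS module, single-head dot-product attention with random (untrained) projection weights, and concatenation for the fusion step. All function names and parameters are hypothetical.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def random_projections(c, rng):
    # Untrained query/key/value weights; a real network would learn these.
    return [rng.standard_normal((c, c)) / np.sqrt(c) for _ in range(3)]

def knn_indices(points, k):
    # Assumption: a local region is the k nearest neighbors (self included).
    d2 = ((points[:, None, :] - points[None, :, :]) ** 2).sum(-1)
    return np.argsort(d2, axis=1)[:, :k]

def local_transformer(points, feats, k, rng):
    # LT sketch: each point attends only to the points in its local region.
    c = feats.shape[1]
    wq, wk, wv = random_projections(c, rng)
    idx = knn_indices(points, k)                 # (N, k)
    q = feats @ wq                               # (N, C)
    keys = (feats @ wk)[idx]                     # (N, k, C)
    vals = (feats @ wv)[idx]                     # (N, k, C)
    attn = softmax((q[:, None, :] * keys).sum(-1) / np.sqrt(c))  # (N, k)
    return feats + (attn[..., None] * vals).sum(1)               # residual

def farthest_point_sampling(points, m):
    # DS sketch (assumed method): greedily keep m mutually distant points.
    chosen = [0]
    dist = np.full(points.shape[0], np.inf)
    for _ in range(m - 1):
        dist = np.minimum(dist, ((points - points[chosen[-1]]) ** 2).sum(-1))
        chosen.append(int(dist.argmax()))
    return np.array(chosen)

def global_transformer(feats, rng):
    # GT sketch: every remaining point attends to every other point.
    c = feats.shape[1]
    wq, wk, wv = random_projections(c, rng)
    attn = softmax((feats @ wq) @ (feats @ wk).T / np.sqrt(c))   # (M, M)
    return feats + attn @ (feats @ wv)                           # residual

def multiscale_features(points, feats, k=8, m=32, seed=0):
    # LT -> DS -> GT, then fuse local and global features by concatenation.
    rng = np.random.default_rng(seed)
    local = local_transformer(points, feats, k, rng)   # (N, C)
    keep = farthest_point_sampling(points, m)          # (M,)
    glob = global_transformer(local[keep], rng)        # (M, C)
    return np.concatenate([local[keep], glob], axis=1) # (M, 2C)
```

Calling `multiscale_features(points, feats)` with an `(N, 3)` coordinate array and an `(N, C)` feature array returns an `(m, 2C)` array in which each sampled point carries both its local-context feature and its globally attended feature. A pose head would then regress rotation and translation from these fused features; that part is omitted because the abstract gives no detail on it.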
Pages: 11
Related papers
50 in total
  • [31] An Efficient Global Point Cloud Descriptor for Object Recognition and Pose Estimation
    Silva do Monte Lima, Joao Paulo
    Teichrieb, Veronica
    2016 29TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI), 2016, : 56 - 63
  • [32] HFE-Net: hierarchical feature extraction and coordinate conversion of point cloud for object 6D pose estimation
    Shen, Ze
    Chu, Hao
    Wang, Fei
    Guo, Yi
    Liu, Shangdong
    Han, Shuai
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (06): : 3167 - 3178
  • [33] Robotic grasping method with 6D pose estimation and point cloud fusion
    Ma, Haofei
    Wang, Gongcheng
    Bai, Hua
    Xia, Zhiyu
    Wang, Weidong
    Du, Zhijiang
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2024, : 5603 - 5613
  • [34] HFE-Net: hierarchical feature extraction and coordinate conversion of point cloud for object 6D pose estimation
    Ze Shen
    Hao Chu
    Fei Wang
    Yi Guo
    Shangdong Liu
    Shuai Han
    Neural Computing and Applications, 2024, 36 : 3167 - 3178
  • [35] Match Normalization: Learning-Based Point Cloud Registration for 6D Object Pose Estimation in the Real World
    Dang, Zheng
    Wang, Lizhou
    Guo, Yu
    Salzmann, Mathieu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (06) : 4489 - 4503
  • [36] SE(3) Diffusion Model-based Point Cloud Registration for Robust 6D Object Pose Estimation
    Jiang, Haobo
    Salzmann, Mathieu
    Dang, Zheng
    Xie, Jin
    Yang, Jian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [37] PoseRBPF: A Rao-Blackwellized Particle Filter for 6-D Object Pose Tracking
    Deng, Xinke
    Mousavian, Arsalan
    Xiang, Yu
    Xia, Fei
    Bretl, Timothy
    Fox, Dieter
    IEEE TRANSACTIONS ON ROBOTICS, 2021, 37 (05) : 1328 - 1342
  • [38] YOLOPose: Transformer-Based Multi-object 6D Pose Estimation Using Keypoint Regression
    Amini, Arash
    Periyasamy, Arul Selvam
    Behnke, Sven
    INTELLIGENT AUTONOMOUS SYSTEMS 17, IAS-17, 2023, 577 : 392 - 406
  • [39] SS-Pose: Self-Supervised 6-D Object Pose Representation Learning Without Rendering
    Mu, Fengjun
    Huang, Rui
    Zhang, Jingting
    Zou, Chaobin
    Shi, Kecheng
    Sun, Shixiang
    Zhan, Huayi
    Zhao, Pengbo
    Qiu, Jing
    Cheng, Hong
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, : 13665 - 13675
  • [40] WS-OPE: Weakly Supervised 6-D Object Pose Regression Using Relative Multi-Camera Pose Constraints
    Li, Fu
    Shugurov, Ivan
    Busam, Benjamin
    Yang, Shaowu
    Ilic, Slobodan
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 3703 - 3710