Test-time adaptation for 6D pose tracking

Cited by: 1
Authors
Tian, Long [1]
Oh, Changjae [1]
Cavallaro, Andrea [1, 2, 3]
Affiliations
[1] Queen Mary Univ London, Ctr Intelligent Sensing, London, England
[2] Idiap Res Inst, Martigny, Switzerland
[3] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
Keywords
6D pose tracking; Keypoints detection; Self-supervised learning; Transformer;
DOI
10.1016/j.patcog.2024.110390
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We propose a test-time adaptation method for 6D object pose tracking that adapts a pre-trained model to track the 6D pose of novel objects. We formulate 6D object pose tracking as a 3D keypoint detection and matching task and present a model that extracts 3D keypoints. Given an RGB-D image and the mask of the target object for each frame, the proposed model uses self- and cross-attention modules to produce features that aggregate information within and across frames, respectively. From the keypoints detected in these features for each frame, we estimate the pose change between two frames, which enables 6D pose tracking when the 6D pose of the target object in the initial frame is given. Our model is first trained in a source domain, a category-level tracking dataset in which the ground-truth 6D pose of the object is available. To deploy this pre-trained model on novel objects, we present a test-time adaptation strategy that adapts the model to the target novel object through self-supervised learning. Given an RGB-D video sequence of the novel object, the proposed self-supervised losses encourage the model to estimate 6D pose changes that preserve the photometric and geometric consistency of the object. We validate our method on the NOCS-REAL275 dataset and on our own collected dataset, and the results show the advantages of the method for tracking novel objects. The collected dataset and visualisations of the tracking results are available at https://qm-ipalab.github.io/TA-6DT/
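The step from matched 3D keypoints to a tracked pose can be illustrated with a small sketch. The abstract does not say which solver the authors use, so the code below assumes a standard least-squares rigid alignment (the Kabsch algorithm) and synthetic, noise-free keypoints. All function and variable names (`estimate_rigid_transform`, `to_homogeneous`, `kp_prev`, `kp_curr`) are hypothetical and chosen for illustration only.

```python
import numpy as np


def estimate_rigid_transform(src, dst):
    """Least-squares rigid transform (R, t) such that dst ~= src @ R.T + t.

    Assumed solver (Kabsch algorithm); the paper's own solver may differ.
    """
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    src_c = src.mean(axis=0)
    dst_c = dst.mean(axis=0)
    # 3x3 cross-covariance of the centred keypoint sets.
    H = (src - src_c).T @ (dst - dst_c)
    U, _, Vt = np.linalg.svd(H)
    # Flip the last axis if needed so that R is a proper rotation (det = +1).
    d = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0 else -1.0
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = dst_c - R @ src_c
    return R, t


def to_homogeneous(R, t):
    """Pack a rotation and a translation into a 4x4 pose matrix."""
    T = np.eye(4)
    T[:3, :3] = R
    T[:3, 3] = t
    return T


def transform(T, pts):
    """Apply a 4x4 pose to an (N, 3) array of points."""
    return pts @ T[:3, :3].T + T[:3, 3]


def rot_z(theta):
    """Rotation about the z-axis, used only to build the synthetic poses."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])


# Synthetic example: 8 keypoints in the object frame and two known poses.
rng = np.random.default_rng(0)
keypoints_obj = rng.normal(size=(8, 3))
pose_prev = to_homogeneous(rot_z(0.3), np.array([0.10, -0.20, 0.80]))
pose_true = to_homogeneous(rot_z(0.5), np.array([0.15, -0.10, 0.90]))

# Matched keypoints as they would appear in the camera frame of each image.
kp_prev = transform(pose_prev, keypoints_obj)
kp_curr = transform(pose_true, keypoints_obj)

# Pose change between the two frames, then chaining onto the previous pose.
R_delta, t_delta = estimate_rigid_transform(kp_prev, kp_curr)
pose_est = to_homogeneous(R_delta, t_delta) @ pose_prev

# Geometric residual: how well the estimated change aligns the keypoints.
residual = np.linalg.norm(kp_prev @ R_delta.T + t_delta - kp_curr, axis=1).mean()
```

Because the relative transform maps camera-frame keypoints of the previous frame onto those of the current frame, the current pose is obtained by left-multiplying the previous pose by that transform. Repeating this from a known initial pose gives a tracked trajectory. A residual like the one computed at the end is one plausible form of geometric consistency, though the paper's actual loss definitions are not given in the abstract.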
Pages: 11
Related papers
50 in total
  • [21] DomainAdaptor: A Novel Approach to Test-time Adaptation
    Zhang, Jian
    Qi, Lei
    Shi, Yinghuan
    Gao, Yang
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18925 - 18935
  • [22] Video Test-Time Adaptation for Action Recognition
    Lin, Wei
    Mirza, Muhammad Jehanzeb
    Kozinski, Mateusz
    Possegger, Horst
    Kuehne, Hilde
    Bischof, Horst
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 22952 - 22961
  • [23] Test-Time Adaptation for Egocentric Action Recognition
    Planamente, Mirco
    Plizzari, Chiara
    Caputo, Barbara
    [J]. IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT III, 2022, 13233 : 206 - 218
  • [24] Improved Test-Time Adaptation for Domain Generalization
    Chen, Liang
    Zhang, Yong
    Song, Yibing
    Shan, Ying
    Liu, Lingqiao
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 24172 - 24182
  • [25] HFF6D: Hierarchical Feature Fusion Network for Robust 6D Object Pose Tracking
    Liu, Jian
    Sun, Wei
    Liu, Chongpei
    Zhang, Xing
    Fan, Shimeng
    Wu, Wei
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7719 - 7731
  • [26] Enhancing Generalizable 6D Pose Tracking of an In-Hand Object With Tactile Sensing
    Liu, Yun
    Xu, Xiaomeng
    Chen, Weihang
    Yuan, Haocheng
    Wang, He
    Xu, Jing
    Chen, Rui
    Yi, Li
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (02) : 1106 - 1113
  • [27] Shape Enhanced Keypoints Learning with Geometric Prior for 6D Object Pose Tracking
    Majcher, Mateusz
    Kwolek, Bogdan
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 2985 - 2991
  • [28] 6D Pose Uncertainty in Robotic Perception
    Feiten, Wendelin
    Atwal, Pradeep
    Eidenberger, Robert
    Grundmann, Thilo
    [J]. ADVANCES IN ROBOTICS RESEARCH, 2009, : 89 - +
  • [29] On Evaluation of 6D Object Pose Estimation
    Hodan, Tomas
    Matas, Jiri
    Obdrzalek, Stephan
    [J]. COMPUTER VISION - ECCV 2016 WORKSHOPS, PT III, 2016, 9915 : 606 - 619
  • [30] PoseRBPF: A Rao-Blackwellized Particle Filter for 6D Object Pose Tracking
    Deng, Xinke
    Mousavian, Arsalan
    Xiang, Yu
    Xia, Fei
    Bretl, Timothy
    Fox, Dieter
    [J]. ROBOTICS: SCIENCE AND SYSTEMS XV, 2019,