Test-time adaptation for 6D pose tracking

被引:1
|
作者
Tian, Long [1 ]
Oh, Changjae [1 ]
Cavallaro, Andrea [1 ,2 ,3 ]
机构
[1] Queen Mary Univ London, Ctr Intelligent Sensing, London, England
[2] Idiap Res Inst, Martigny, Switzerland
[3] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
关键词
6D pose tracking; Keypoints detection; Self-supervised learning; Transformer;
D O I
10.1016/j.patcog.2024.110390
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a test -time adaptation for 6D object pose tracking that learns to adapt a pre -trained model to track the 6D pose of novel objects. We consider the problem of 6D object pose tracking as a 3D keypoint detection and matching task and present a model that extracts 3D keypoints. Given an RGB-D image and the mask of a target object for each frame, the proposed model consists of the selfand cross -attention modules to produce the features that aggregate the information within and across frames, respectively. By using the keypoints detected from the features for each frame, we estimate the pose changes between two frames, which enables 6D pose tracking when the 6D pose of a target object in the initial frame is given. Our model is first trained in a source domain, a category -level tracking dataset where the ground truth 6D pose of the object is available. To deploy this pre -trained model to track novel objects, we present a test -time adaptation strategy that trains the model to adapt to the target novel object by self -supervised learning. Given an RGB-D video sequence of the novel object, the proposed self -supervised losses encourage the model to estimate the 6D pose changes that can keep the photometric and geometric consistency of the object. We validate our method on the NOCS-REAL275 dataset and our collected dataset, and the results show the advantages of tracking novel objects. The collected dataset and visualisation of tracking results are available: https://qm-ipalab.github.io/TA-6DT/
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Unraveling Batch Normalization for Realistic Test-Time Adaptation
    Su, Zixian
    Guo, Jingwei
    Yao, Kai
    Yang, Xi
    Wang, Qiufeng
    Huang, Kaizhu
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 13, 2024, : 15136 - 15144
  • [42] Test-Time Adaptation with Shape Moments for Image Segmentation
    Bateson, Mathilde
    Lombaert, Herve
    Ben Ayed, Ismail
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT IV, 2022, 13434 : 736 - 745
  • [43] Exploring Motion Cues for Video Test-Time Adaptation
    Zeng, Runhao
    Deng, Qi
    Xu, Huixuan
    Niu, Shuaicheng
    Chen, Jian
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1840 - 1850
  • [44] VPA: Fully Test-Time Visual Prompt Adaptation
    Sun, Jiachen
    Ibrahim, Mark
    Hall, Melissa
    Evtimov, Ivan
    Mao, Z. Morley
    Ferrer, Cristian Canton
    Hazirbas, Caner
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5796 - 5806
  • [45] Single Shot 6D Object Pose Estimation
    Kleeberger, Kilian
    Huber, Marco F.
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 6239 - 6245
  • [46] DPOD: 6D Pose Object Detector and Refiner
    Zakharov, Sergey
    Shugurov, Ivan
    Ilic, Slobodan
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1941 - 1950
  • [47] BOP: Benchmark for 6D Object Pose Estimation
    Hodan, Tomas
    Michel, Frank
    Brachmann, Eric
    Kehl, Wadim
    Buch, Anders Glent
    Kraft, Dirk
    Drost, Bertram
    Vidal, Joel
    Ihrke, Stephan
    Zabulis, Xenophon
    Sahin, Caner
    Manhardt, Fabian
    Tombari, Federico
    Kim, Tae-Kyun
    Matas, Jiri
    Rother, Carsten
    [J]. COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 19 - 35
  • [48] A Novel Distribution for Representation of 6D Pose Uncertainty
    Zhang, Lei
    Shang, Huiliang
    Lin, Yandan
    [J]. MICROMACHINES, 2022, 13 (01)
  • [49] Survey on 6D Pose Estimation of Rigid Object
    Chen, Jiale
    Zhang, Lijun
    Liu, Yi
    Xu, Chi
    [J]. PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7440 - 7445
  • [50] Orientation Keypoints for 6D Human Pose Estimation
    Fisch, Martin
    Clark, Ronald
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 10145 - 10158