The impact of domain randomization on cross-device monocular deep 6DoF detection

被引:0
|
作者
da Cunha, Kelvin B. [1 ]
Brito, Caio [1 ,2 ]
Valenca, Lucas [3 ]
Figueiredo, Lucas [1 ,5 ]
Simoes, Francisco [1 ,4 ]
Teichrieb, Veronica [1 ]
机构
[1] Univ Fed Pernambuco, Ctr Informat, Voxar Labs, BR-50740560 Recife, PE, Brazil
[2] Univ Montreal, LIGUM, 2900 Boul Edouard Montpetit, Montreal, PQ H3T 1J4, Canada
[3] Univ Laval, 1065 Ave Med, Laval, PQ 1065, Canada
[4] Univ Fed Rural Pernambuco, Dept Comp, BR-52171900 Recife, PE, Brazil
[5] Univ Fed Rural Pernambuco, Unidade Acad Belo Jardim, BR-55156580 Belo Jardim, PE, Brazil
关键词
6DoF pose estimation; Domain randomization; Deep learning; Cross-device;
D O I
10.1016/j.patrec.2022.04.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work evaluates the use of synthetic data to train deep 6DoF pose estimation models that use a monocular RGB camera as input. We have compared different training strategies combining real and synthetic data (with domain randomization) to investigate how to better handle real-world challenges. We show that it is possible to obtain accurate models using less real data and suggest how to utilize this strategy. In this work, we have captured and made available two datasets: one real and one synthetic, totaling over 110,0 0 0 annotated frames. These datasets are organized according to the different cameras used and the challenges present in the sequences, all featuring textureless 3D printed objects. We also show that synthetic data can help models generalize, handling challenges such as fast motion, occlusion, illumination changes, color variation, scale changes, and unexpected geometry. Finally, we evaluated 70 different models to understand how a model trained for one camera sensor performs when used with a different sensor. To this end, we also suggest how to handle this challenge better by using synthetic simulations to supplement training.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:224 / 231
页数:8
相关论文
共 50 条
  • [1] The impact of domain randomization on cross-device monocular deep 6DoF detection
    da Cunha, Kelvin B.
    Brito, Caio
    Valença, Lucas
    Figueiredo, Lucas
    Simões, Francisco
    Teichrieb, Veronica
    [J]. Pattern Recognition Letters, 2022, 159 : 224 - 231
  • [2] A Study on the Impact of Domain Randomization for Monocular Deep 6DoF Pose Estimation
    da Cunha, Kelvin B.
    Brito, Caio
    Valenca, Luas
    Simoes, Francisco
    Teichrieb, Veronica
    [J]. 2020 33RD SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2020), 2020, : 332 - 339
  • [3] Squeezed Deep 6DoF Object Detection using Knowledge Distillation
    Felix, Heitor
    Rodrigues, Walber M.
    Macedo, David
    Simoes, Francisco
    Oliveira, Adriano L., I
    Teichrieb, Veronica
    Zanchettin, Cleber
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [4] A Deep Learning Method for Lightweight and Cross-Device IoT Botnet Detection
    Catillo, Marta
    Pecchia, Antonio
    Villano, Umberto
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (02):
  • [5] Unsupervised Domain Adaptation for 6DOF Indoor Localization
    Di Mauro, Daniele
    Furnari, Antonino
    Signorello, Giovanni
    Farinella, Giovanni
    [J]. VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 954 - 961
  • [6] Consolidated domain adaptive detection and localization framework for cross-device colonoscopic images
    Liu, Xinyu
    Guo, Xiaoqing
    Liu, Yajie
    Yuan, Yixuan
    [J]. MEDICAL IMAGE ANALYSIS, 2021, 71
  • [7] A PHANTOM® device with 6dof force feedback and sensing capabilities
    Kheddar, Abderrahmane
    Gourishankar, Venkatraghavan
    Evrard, Paul
    [J]. HAPTICS: PERCEPTION, DEVICES AND SCENARIOS, PROCEEDINGS, 2008, 5024 : 146 - +
  • [8] 6DoF Catheter Detection, Application to Intracardiac Echocardiography
    Ralovich, Kristof
    John, Matthias
    Camus, Estelle
    Navab, Nassir
    Heimann, Tobias
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2014, PT II, 2014, 8674 : 635 - 642
  • [9] 6DoF Tracking in Virtual Reality by Deep RNN Model
    Chang, Yun-Kai
    Chen, Mai-Keh
    Li, Yun-Lun
    Li, Hao-Ting
    Chiang, Chen-Kuo
    [J]. 2020 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2020), 2021, : 193 - 196
  • [10] End-to-End 6DoF Pose Estimation From Monocular RGB Images
    Zou, Wenbin
    Wu, Di
    Tian, Shishun
    Xiang, Canqun
    Li, Xia
    Zhang, Lu
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2021, 67 (01) : 87 - 96