D-SPDH: Improving 3D Robot Pose Estimation in Sim2Real Scenario via Depth Data

被引:0
|
作者
Simoni, Alessandro [1 ]
Borghi, Guido [2 ]
Garattoni, Lorenzo [3 ]
Francesca, Gianpiero [3 ]
Vezzani, Roberto [1 ]
机构
[1] Univ Modena & Reggio Emilia, Dept Engn Enzo Ferrari, I-42122 Reggio Emilia, Italy
[2] Univ Modena & Reggio Emilia, Dipartimento Educ & Sci Umane, I-42122 Reggio Emilia, Italy
[3] Toyota Motor Europe, B-1130 Brussels, Belgium
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Robots; Three-dimensional displays; Robot kinematics; Cameras; Robot vision systems; Pose estimation; Synthetic data; Training; Rendering (computer graphics); Deep learning; Human-machine systems; Computer vision; Human-machine interaction; human-robot interaction; collaborative robots (Cobots); robot pose estimation; deep learning; computer vision; depth maps;
D O I
10.1109/ACCESS.2024.3492812
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, there has been a notable surge in the significance attributed to technologies facilitating secure and efficient cohabitation and collaboration between humans and machines, with a particular interest in robotic systems. A pivotal element in actualizing this novel and challenging collaborative paradigm involves different technical tasks, including the comprehension of 3D poses exhibited by both humans and robots through the utilization of non-intrusive systems, such as cameras. In this scenario, the availability of vision-based systems capable of detecting in real-time the robot's pose is needed as a first step towards a safe and effective interaction to, for instance, avoid collisions. Therefore, in this work, we propose a vision-based system, referred to as D-SPDH, able to estimate the 3D robot pose. The system is based on double-branch architecture and depth data as a single input; any additional information regarding the state of the internal encoders of the robot is not required. The working scenario is the Sim2Real, i.e., the system is trained only with synthetic data and then tested on real sequences, thus eliminating the time-consuming acquisition and annotation procedures of real data, common phases in deep learning algorithms. Moreover, we introduce SimBa++, a dataset featuring both synthetic and real sequences with new real-world double-arm movements, and that represents a challenging setting in which the proposed approach is tested. Experimental results show that our D-SPDH method achieves state-of-the-art and real-time performance, paving the way a possible future non-invasive systems to monitor human-robot interactions.
引用
收藏
页码:166660 / 166673
页数:14
相关论文
共 50 条
  • [31] Correspondence-free pose estimation for 3D objects from noisy depth data
    Xenophon Zabulis
    Manolis I. A. Lourakis
    Panagiotis Koutlemanis
    The Visual Computer, 2018, 34 : 193 - 211
  • [32] Correspondence-free pose estimation for 3D objects from noisy depth data
    Zabulis, Xenophon
    Lourakis, Manolis I. A.
    Koutlemanis, Panagiotis
    VISUAL COMPUTER, 2018, 34 (02): : 193 - 211
  • [33] Monocular 3D Human Pose Estimation by Predicting Depth on Joints
    Nie, Bruce Xiaohan
    Wei, Ping
    Zhu, Song-Chun
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 3467 - 3475
  • [34] 3D Hand Pose Estimation from RGB Using Privileged Learning with Depth Data
    Yuan, Shanxin
    Stenger, Bjorn
    Kim, Tae-Kyun
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2866 - 2873
  • [35] Hand-eye 3D Pose Estimation for a Drawing Robot
    Sultan, Malik Saad
    Chen, Xiaopeng
    Ma, Gan
    Xue, Jingtao
    Ni, Wencheng
    Zhang, Tongtong
    Zhang, Wen
    2013 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2013, : 1325 - 1331
  • [36] Mobile robot control using 3D hand pose estimation
    Hoshino, Kiyoshi
    Kasahara, Takuya
    Igo, Naoki
    Tomida, Motomasa
    Tanimoto, Takanobu
    Mukai, Toshimitsu
    Brossard, Gilles
    Kotani, Hajime
    TENTH INTERNATIONAL CONFERENCE ON QUALITY CONTROL BY ARTIFICIAL VISION, 2011, 8000
  • [37] 3D Pose and Target Position Estimation for a Quadruped Walking Robot
    Cho, Kuk
    Baeg, SeungHo
    Park, Sangdeok
    2013 10TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2013, : 466 - +
  • [38] Improving Robustness and Accuracy via Relative Information Encoding in 3D Human Pose Estimation
    Shan, Wenkang
    Lu, Haopeng
    Wang, Shanshe
    Zhang, Xinfeng
    Gao, Wen
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3446 - 3454
  • [39] 3D Human Pose Estimation via Intuitive Physics
    Tripathi, Shashank
    Mueller, Lea
    Huang, Chun-Hao P.
    Taheri, Omid
    Black, Michael J.
    Tzionas, Dimitrios
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 4713 - 4725
  • [40] Generic 3D Representation via Pose Estimation and Matching
    Zamir, Amir R.
    Wekel, Tilman
    Agrawal, Pulkit
    Wei, Colin
    Malik, Jitendra
    Savarese, Silvio
    COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 : 535 - 553