D-SPDH: Improving 3D Robot Pose Estimation in Sim2Real Scenario via Depth Data

被引：0

作者：

Simoni, Alessandro ^{[1
]}

Borghi, Guido ^{[2
]}

Garattoni, Lorenzo ^{[3
]}

Francesca, Gianpiero ^{[3
]}

Vezzani, Roberto ^{[1
]}

机构：

[1] Univ Modena & Reggio Emilia, Dept Engn Enzo Ferrari, I-42122 Reggio Emilia, Italy

[2] Univ Modena & Reggio Emilia, Dipartimento Educ & Sci Umane, I-42122 Reggio Emilia, Italy

[3] Toyota Motor Europe, B-1130 Brussels, Belgium

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Robots; Three-dimensional displays; Robot kinematics; Cameras; Robot vision systems; Pose estimation; Synthetic data; Training; Rendering (computer graphics); Deep learning; Human-machine systems; Computer vision; Human-machine interaction; human-robot interaction; collaborative robots (Cobots); robot pose estimation; deep learning; computer vision; depth maps;

D O I：

10.1109/ACCESS.2024.3492812

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In recent years, there has been a notable surge in the significance attributed to technologies facilitating secure and efficient cohabitation and collaboration between humans and machines, with a particular interest in robotic systems. A pivotal element in actualizing this novel and challenging collaborative paradigm involves different technical tasks, including the comprehension of 3D poses exhibited by both humans and robots through the utilization of non-intrusive systems, such as cameras. In this scenario, the availability of vision-based systems capable of detecting in real-time the robot's pose is needed as a first step towards a safe and effective interaction to, for instance, avoid collisions. Therefore, in this work, we propose a vision-based system, referred to as D-SPDH, able to estimate the 3D robot pose. The system is based on double-branch architecture and depth data as a single input; any additional information regarding the state of the internal encoders of the robot is not required. The working scenario is the Sim2Real, i.e., the system is trained only with synthetic data and then tested on real sequences, thus eliminating the time-consuming acquisition and annotation procedures of real data, common phases in deep learning algorithms. Moreover, we introduce SimBa++, a dataset featuring both synthetic and real sequences with new real-world double-arm movements, and that represents a challenging setting in which the proposed approach is tested. Experimental results show that our D-SPDH method achieves state-of-the-art and real-time performance, paving the way a possible future non-invasive systems to monitor human-robot interactions.

引用

页码：166660 / 166673

页数：14

共 50 条

[31] Correspondence-free pose estimation for 3D objects from noisy depth data
Xenophon Zabulis
Manolis I. A. Lourakis
Panagiotis Koutlemanis
The Visual Computer, 2018, 34 : 193 - 211
[32] Correspondence-free pose estimation for 3D objects from noisy depth data
Zabulis, Xenophon
Lourakis, Manolis I. A.
Koutlemanis, Panagiotis
VISUAL COMPUTER, 2018, 34 (02): : 193 - 211
[33] Monocular 3D Human Pose Estimation by Predicting Depth on Joints
Nie, Bruce Xiaohan
Wei, Ping
Zhu, Song-Chun
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 3467 - 3475
[34] 3D Hand Pose Estimation from RGB Using Privileged Learning with Depth Data
Yuan, Shanxin
Stenger, Bjorn
Kim, Tae-Kyun
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2866 - 2873
[35] Hand-eye 3D Pose Estimation for a Drawing Robot
Sultan, Malik Saad
Chen, Xiaopeng
Ma, Gan
Xue, Jingtao
Ni, Wencheng
Zhang, Tongtong
Zhang, Wen
2013 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2013, : 1325 - 1331
[36] Mobile robot control using 3D hand pose estimation
Hoshino, Kiyoshi
Kasahara, Takuya
Igo, Naoki
Tomida, Motomasa
Tanimoto, Takanobu
Mukai, Toshimitsu
Brossard, Gilles
Kotani, Hajime
TENTH INTERNATIONAL CONFERENCE ON QUALITY CONTROL BY ARTIFICIAL VISION, 2011, 8000
[37] 3D Pose and Target Position Estimation for a Quadruped Walking Robot
Cho, Kuk
Baeg, SeungHo
Park, Sangdeok
2013 10TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2013, : 466 - +
[38] Improving Robustness and Accuracy via Relative Information Encoding in 3D Human Pose Estimation
Shan, Wenkang
Lu, Haopeng
Wang, Shanshe
Zhang, Xinfeng
Gao, Wen
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3446 - 3454
[39] 3D Human Pose Estimation via Intuitive Physics
Tripathi, Shashank
Mueller, Lea
Huang, Chun-Hao P.
Taheri, Omid
Black, Michael J.
Tzionas, Dimitrios
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 4713 - 4725
[40] Generic 3D Representation via Pose Estimation and Matching
Zamir, Amir R.
Wekel, Tilman
Agrawal, Pulkit
Wei, Colin
Malik, Jitendra
Savarese, Silvio
COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 : 535 - 553

← 1 2 3 4 5 →