Multi-modal 3D Human Pose Estimation for Human-Robot Collaborative Applications

被引：2

作者：

Peppas, Konstantinos ^{[1
]}

Tsiolis, Konstantinos ^{[1
]}

Mariolis, Ioannis ^{[1
]}

Topalidou-Kyniazopoulou, Angeliki ^{[1
]}

Tzovaras, Dimitrios ^{[1
]}

机构：

[1] Ctr Res & Technol Hellas CERTH, Inst Informat Technol, 6th Km Charilaou Thermi Rd, Thessaloniki, Greece

来源：

STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2020 | 2021年 / 12644卷

关键词：

Multi-modal learning; 3D human pose estimation; Collaborative tasks; Deep learning; CNN;

D O I：

10.1007/978-3-030-73973-7_34

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a multi-modal 3D human pose estimation approach which combines a 2D human pose estimation network utilizing RGB data with a 3D human pose estimation network utilizing the 2D pose estimation results and depth information, in order to predict 3D human poses. We improve upon the state-of-the-art by proposing the use of a more accurate 2D human pose estimation network, as well as by introducing squeeze-excite blocks into the architecture of the 3D pose estimation network. More importantly, we focused on the challenging application of 3D human pose estimation during collaborative tasks. In that direction, we selected appropriate sub-sets that address collaborative tasks from a large-scale multi-view RGB-D dataset and generated a novel one-view RGB-D dataset for training and testing respectively. We achieved above state-of-the-art performance among RGB-D approaches when tested on a novel benchmark RGB-D dataset on collaborative assembly that we have created and made publicly available.

引用

页码：355 / 364

页数：10

共 50 条

[11] Human-robot dialogue annotation for multi-modal common ground
Bonial, Claire
Lukin, Stephanie M.
Abrams, Mitchell
Baker, Anthony
Donatelli, Lucia
Foots, Ashley
Hayes, Cory J.
Henry, Cassidy
Hudson, Taylor
Marge, Matthew
Pollard, Kimberly A.
Artstein, Ron
Traum, David
Voss, Clare R.
LANGUAGE RESOURCES AND EVALUATION, 2024,
[12] Continuous Multi-Modal Interaction Causes Human-Robot Alignment
Wallkotter, Sebastian
Joannou, Michael
Westlake, Samuel
Belphaeme, Tony
PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON HUMAN AGENT INTERACTION (HAI'17), 2017, : 375 - 379
[13] mRI: Multi-modal 3D Human Pose Estimation Dataset using mmWave, RGB-D, and Inertial Sensors
An, Sizhe
Li, Yin
Ogras, Umit
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[14] On pose estimation for human-robot symbiosis
Bhuiyan, Md. Al-Amin
Chang, Hong Liu
Ueno, Haruki
International Journal of Advanced Robotic Systems, 2008, 5 (01) : 19 - 30
[15] On Pose Estimation for Human-Robot Symbiosis
Bhulyan, Md. Al-Amin
Liu, Chang Hong
Ueno, Haruki
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2008, 5 (01): : 19 - 30
[16] HUM3DIL: Semi-supervised Multi-modal 3D Human Pose Estimation for Autonomous Driving
Zanfir, Andrei
Zanfir, Mihai
Gorban, Alexander
Ji, Jingwei
Zhou, Yin
Anguelov, Dragomir
Sminchisescu, Cristian
CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1114 - 1124
[17] 3D Hand and Object Pose Estimation for Real-time Human-robot Interaction
Bandi, Chaitanya
Kisner, Hannes
Thomas, Urike
PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 4, 2022, : 770 - 780
[18] A Multi-Modal and Collaborative Human–Machine Interface for a Walking Robot
J. Estremera
E. Garcia
P. Gonzalez de Santos
Journal of Intelligent and Robotic Systems, 2002, 35 : 397 - 425
[19] Detecting and tracking of 3D face pose for human-robot interaction
Dornaika, Fadi
Raducanu, Bogdan
2008 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-9, 2008, : 1716 - +
[20] Design of Robot Teaching Assistants Through Multi-modal Human-Robot Interactions
Ferrarelli, Paola
Lazaro, Maria T.
Iocchi, Luca
ROBOTICS IN EDUCATION: LATEST RESULTS AND DEVELOPMENTS, 2018, 630 : 274 - 286

← 1 2 3 4 5 →