Multi-modal 3D Human Pose Estimation for Human-Robot Collaborative Applications

Cited by: 2
Authors
Peppas, Konstantinos [1]
Tsiolis, Konstantinos [1]
Mariolis, Ioannis [1]
Topalidou-Kyniazopoulou, Angeliki [1]
Tzovaras, Dimitrios [1]
Affiliations
[1] Ctr Res & Technol Hellas CERTH, Inst Informat Technol, 6th Km Charilaou Thermi Rd, Thessaloniki, Greece
Keywords
Multi-modal learning; 3D human pose estimation; Collaborative tasks; Deep learning; CNN;
DOI
10.1007/978-3-030-73973-7_34
Chinese Library Classification
TP18 [Theory of artificial intelligence];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We propose a multi-modal 3D human pose estimation approach which combines a 2D human pose estimation network utilizing RGB data with a 3D human pose estimation network utilizing the 2D pose estimation results and depth information, in order to predict 3D human poses. We improve upon the state-of-the-art by proposing the use of a more accurate 2D human pose estimation network, as well as by introducing squeeze-excite blocks into the architecture of the 3D pose estimation network. More importantly, we focused on the challenging application of 3D human pose estimation during collaborative tasks. In that direction, we selected appropriate sub-sets that address collaborative tasks from a large-scale multi-view RGB-D dataset and generated a novel one-view RGB-D dataset for training and testing respectively. We achieved above state-of-the-art performance among RGB-D approaches when tested on a novel benchmark RGB-D dataset on collaborative assembly that we have created and made publicly available.
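The abstract mentions squeeze-excite blocks without describing them. As a rough illustration only, the sketch below shows the general squeeze-and-excitation idea (global average pooling, a small bottleneck, a sigmoid gate per channel) in plain Python. The function name `squeeze_excite`, the weight matrices `w1` and `w2`, and the omission of biases are assumptions made for this example; it is not the authors' network or code.

```python
import math

def squeeze_excite(features, w1, w2):
    """Recalibrate the channels of a feature map given as [C][H][W] nested lists.

    Hypothetical helper: w1 has shape [C_reduced][C], w2 has shape [C][C_reduced];
    biases are omitted for brevity.
    """
    # Squeeze: global average pooling gives one descriptor per channel.
    squeezed = [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in features
    ]
    # Excite, step 1: bottleneck fully connected layer followed by ReLU.
    hidden = [max(0.0, sum(w * s for w, s in zip(row, squeezed))) for row in w1]
    # Excite, step 2: expand back to C channels and apply a sigmoid gate.
    gates = [
        1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
        for row in w2
    ]
    # Scale: multiply every activation in a channel by that channel's gate.
    scaled = [
        [[value * gate for value in row] for row in channel]
        for channel, gate in zip(features, gates)
    ]
    return scaled, gates


# Toy usage: two 2x2 channels, a bottleneck of width 1.
features = [[[1.0, 1.0], [1.0, 1.0]], [[2.0, 2.0], [2.0, 2.0]]]
scaled, gates = squeeze_excite(features, w1=[[1.0, 1.0]], w2=[[0.0], [1.0]])
```

In this toy call the first channel receives a neutral gate of 0.5 because its expansion weight is zero, while the second channel is gated by the sigmoid of the bottleneck activation. In a trained network the weights would be learned, so that informative channels are emphasized and others suppressed.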
Pages: 355-364
Page count: 10
Related papers
50 results in total
  • [11] Human-robot dialogue annotation for multi-modal common ground
    Bonial, Claire
    Lukin, Stephanie M.
    Abrams, Mitchell
    Baker, Anthony
    Donatelli, Lucia
    Foots, Ashley
    Hayes, Cory J.
    Henry, Cassidy
    Hudson, Taylor
    Marge, Matthew
    Pollard, Kimberly A.
    Artstein, Ron
    Traum, David
    Voss, Clare R.
    LANGUAGE RESOURCES AND EVALUATION, 2024,
  • [12] Continuous Multi-Modal Interaction Causes Human-Robot Alignment
    Wallkotter, Sebastian
    Joannou, Michael
    Westlake, Samuel
    Belpaeme, Tony
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON HUMAN AGENT INTERACTION (HAI'17), 2017, : 375 - 379
  • [13] mRI: Multi-modal 3D Human Pose Estimation Dataset using mmWave, RGB-D, and Inertial Sensors
    An, Sizhe
    Li, Yin
    Ogras, Umit
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [14] On pose estimation for human-robot symbiosis
    Bhuiyan, Md. Al-Amin
    Liu, Chang Hong
    Ueno, Haruki
    International Journal of Advanced Robotic Systems, 2008, 5 (01) : 19 - 30
  • [15] On Pose Estimation for Human-Robot Symbiosis
    Bhuiyan, Md. Al-Amin
    Liu, Chang Hong
    Ueno, Haruki
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2008, 5 (01) : 19 - 30
  • [16] HUM3DIL: Semi-supervised Multi-modal 3D Human Pose Estimation for Autonomous Driving
    Zanfir, Andrei
    Zanfir, Mihai
    Gorban, Alexander
    Ji, Jingwei
    Zhou, Yin
    Anguelov, Dragomir
    Sminchisescu, Cristian
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1114 - 1124
  • [17] 3D Hand and Object Pose Estimation for Real-time Human-robot Interaction
    Bandi, Chaitanya
    Kisner, Hannes
    Thomas, Ulrike
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 4, 2022, : 770 - 780
  • [18] A Multi-Modal and Collaborative Human–Machine Interface for a Walking Robot
    J. Estremera
    E. Garcia
    P. Gonzalez de Santos
    Journal of Intelligent and Robotic Systems, 2002, 35 : 397 - 425
  • [19] Detecting and tracking of 3D face pose for human-robot interaction
    Dornaika, Fadi
    Raducanu, Bogdan
    2008 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-9, 2008, : 1716 - +
  • [20] Design of Robot Teaching Assistants Through Multi-modal Human-Robot Interactions
    Ferrarelli, Paola
    Lazaro, Maria T.
    Iocchi, Luca
    ROBOTICS IN EDUCATION: LATEST RESULTS AND DEVELOPMENTS, 2018, 630 : 274 - 286