Graph-Based Hand-Object Meshes and Poses Reconstruction With Multi-Modal Input

被引:0
|
作者
Almadani, Murad [1 ,2 ]
Elhayek, Ahmed [3 ]
Malik, Jameel [4 ]
Stricker, Didier [1 ]
机构
[1] German Res Ctr Artificial Intelligence, Augmented Vis Grp, D-67663 Kaiserslautern, Germany
[2] Khalifa Univ, Dept Biomed Engn, Abu Dhabi, U Arab Emirates
[3] Univ Prince Mugrin, Medina 42241, Saudi Arabia
[4] Natl Univ Sci & Technol NUST, Sch Elect Engn & Comp Sci SEECS, Islamabad 44000, Pakistan
来源
IEEE ACCESS | 2021年 / 9卷
关键词
Three-dimensional displays; Shape; Image reconstruction; Convolution; Pose estimation; Feature extraction; Solid modeling; Hand pose estimation; hand shape estimation; hand-object interaction; graph convolution; machine learning; 3D HAND;
D O I
10.1109/ACCESS.2021.3117473
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Estimating the hand-object meshes and poses is a challenging computer vision problem with many practical applications. In this paper, we introduce a simple yet efficient hand-object reconstruction algorithm. To this end, we exploit the fact that both the poses and the meshes are graphs-based representations of the hand-object with different levels of details. This allows taking advantage of the powerful Graph Convolution networks (GCNs) to build a coarse-to-fine Graph-based hand-object reconstruction algorithm. Thus, we start by estimating a coarse graph that represents the 2D hand-object poses. Then, more details (e.g. third dimension and mesh vertices) are gradually added to the graph until it represents the dense 3D hand-object meshes. This paper also explores the problem of representing the RGBD input in different modalities (e.g. voxelized RGBD). Hence, we adopted a multi-modal representation of the input by combining 3D representation (i.e. voxelized RGBD) and 2D representation (i.e. RGB only). We include intensive experimental evaluations that measure the ability of our simple algorithm to achieve state-of-the-art accuracy on the most challenging datasets (i.e. HO-3D and FPHAB).
引用
收藏
页码:136438 / 136447
页数:10
相关论文
共 50 条
  • [1] Graph-Based Hand-Object Meshes and Poses Reconstruction with Multi-Modal Input
    Almadani, Murad
    Elhayek, Ahmed
    Malik, Jameel
    Stricker, Didier
    [J]. IEEE Access, 2021, 9 : 136438 - 136447
  • [2] Flexible Multi-modal Graph-Based Segmentation
    Sanberg, Willem P.
    Do, Luat
    de With, Peter H. N.
    [J]. ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, ACIVS 2013, 2013, 8192 : 492 - 503
  • [3] Multi-Modal Hand-Object Pose Estimation With Adaptive Fusion and Interaction Learning
    Hoang, Dinh-Cuong
    Tan, Phan Xuan
    Nguyen, Anh-Nhat
    Vu, Duy-Quang
    Vu, Van-Duc
    Nguyen, Thu-Uyen
    Hoang, Ngoc-Anh
    Phan, Khanh-Toan
    Tran, Duc-Thanh
    Nguyen, Van-Thiep
    Duong, Quang-Tri
    Ho, Ngoc-Trung
    Tran, Cong-Trinh
    Duong, Van-Hiep
    Ngo, Phuc-Quan
    [J]. IEEE ACCESS, 2024, 12 : 54339 - 54351
  • [4] HOPE-Net: A Graph-based Model for Hand-Object Pose Estimation
    Doosti, Bardia
    Naha, Shujon
    Mirbagheri, Majid
    Crandall, David J.
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6607 - 6616
  • [5] Leveraging multi-modal fusion for graph-based image annotation
    Amiri, S. Hamid
    Jamzad, Mansour
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 55 : 816 - 828
  • [6] GRAPH-BASED MULTI-MODAL SCENE DETECTION FOR MOVIE AND TELEPLAY
    Xu, Su
    Feng, Bailan
    Ding, Peng
    Xu, Bo
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 1413 - 1416
  • [7] Interaction Fusion: Real-time Reconstruction of Hand Poses and Deformable Objects in Hand-object Interactions
    Zhang, Hao
    Bo, Zi-Hao
    Yong, Jun-Hai
    Xu, Feng
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2019, 38 (04):
  • [8] Portable graph-based rumour detection against multi-modal heterophily
    Nguyen, Thanh Tam
    Ren, Zhao
    Nguyen, Thanh Toan
    Jo, Jun
    Nguyen, Quoc Viet Hung
    Yin, Hongzhi
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 284
  • [9] An automated multi-modal graph-based pipeline for mouse genetic discovery
    Fang, Zhuoqing
    Peltz, Gary
    [J]. BIOINFORMATICS, 2022, 38 (13) : 3385 - 3394
  • [10] A Graph-Based Approach to Assessing Multi-Modal Team Communication in Global Organizations
    Uflacker, Matthias
    Zeier, Alexander
    [J]. 2008 IEEE SYMPOSIUM ON ADVANCED MANAGEMENT OF INFORMATION FOR GLOBALIZED ENTERPRISES, PROCEEDINGS, 2008, : 307 - 309