Joint Hand-Object 3D Reconstruction From a Single Image With Cross-Branch Feature Fusion

被引:33
|
作者
Chen, Yujin [1 ]
Tu, Zhigang [1 ]
Kang, Di [2 ]
Chen, Ruizhi [1 ]
Bao, Linchao [2 ]
Zhang, Zhengyou [2 ]
Yuan, Junsong [3 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Peoples R China
[2] Tencent Lab, Shenzhen 518057, Peoples R China
[3] Univ Buffalo, Comp Sci & Engn Dept, Buffalo, NY 14228 USA
关键词
Three-dimensional displays; Shape; Image reconstruction; Task analysis; Solid modeling; Image coding; Pose estimation; Hand pose and shape estimation; 3D object reconstruction; hand-object interaction; POSE ESTIMATION;
D O I
10.1109/TIP.2021.3068645
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate 3D reconstruction of the hand and object shape from a hand-object image is important for understanding human-object interaction as well as human daily activities. Different from bare hand pose estimation, hand-object interaction poses a strong constraint on both the hand and its manipulated object, which suggests that hand configuration may be crucial contextual information for the object, and vice versa. However, current approaches address this task by training a two-branch network to reconstruct the hand and object separately with little communication between the two branches. In this work, we propose to consider hand and object jointly in feature space and explore the reciprocity of the two branches. We extensively investigate cross-branch feature fusion architectures with MLP or LSTM units. Among the investigated architectures, a variant with LSTM units that enhances object feature with hand feature shows the best performance gain. Moreover, we employ an auxiliary depth estimation module to augment the input RGB image with the estimated depth map, which further improves the reconstruction accuracy. Experiments conducted on public datasets demonstrate that our approach significantly outperforms existing approaches in terms of the reconstruction accuracy of objects.
引用
收藏
页码:4008 / 4021
页数:14
相关论文
共 50 条
  • [1] 3D Object Reconstruction from Hand-Object Interactions
    Tzionas, Dimitrios
    Gall, Juergen
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 729 - 737
  • [2] SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction
    Swamy, Anilkumar
    Leroy, Vincent
    Weinzaepfel, Philippe
    Baradel, Fabien
    Galaaoui, Salma
    Bregier, Romain
    Armando, Matthieu
    Franco, Jean-Sebastien
    Rogez, Gregory
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 1927 - 1936
  • [3] SHOWMe: Robust object-agnostic hand-object 3D reconstruction from RGB video
    Swamy, Anilkumar
    Leroy, Vincent
    Weinzaepfel, Philippe
    Baradel, Fabien
    Galaaoui, Salma
    Bregier, Romain
    Armando, Matthieu
    Franco, Jean-Sebastien
    Rogez, Gregory
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 247
  • [4] HandO: a hybrid 3D hand-object reconstruction model for unknown objects
    Yu, Hang
    Cheang, Chilam
    Fu, Yanwei
    Xue, Xiangyang
    [J]. MULTIMEDIA SYSTEMS, 2022, 28 (05) : 1845 - 1859
  • [5] 3D voxel reconstruction from single-view image based on cross-domain feature fusion
    Xiong, Wenjing
    Huang, Fang
    Zhang, Hao
    Jiang, Ming
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 256
  • [6] Towards Unconstrained Joint Hand-Object Reconstruction From RGB Videos
    Hasson, Yana
    Varol, Gul
    Schmid, Cordelia
    Laptev, Ivan
    [J]. 2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 659 - 668
  • [7] gSDF: Geometry-Driven Signed Distance Functions for 3D Hand-Object Reconstruction
    Chen, Zerui
    Chen, Shizhe
    Schmid, Cordelia
    Laptev, Ivan
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12890 - 12900
  • [8] 3D hand reconstruction from a single image based on biomechanical constraints
    Guiqing Li
    Zihui Wu
    Yuxin Liu
    Huiqian Zhang
    Yongwei Nie
    Aihua Mao
    [J]. The Visual Computer, 2021, 37 : 2699 - 2711
  • [9] 3D hand reconstruction from a single image based on biomechanical constraints
    Li, Guiqing
    Wu, Zihui
    Liu, Yuxin
    Zhang, Huiqian
    Nie, Yongwei
    Mao, Aihua
    [J]. VISUAL COMPUTER, 2021, 37 (9-11): : 2699 - 2711
  • [10] Application of Deep Learning to 3D Object Reconstruction From a Single Image
    Chen, Jia
    Zhang, Yu-Qi
    Song, Peng
    Wei, Yan-Tao
    Wang, Yu
    [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2019, 45 (04): : 657 - 668