Joint Hand-Object 3D Reconstruction From a Single Image With Cross-Branch Feature Fusion

被引：33

作者：

Chen, Yujin ^{[1
]}

Tu, Zhigang ^{[1
]}

Kang, Di ^{[2
]}

Chen, Ruizhi ^{[1
]}

Bao, Linchao ^{[2
]}

Zhang, Zhengyou ^{[2
]}

Yuan, Junsong ^{[3
]}

机构：

[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Peoples R China

[2] Tencent Lab, Shenzhen 518057, Peoples R China

[3] Univ Buffalo, Comp Sci & Engn Dept, Buffalo, NY 14228 USA

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2021年 / 30卷

关键词：

Three-dimensional displays; Shape; Image reconstruction; Task analysis; Solid modeling; Image coding; Pose estimation; Hand pose and shape estimation; 3D object reconstruction; hand-object interaction; POSE ESTIMATION;

D O I：

10.1109/TIP.2021.3068645

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Accurate 3D reconstruction of the hand and object shape from a hand-object image is important for understanding human-object interaction as well as human daily activities. Different from bare hand pose estimation, hand-object interaction poses a strong constraint on both the hand and its manipulated object, which suggests that hand configuration may be crucial contextual information for the object, and vice versa. However, current approaches address this task by training a two-branch network to reconstruct the hand and object separately with little communication between the two branches. In this work, we propose to consider hand and object jointly in feature space and explore the reciprocity of the two branches. We extensively investigate cross-branch feature fusion architectures with MLP or LSTM units. Among the investigated architectures, a variant with LSTM units that enhances object feature with hand feature shows the best performance gain. Moreover, we employ an auxiliary depth estimation module to augment the input RGB image with the estimated depth map, which further improves the reconstruction accuracy. Experiments conducted on public datasets demonstrate that our approach significantly outperforms existing approaches in terms of the reconstruction accuracy of objects.

引用

页码：4008 / 4021

页数：14

共 50 条

[1] 3D Object Reconstruction from Hand-Object Interactions
Tzionas, Dimitrios
Gall, Juergen
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 729 - 737
[2] SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction
Swamy, Anilkumar
Leroy, Vincent
Weinzaepfel, Philippe
Baradel, Fabien
Galaaoui, Salma
Bregier, Romain
Armando, Matthieu
Franco, Jean-Sebastien
Rogez, Gregory
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 1927 - 1936
[3] SHOWMe: Robust object-agnostic hand-object 3D reconstruction from RGB video
Swamy, Anilkumar
Leroy, Vincent
Weinzaepfel, Philippe
Baradel, Fabien
Galaaoui, Salma
Bregier, Romain
Armando, Matthieu
Franco, Jean-Sebastien
Rogez, Gregory
[J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 247
[4] HandO: a hybrid 3D hand-object reconstruction model for unknown objects
Yu, Hang
Cheang, Chilam
Fu, Yanwei
Xue, Xiangyang
[J]. MULTIMEDIA SYSTEMS, 2022, 28 (05) : 1845 - 1859
[5] 3D voxel reconstruction from single-view image based on cross-domain feature fusion
Xiong, Wenjing
Huang, Fang
Zhang, Hao
Jiang, Ming
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 256
[6] Towards Unconstrained Joint Hand-Object Reconstruction From RGB Videos
Hasson, Yana
Varol, Gul
Schmid, Cordelia
Laptev, Ivan
[J]. 2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 659 - 668
[7] gSDF: Geometry-Driven Signed Distance Functions for 3D Hand-Object Reconstruction
Chen, Zerui
Chen, Shizhe
Schmid, Cordelia
Laptev, Ivan
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12890 - 12900
[8] 3D hand reconstruction from a single image based on biomechanical constraints
Guiqing Li
Zihui Wu
Yuxin Liu
Huiqian Zhang
Yongwei Nie
Aihua Mao
[J]. The Visual Computer, 2021, 37 : 2699 - 2711
[9] 3D hand reconstruction from a single image based on biomechanical constraints
Li, Guiqing
Wu, Zihui
Liu, Yuxin
Zhang, Huiqian
Nie, Yongwei
Mao, Aihua
[J]. VISUAL COMPUTER, 2021, 37 (9-11): : 2699 - 2711
[10] Application of Deep Learning to 3D Object Reconstruction From a Single Image
Chen, Jia
Zhang, Yu-Qi
Song, Peng
Wei, Yan-Tao
Wang, Yu
[J]. Zidonghua Xuebao/Acta Automatica Sinica, 2019, 45 (04): : 657 - 668

← 1 2 3 4 5 →