Reconstructing Hand-Held Objects from Monocular Video

Cited by: 8
Authors
Huang, Di [1]
Ji, Xiaopeng [2]
He, Xingyi [2]
Sun, Jiaming [3]
He, Tong [4]
Shuai, Qing [2]
Ouyang, Wanli [1, 4]
Zhou, Xiaowei [5]
Affiliations
[1] Univ Sydney, Sydney, NSW, Australia
[2] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
[3] Image Derivat Inc, Hangzhou, Zhejiang, Peoples R China
[4] Shanghai AI Lab, Shanghai, Peoples R China
[5] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou, Zhejiang, Peoples R China
Keywords
Object reconstruction; joint hand-object reconstruction
DOI
10.1145/3550469.3555401
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents an approach that reconstructs a hand-held object from a monocular video. In contrast to many recent methods that directly predict object geometry by a trained network, the proposed approach does not require any learned prior about the object and is able to recover more accurate and detailed object geometry. The key idea is that the hand motion naturally provides multiple views of the object and the motion can be reliably estimated by a hand pose tracker. Then, the object geometry can be recovered by solving a multi-view reconstruction problem. We devise an implicit neural representation-based method to solve the reconstruction problem and address the issues of imprecise hand pose estimation, relative hand-object motion, and insufficient geometry optimization for small objects. We also provide a newly collected dataset with 3D ground truth to validate the proposed approach. The dataset and code will be released at https://dihuangdh.github.io/hhor.
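The key idea in the abstract, that hand motion supplies multiple views of the object, can be illustrated with a small sketch. It is not the authors' code: it assumes a hand tracker that outputs one 4x4 hand-to-camera rigid transform per frame and assumes the object is rigidly attached to the hand. The function names and the synthetic poses are hypothetical. Inverting each pose expresses the fixed camera in the hand's coordinate frame, which gives one virtual camera per frame for a standard multi-view reconstruction.

```python
import numpy as np

def invert_rigid(T):
    """Invert a 4x4 rigid transform [R t; 0 1] using R^T and -R^T t."""
    R, t = T[:3, :3], T[:3, 3]
    T_inv = np.eye(4)
    T_inv[:3, :3] = R.T
    T_inv[:3, 3] = -R.T @ t
    return T_inv

def cameras_in_hand_frame(hand_to_cam):
    """Turn per-frame hand-to-camera poses into camera-to-hand poses.

    If the object does not move relative to the hand (an assumption),
    these are the extrinsics of virtual cameras observing a static object.
    """
    return [invert_rigid(T) for T in hand_to_cam]

def rot_y(deg):
    """Synthetic hand pose: rotation about the y axis, 0.5 m in front of the camera."""
    a = np.deg2rad(deg)
    c, s = np.cos(a), np.sin(a)
    T = np.eye(4)
    T[:3, :3] = [[c, 0, s], [0, 1, 0], [-s, 0, c]]
    T[2, 3] = 0.5
    return T

# Hypothetical tracker output for three frames of a rotating hand.
hand_to_cam = [rot_y(d) for d in (0, 30, 60)]
virtual_cams = cameras_in_hand_frame(hand_to_cam)

# Camera centers expressed in the hand frame: they move around the object.
centers = np.array([T[:3, 3] for T in virtual_cams])
print(centers)
```

In this toy setup the virtual camera centers lie on a circle of radius 0.5 m around the hand origin, which is the multi-view configuration the abstract describes. The abstract also notes that real hand poses are imprecise and that the object can move relative to the hand, so such poses would serve only as an initialization to be refined.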
Pages: 9
    JOURNAL OF CONTINUING EDUCATION IN NURSING, 2011, 42 (02): : 55 - 56