BodyNet: Volumetric Inference of 3D Human Body Shapes

被引:238
|
作者
Varol, Gul [1 ,5 ]
Ceylan, Duygu [3 ]
Russell, Bryan [4 ]
Yang, Jimei [3 ]
Yumer, Ersin [3 ,7 ]
Laptev, Ivan [1 ,5 ]
Schmid, Cordelia [2 ,6 ]
机构
[1] Inria, Paris, France
[2] Inria, Grenoble, France
[3] Adobe Res, San Jose, CA USA
[4] Adobe Res, San Francisco, CA USA
[5] PSL Res Univ, CNRS, Inria, Ecole Normale Super, Paris, France
[6] Univ Grenoble Alpes, LJK, INPG, CNRS,Inria, Grenoble, France
[7] Argo AI, Pittsburgh, PA USA
来源
关键词
D O I
10.1007/978-3-030-01234-2_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human shape estimation is an important task for video editing, animation and fashion industry. Predicting 3D human body shape from natural images, however, is highly challenging due to factors such as variation in human bodies, clothing and viewpoint. Prior methods addressing this problem typically attempt to fit parametric body models with certain priors on pose and shape. In this work we argue for an alternative representation and propose BodyNet, a neural network for direct inference of volumetric body shape from a single image. BodyNet is an end-to-end trainable network that benefits from (i) a volumetric 3D loss, (ii) a multi-view re-projection loss, and (iii) intermediate supervision of 2D pose, 2D body part segmentation, and 3D pose. Each of them results in performance improvement as demonstrated by our experiments. To evaluate the method, we fit the SMPL model to our network output and show state-of-the-art results on the SURREAL and Unite the People datasets, outperforming recent approaches. Besides achieving stateof-the-art performance, our method also enables volumetric body-part segmentation.
引用
收藏
页码:20 / 38
页数:19
相关论文
共 50 条
  • [1] Inference of human postures by classification of 3D human body shape
    Cohen, I
    Li, HX
    [J]. IEEE INTERNATIONAL WORKSHOP ON ANALYSIS AND MODELING OF FACE AND GESTURES, 2003, : 74 - 81
  • [2] 3D ShapeNets: A Deep Representation for Volumetric Shapes
    Wu, Zhirong
    Song, Shuran
    Khosla, Aditya
    Yu, Fisher
    Zhang, Linguang
    Tang, Xiaoou
    Xiao, Jianxiong
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 1912 - 1920
  • [3] Reconstruction of 3D volumetric defects in conductors of arbitrary shapes
    Rubinacci, G
    Tamburrino, A
    Ventre, S
    Villone, F
    [J]. ELECTROMAGNETIC NONDESTRUCTIVE EVALUATION (VIII), 2004, 24 : 77 - 84
  • [4] Human perception of 3D shapes
    Pizlo, Zygmunt
    [J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS, 2007, 4673 : 1 - 12
  • [5] Computerized pattern making focus on fitting to 3D human body shapes
    Cho, Young Sook
    Tsuchiya, Keiichi
    Takatera, Masayuki
    Inui, Shigeru
    Park, Hyejun
    Shimizu, Yoshio
    [J]. INTERNATIONAL JOURNAL OF CLOTHING SCIENCE AND TECHNOLOGY, 2010, 22 (01) : 16 - 24
  • [7] 3D Body Shapes Estimation from Dressed-Human Silhouettes
    Song, Dan
    Tong, Ruofeng
    Chang, Jian
    Yang, Xiaosong
    Tang, Min
    Zhang, Jian Jun
    [J]. COMPUTER GRAPHICS FORUM, 2016, 35 (07) : 147 - 156
  • [8] Local approximation of scalar functions on 3D shapes and volumetric data
    Patane, Giuseppe
    Spagnuolo, Michela
    [J]. COMPUTERS & GRAPHICS-UK, 2012, 36 (05): : 387 - 397
  • [9] SAFARI FROM VISUAL SIGNALS: RECOVERING VOLUMETRIC 3D SHAPES
    Agudo, Antonio
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2495 - 2499
  • [10] VoxSegNet: Volumetric CNNs for Semantic Part Segmentation of 3D Shapes
    Wang, Zongji
    Lu, Feng
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (09) : 2919 - 2930