One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer

被引:26
|
作者
Lin, Jing [1 ,2 ]
Zeng, Ailing [1 ]
Wang, Haoqian [2 ]
Zhang, Lei [1 ]
Li, Yu [1 ]
机构
[1] Int Digital Econ Acad IDEA, Sehnzhen, Peoples R China
[2] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen, Peoples R China
关键词
D O I
10.1109/CVPR52729.2023.02027
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Whole-body mesh recovery aims to estimate the 3D human body, face, and hands parameters from a single image. It is challenging to perform this task with a single network due to resolution issues, i.e., the face and hands are usually located in extremely small regions. Existing works usually detect hands and faces, enlarge their resolution to feed in a specific network to predict the parameter, and finally fuse the results. While this copy-paste pipeline can capture the fine-grained details of the face and hands, the connections between different parts cannot be easily recovered in late fusion, leading to implausible 3D rotation and unnatural pose. In this work, we propose a one-stage pipeline for expressive whole-body mesh recovery, named OSX, without separate networks for each part. Specifically, we design a Component Aware Transformer (CAT) composed of a global body encoder and a local face/hand decoder. The encoder predicts the body parameters and provides a high-quality feature map for the decoder, which performs a feature-level upsample-crop scheme to extract highresolution part-specific features and adopt keypoint-guided deformable attention to estimate hand and face precisely. The whole pipeline is simple yet effective without any manual post-processing and naturally avoids implausible prediction. Comprehensive experiments demonstrate the effectiveness of OSX. Lastly, we build a large-scale Upper-Body dataset (UBody) with high-quality 2D and 3D whole-body annotations. It contains persons with partially visible bodies in diverse real-life scenarios to bridge the gap between the basic task and downstream applications.
引用
收藏
页码:21159 / 21168
页数:10
相关论文
共 50 条
  • [21] Strategies to improve 3D whole-body PET image reconstruction
    Cutler, PD
    Xu, M
    PHYSICS IN MEDICINE AND BIOLOGY, 1996, 41 (08): : 1453 - 1467
  • [22] 3D Whole-body skin imaging for automated melanoma detection
    Marchetti, M. A.
    Nazir, Z. H.
    Nanda, J. K.
    Dusza, S. W.
    D'Alessandro, B. M.
    DeFazio, J.
    Halpern, A. C.
    Rotemberg, V. M.
    Marghoob, A. A.
    JOURNAL OF THE EUROPEAN ACADEMY OF DERMATOLOGY AND VENEREOLOGY, 2023, 37 (05) : 945 - 950
  • [23] Clinical anthropometrics and body composition from 3D whole-body surface scans
    B K Ng
    B J Hinton
    B Fan
    A M Kanaya
    J A Shepherd
    European Journal of Clinical Nutrition, 2016, 70 : 1265 - 1270
  • [24] Reliability of the Styku 3D Whole-Body Scanner for the Assessment of Body Size in Athletes
    Derouchey, Joe D.
    Tomkinson, Grant R.
    Rhoades, Jesse L.
    Fitzgerald, John S.
    MEASUREMENT IN PHYSICAL EDUCATION AND EXERCISE SCIENCE, 2020, 24 (03) : 228 - 234
  • [25] Clinical anthropometrics and body composition from 3D whole-body surface scans
    Ng, B. K.
    Hinton, B. J.
    Fan, B.
    Kanaya, A. M.
    Shepherd, J. A.
    EUROPEAN JOURNAL OF CLINICAL NUTRITION, 2016, 70 (11) : 1265 - 1270
  • [26] A review of 3D human body pose estimation and mesh recovery
    Muhammad, Zaka-Ud-Din
    Huang, Zhangjin
    Khan, Rashid
    DIGITAL SIGNAL PROCESSING, 2022, 128
  • [27] Whole-body surface assessment - implementation and experiences with 360° 3D whole-body scans: opportunities to objectively monitor the extremities and the body trunk
    Etzel, Lucas
    Koban, Konstantin Christoph
    Li, Zhouxiao
    Frank, Konstantin
    Giunta, Riccardo Enzo
    Schenck, Thilo Ludwig
    HANDCHIRURGIE MIKROCHIRURGIE PLASTISCHE CHIRURGIE, 2019, 51 (04) : 240 - 248
  • [28] FEATURE AWARE 3D MESH COMPRESSION USING ROBUST PRINCIPAL COMPONENT ANALYSIS
    Lalos, Aris S.
    Arvanitis, Gerasimos
    Spathis-Papadiotis, Aristotelis
    Moustakas, Konstantinos
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
  • [29] Path aggregation one-stage anchor free 3D object detection
    Liu, Yanfei
    Li, Chao
    Ning, Kanglin
    Li, Yali
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (8) : 25085 - 25103
  • [30] Path aggregation one-stage anchor free 3D object detection
    Yanfei Liu
    Chao Li
    Kanglin Ning
    Yali Li
    Multimedia Tools and Applications, 2024, 83 : 25085 - 25103