One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer

Cited by: 26
Authors
Lin, Jing [1 ,2 ]
Zeng, Ailing [1 ]
Wang, Haoqian [2 ]
Zhang, Lei [1 ]
Li, Yu [1 ]
Affiliations
[1] Int Digital Econ Acad IDEA, Shenzhen, Peoples R China
[2] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen, Peoples R China
DOI
10.1109/CVPR52729.2023.02027
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Whole-body mesh recovery aims to estimate the 3D human body, face, and hand parameters from a single image. It is challenging to perform this task with a single network due to resolution issues, i.e., the face and hands usually occupy extremely small regions of the image. Existing works usually detect the hands and face, enlarge these regions, feed each into a part-specific network to predict its parameters, and finally fuse the results. While this copy-paste pipeline can capture the fine-grained details of the face and hands, the connections between different parts cannot be easily recovered in late fusion, leading to implausible 3D rotations and unnatural poses. In this work, we propose a one-stage pipeline for expressive whole-body mesh recovery, named OSX, without separate networks for each part. Specifically, we design a Component Aware Transformer (CAT) composed of a global body encoder and a local face/hand decoder. The encoder predicts the body parameters and provides a high-quality feature map for the decoder, which performs a feature-level upsample-crop scheme to extract high-resolution part-specific features and adopts keypoint-guided deformable attention to estimate the hands and face precisely. The whole pipeline is simple yet effective, requires no manual post-processing, and naturally avoids implausible predictions. Comprehensive experiments demonstrate the effectiveness of OSX. Lastly, we build a large-scale Upper-Body dataset (UBody) with high-quality 2D and 3D whole-body annotations. It contains persons with partially visible bodies in diverse real-life scenarios to bridge the gap between the basic task and downstream applications.
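The feature-level upsample-crop idea mentioned in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not say how OSX upsamples or crops, so nearest-neighbour upsampling, a single-channel map stored as nested lists, and the names `upsample_nearest`, `crop`, `upsample_crop`, and the hand box are all assumptions made for illustration only.

```python
from typing import List, Tuple

Grid = List[List[float]]  # a single-channel feature map, indexed [row][col]


def upsample_nearest(feat: Grid, scale: int) -> Grid:
    """Nearest-neighbour upsampling: each cell becomes a scale x scale block."""
    out: Grid = []
    for row in feat:
        wide = [v for v in row for _ in range(scale)]
        out.extend(list(wide) for _ in range(scale))
    return out


def crop(feat: Grid, box: Tuple[int, int, int, int]) -> Grid:
    """Crop rows y0..y1 and columns x0..x1 (end-exclusive)."""
    x0, y0, x1, y1 = box
    return [row[x0:x1] for row in feat[y0:y1]]


def upsample_crop(feat: Grid, box: Tuple[int, int, int, int], scale: int) -> Grid:
    """Upsample the whole map, then crop a part box given in low-res cells."""
    x0, y0, x1, y1 = box
    hi_res = upsample_nearest(feat, scale)
    return crop(hi_res, (x0 * scale, y0 * scale, x1 * scale, y1 * scale))


# Toy 4x4 "body" feature map; the hypothetical hand box spans row 1, columns 2-3.
body_feat = [[float(r * 4 + c) for c in range(4)] for r in range(4)]
hand_feat = upsample_crop(body_feat, box=(2, 1, 4, 2), scale=2)
```

Here the small hand region is taken from the shared body feature map at a higher resolution rather than from a re-cropped image, which is the general contrast the abstract draws with detect-and-enlarge pipelines.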
Pages: 21159-21168
Page count: 10