3D human body modeling with orthogonal human mask image based on multi-channel Swin transformer architecture

被引:0
|
作者
Li, Xihang [1 ]
Li, Guiqin [1 ,3 ]
Li, Ming [1 ]
Liu, Kuiliang [1 ]
Mitrouchev, Peter [2 ]
机构
[1] Shanghai Univ, Shanghai Key Lab Intelligent Mfg & Robot, Shanghai, Peoples R China
[2] Univ Grenoble Alpes, G SCOP, St Martin Dheres, France
[3] Shanghai Univ, Shanghai Key Lab Intelligent Mfg & Robot, 333 Nanchen Rd, Shanghai 200444, Peoples R China
关键词
Body shape space; Human shape estimation; Orthogonal human mask; Swin transformer; Body shape classification; RECONSTRUCTION;
D O I
10.1016/j.imavis.2023.104795
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The reconstruction based on RGB images of dressed human body lacks the shape information of the human body under clothing, while the naked 3D human body scanning will violate the user's privacy. To overcome these limitations, a new method, based on Swin transformer (Swin-T), for reconstructing 3D human body shape from human orthogonal mask image is proposed. Its core is to express the reconstruction problem as solving regression mapping function. A fast body shape type classification method based on the human front mask is proposed. The regression function is innovatively represented as a piecewise function, with the body shape of the human body as the segmentation criterion. A multi-channel Swin-T architecture is designed, which can not only extract features from front and side mask images, but also their mixed features to construct the regression mapping function. Different body types for different genders are predicted with separate regression function to help estimate an accurate human model. Extensive experimental results show that the proposed method effectively achieves visually realistic and accurate body reconstruction, and significantly outperforms the current state-ofthe-art methods. In addition, the classification of body types can compensate for the errors caused by partial clothing laxity in practical applications, which is beneficial for users to obtain a more accurate 3D human model.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Low-Cost Multi-image Based 3D Human Body Modeling
    Wang, Zheng
    Gagalowicz, Andre
    Sun, Meijun
    COMPUTER VISION/COMPUTER GRAPHICS COLLABORATION TECHNIQUES, PROCEEDINGS, 2009, 5496 : 265 - +
  • [2] System Modeling of Human Body Based on Multi-channel Wrist Pulse Measurements
    Li, Huiling
    He, Qian
    Jin, Zhao
    Jiang, Yunfeng
    2024 IEEE 13RD SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP, SAM 2024, 2024,
  • [3] Multi-Focus Microscopy Image Fusion Based on Swin Transformer Architecture
    Xia, Han Hank
    Gao, Hao
    Shao, Hang
    Gao, Kun
    Liu, Wei
    APPLIED SCIENCES-BASEL, 2023, 13 (23):
  • [4] 3D Human Body Modeling Based on Single Kinect
    Zhang, Guanglin
    Li, Jiping
    Peng, Jianjun
    Pang, Hao
    Jiao, Xulun
    2014 7TH INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS (BMEI 2014), 2014, : 100 - 104
  • [5] Multi-channel Features Fitted 3D CNNS and LSTMS for Human Activity Recognition
    Qin, Yang
    Mo, Lingfei
    Ye, Jing
    Du, Zhening
    2016 10TH INTERNATIONAL CONFERENCE ON SENSING TECHNOLOGY (ICST), 2016,
  • [6] Deformable Human Body Modeling from 3D Medical Image Scans
    Rhee, Taehyun
    Lui, Patrick
    Lewis, J. P.
    ROLE AND IMPORTANCE OF MATHEMATICS IN INNOVATION, 2017, 25 : 143 - 147
  • [7] MULTI-CHANNEL EEG COMPRESSION BASED ON 3D DECOMPOSITIONS
    Dauwels, Justin
    Srinivasan, K.
    Ramasubba, Reddy M.
    Cichocki, Andrzej
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 637 - 640
  • [8] Non-rigid temporal registration of 2D and 3D multi-channel microscopy image sequences of human cells
    Kim, I.
    Yang, S.
    Le Baccon, P.
    Heard, E.
    Chen, Y. -C.
    Spector, D.
    Kappell, C.
    Eils, R.
    Rohr, K.
    2007 4TH IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING : MACRO TO NANO, VOLS 1-3, 2007, : 1328 - +
  • [9] CATIA 3D Human Body Modeling Based on Photographic Method
    Sun Linlin
    Kong Fansen
    Yu Duonian
    Han Feifei
    2012 WORLD AUTOMATION CONGRESS (WAC), 2012,
  • [10] Swin transformer with multiscale 3D atrous convolution for hyperspectral image classification
    Farooque, Ghulam
    Liu, Qichao
    Sargano, Allah Bux
    Xiao, Liang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126