Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion

被引:1
|
作者
Zheng, Meng [1 ]
Planche, Benjamin [1 ]
Gong, Xuan [2 ]
Yang, Fan [1 ]
Chen, Terrence [1 ]
Wu, Ziyan [1 ]
机构
[1] United Imaging Intelligence, Cambridge, MA 02140 USA
[2] SUNY Buffalo, Buffalo, NY USA
关键词
3D mesh; Patient positioning; Patient modeling;
D O I
10.1007/978-3-031-16449-1_12
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
3D patient body modeling is critical to the success of automated patient positioning for smart medical scanning and operating rooms. Existing CNN-based end-to-end patient modeling solutions typically require a) customized network designs demanding large amount of relevant training data, covering extensive realistic clinical scenarios (e.g., patient covered by sheets), which leads to suboptimal generalizability in practical deployment, b) expensive 3D human model annotations, i.e., requiring huge amount of manual effort, resulting in systems that scale poorly. To address these issues, we propose a generic modularized 3D patient modeling method consists of (a) a multi-modal keypoint detection module with attentive fusion for 2D patient joint localization, to learn complementary cross-modality patient body information, leading to improved keypoint localization robustness and generalizability in a wide variety of imaging (e.g., CT, MRI etc.) and clinical scenarios (e.g., heavy occlusions); and (b) a self-supervised 3D mesh regression module which does not require expensive 3D mesh parameter annotations to train, bringing immediate cost benefits for clinical deployment. We demonstrate the efficacy of the proposed method by extensive patient positioning experiments on both public and clinical data. Our evaluation results achieve superior patient positioning performance across various imaging modalities in real clinical scenarios.
引用
收藏
页码:115 / 125
页数:11
相关论文
共 50 条
  • [1] Self-supervised multi-modal fusion network for multi-modal thyroid ultrasound image diagnosis
    Xiang, Zhuo
    Zhuo, Qiuluan
    Zhao, Cheng
    Deng, Xiaofei
    Zhu, Ting
    Wang, Tianfu
    Jiang, Wei
    Lei, Baiying
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 150
  • [2] Self-Supervised Multi-Modal Hybrid Fusion Network for Brain Tumor Segmentation
    Fang, Feiyi
    Yao, Yazhou
    Zhou, Tao
    Xie, Guosen
    Lu, Jianfeng
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (11) : 5310 - 5320
  • [3] Self-Supervised Distilled Learning for Multi-modal Misinformation Identification
    Mu, Michael
    Das Bhattacharjee, Sreyasee
    Yuan, Junsong
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2818 - 2827
  • [4] Self-supervised Multi-Modal Video Forgery Attack Detection
    Zhao, Chenhui
    Li, Xiang
    Younes, Rabih
    [J]. 2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
  • [5] Multi-modal emotion recognition using tensor decomposition fusion and self-supervised multi-tasking
    Wang, Rui
    Zhu, Jiawei
    Wang, Shoujin
    Wang, Tao
    Huang, Jingze
    Zhu, Xianxun
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2024, 13 (04)
  • [6] Self-supervised opinion summarization with multi-modal knowledge graph
    Lingyun Jin
    Jingqiang Chen
    [J]. Journal of Intelligent Information Systems, 2024, 62 : 191 - 208
  • [7] Self-supervised opinion summarization with multi-modal knowledge graph
    Jin, Lingyun
    Chen, Jingqiang
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024, 62 (01) : 191 - 208
  • [8] MM-Point: Multi-View Information-Enhanced Multi-Modal Self-Supervised 3D Point Cloud Understanding
    Yu, Hai-Tao
    Song, Mofei
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 6773 - 6781
  • [9] Self-Supervised Entity Alignment Based on Multi-Modal Contrastive Learning
    Bo Liu
    Ruoyi Song
    Yuejia Xiang
    Junbo Du
    Weijian Ruan
    Jinhui Hu
    [J]. IEEE/CAA Journal of Automatica Sinica, 2022, 9 (11) : 2031 - 2033
  • [10] Self-supervised Multi-modal Alignment for Whole Body Medical Imaging
    Windsor, Rhydian
    Jamaludin, Amir
    Kadir, Timor
    Zisserman, Andrew
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT II, 2021, 12902 : 90 - 101