MULTIMODAL REPRESENTATION LEARNING FOR BLASTOCYST ASSESSMENT

被引:4
|
作者
Wang, Youcheng [1 ]
Zheng, Zhe [1 ]
Ni, Na [1 ]
Tong, Guoqing [2 ]
Cheng, Nuo [3 ]
Li, Kai [3 ]
Yin, Ping [3 ]
Chen, Yuanyuan [3 ]
Wu, Yingna [1 ]
Xie, Guangping [1 ]
机构
[1] ShanghaiTech Univ, Sch Creat & Art, Ctr Adapt Syst Engn, Shanghai, Peoples R China
[2] Xi An Jiao Tong Univ, Dept Reprod Med, Affiliated Hosp 1, Xian, Peoples R China
[3] Shuguang Hosp, Reprod Med Ctr, Shanghai, Peoples R China
关键词
Blastocyst Assessment; Multimodal Representation Learning; Image-text Retrieval; Visual Transformer;
D O I
10.1109/ISBI53787.2023.10230468
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Blastocyst selection based on morphology grading is crucial in in vitro fertilization (IVF) treatment. Several research studies based on convolutional neural networks (CNNs) have been reported to select the most viable blastocyst automatically. In this paper, we propose a multimodal representation learning framework in which the text description is firstly streamed as a complementary supervision signal to enrich the visual information. Moreover, we redefine the blastocyst assessment problem to an image-text retrieval task to solve the data imbalance. The experimental results show that the performance metrics, e.g., accuracy, outperform the unimodal classification (+1.5%) and image retrieval counterparts (+1.2%), which demonstrates our proposed model's effectiveness.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Learning multimodal word representation with graph convolutional networks
    Zhu, Wenhao
    Liu, Shuang
    Liu, Chaoming
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (06)
  • [42] Multimodal Representation Learning-Based Product Matching
    Feng, Changkai
    Chen, Wei
    Chen, Chao
    Xu, Tong
    Chen, Enhong
    CCKS 2022 - EVALUATION TRACK, 2022, 1711 : 180 - 190
  • [43] Multimodal Representation Learning For Real-World Applications
    Joshi, Abhinav
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2022, 2022, : 717 - 723
  • [44] Learning from the global view: Supervised contrastive learning of multimodal representation
    Mai, Sijie
    Zeng, Ying
    Hu, Haifeng
    INFORMATION FUSION, 2023, 100
  • [45] Multimodal Representation Learning via Graph Isomorphism Network for Toxicity Multitask Learning
    Wang, Guishen
    Feng, Hui
    Du, Mengyan
    Feng, Yuncong
    Cao, Chen
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2024, 64 (21) : 8322 - 8338
  • [46] A multimodal dynamical variational autoencoder for audiovisual speech representation learning
    Sadok, Samir
    Leglaive, Simon
    Girin, Laurent
    Alameda-Pineda, Xavier
    Seguier, Renaud
    NEURAL NETWORKS, 2024, 172
  • [47] MERL: Multimodal Event Representation Learning in Heterogeneous Embedding Spaces
    Zhang, Linhai
    Zhou, Deyu
    He, Yulan
    Yang, Zeng
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14420 - 14427
  • [48] Self-Supervised Hypergraph Learning for Enhanced Multimodal Representation
    Shu, Hongji
    Meng, Chaojun
    de Meo, Pasquale
    Wang, Qing
    Zhu, Jia
    IEEE ACCESS, 2024, 12 : 20830 - 20839
  • [49] Multimodal face aging framework via learning disentangled representation
    Liu, Lu
    Wang, Shenghui
    Wan, Lili
    Yu, Haibo
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 83
  • [50] Learning Freehand Ultrasound Through Multimodal Representation and Skill Adaptation
    Deng, Xutian
    Jiang, Junnan
    Cheng, Wen
    Yang, Chenguang
    Li, Miao
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 1 - 14