MULTIMODAL REPRESENTATION LEARNING FOR BLASTOCYST ASSESSMENT

被引:4
|
作者
Wang, Youcheng [1 ]
Zheng, Zhe [1 ]
Ni, Na [1 ]
Tong, Guoqing [2 ]
Cheng, Nuo [3 ]
Li, Kai [3 ]
Yin, Ping [3 ]
Chen, Yuanyuan [3 ]
Wu, Yingna [1 ]
Xie, Guangping [1 ]
机构
[1] ShanghaiTech Univ, Sch Creat & Art, Ctr Adapt Syst Engn, Shanghai, Peoples R China
[2] Xi An Jiao Tong Univ, Dept Reprod Med, Affiliated Hosp 1, Xian, Peoples R China
[3] Shuguang Hosp, Reprod Med Ctr, Shanghai, Peoples R China
关键词
Blastocyst Assessment; Multimodal Representation Learning; Image-text Retrieval; Visual Transformer;
D O I
10.1109/ISBI53787.2023.10230468
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Blastocyst selection based on morphology grading is crucial in in vitro fertilization (IVF) treatment. Several research studies based on convolutional neural networks (CNNs) have been reported to select the most viable blastocyst automatically. In this paper, we propose a multimodal representation learning framework in which the text description is firstly streamed as a complementary supervision signal to enrich the visual information. Moreover, we redefine the blastocyst assessment problem to an image-text retrieval task to solve the data imbalance. The experimental results show that the performance metrics, e.g., accuracy, outperform the unimodal classification (+1.5%) and image retrieval counterparts (+1.2%), which demonstrates our proposed model's effectiveness.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Excavating multimodal correlation for representation learning
    Mai, Sijie
    Sun, Ya
    Zeng, Ying
    Hu, Haifeng
    INFORMATION FUSION, 2023, 91 : 542 - 555
  • [2] Deep Multimodal Representation Learning: A Survey
    Guo, Wenzhong
    Wang, Jianwen
    Wang, Shiping
    IEEE ACCESS, 2019, 7 : 63373 - 63394
  • [3] Relaxing Contrastiveness in Multimodal Representation Learning
    Lin, Zudi
    Bas, Erhan
    Singh, Kunwar Yashraj
    Swaminathan, Gurumurthy
    Bhotika, Rahul
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2226 - 2235
  • [4] Disentangled Multimodal Representation Learning for Recommendation
    Liu, Fan
    Chen, Huilin
    Cheng, Zhiyong
    Liu, Anan
    Nie, Liqiang
    Kankanhalli, Mohan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7149 - 7159
  • [5] Geometric Multimodal Contrastive Representation Learning
    Poklukar, Petra
    Vasco, Miguel
    Yin, Hang
    Melo, Francisco S.
    Paiva, Ana
    Kragic, Danica
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [6] Multimodal pretraining for unsupervised protein representation learning
    Nguyen, Viet Thanh Duy
    Hy, Truong Son
    BIOLOGY METHODS & PROTOCOLS, 2024, 9 (01):
  • [7] Improved Multimodal Representation Learning with Skip Connections
    Zhang, Ning
    Cao, Yu
    Liu, Benyuan
    Luo, Yan
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 654 - 662
  • [8] Variational Invariant Representation Learning for Multimodal Recommendation
    Yang, Wei
    Zhang, Haoran
    Zhang, Li
    PROCEEDINGS OF THE 2024 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2024, : 752 - 760
  • [9] A deep semantic framework for multimodal representation learning
    Cheng Wang
    Haojin Yang
    Christoph Meinel
    Multimedia Tools and Applications, 2016, 75 : 9255 - 9276
  • [10] Survey of Research on Deep Multimodal Representation Learning
    Pan, Mengzhu
    Li, Qianmu
    Qiu, Tian
    Computer Engineering and Applications, 2024, 59 (02) : 48 - 64