MULTIMODAL REPRESENTATION LEARNING FOR BLASTOCYST ASSESSMENT

Cited by: 4
Authors
Wang, Youcheng [1]
Zheng, Zhe [1]
Ni, Na [1]
Tong, Guoqing [2]
Cheng, Nuo [3]
Li, Kai [3]
Yin, Ping [3]
Chen, Yuanyuan [3]
Wu, Yingna [1]
Xie, Guangping [1]
Affiliations
[1] ShanghaiTech Univ, Sch Creat & Art, Ctr Adapt Syst Engn, Shanghai, Peoples R China
[2] Xi An Jiao Tong Univ, Dept Reprod Med, Affiliated Hosp 1, Xian, Peoples R China
[3] Shuguang Hosp, Reprod Med Ctr, Shanghai, Peoples R China
Keywords
Blastocyst Assessment; Multimodal Representation Learning; Image-text Retrieval; Visual Transformer
DOI
10.1109/ISBI53787.2023.10230468
Chinese Library Classification (CLC) number
TP18 [Theory of artificial intelligence]
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Blastocyst selection based on morphology grading is crucial in in vitro fertilization (IVF) treatment. Several studies based on convolutional neural networks (CNNs) have been reported that select the most viable blastocyst automatically. In this paper, we propose a multimodal representation learning framework in which the text description is first streamed as a complementary supervision signal to enrich the visual information. Moreover, we reformulate the blastocyst assessment problem as an image-text retrieval task to address data imbalance. Experimental results show that the proposed model outperforms its unimodal classification (+1.5%) and image retrieval (+1.2%) counterparts on performance metrics such as accuracy, which demonstrates its effectiveness.
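As a rough illustration of what treating assessment as image-text retrieval can look like, the sketch below picks the text description whose embedding is most similar to an image embedding. It is not the authors' implementation: the function names, the three grade descriptions, and the toy three-dimensional vectors are all hypothetical, and in practice the embeddings would come from trained image and text encoders.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def retrieve_grade(image_emb, text_embs):
    """Return the label whose text embedding is closest to the image embedding."""
    return max(text_embs, key=lambda label: cosine(image_emb, text_embs[label]))


# Hypothetical text embeddings, one per grade description (toy values).
text_embs = {
    "grade A description": [0.9, 0.1, 0.0],
    "grade B description": [0.1, 0.9, 0.0],
    "grade C description": [0.0, 0.1, 0.9],
}

# Hypothetical image embedding produced by a visual encoder (toy values).
image_emb = [0.2, 0.8, 0.1]

predicted = retrieve_grade(image_emb, text_embs)
print(predicted)
```

Because the prediction is a nearest-description lookup rather than a fixed classification head, each grade is represented by its text embedding regardless of how many training images it has, which is one plausible reading of how a retrieval formulation could help with imbalanced classes.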
Pages: 5
Related papers
50 in total
  • [21] Fundamental Considerations on Representation Learning for Multimodal Processing
    Jin'no, Kenya
    Izumi, Masato
    Okamoto, Saki
    Dai, Mizuki
    Takahashi, Chisato
    Inami, Tatsuro
    HUMAN INTERFACE AND THE MANAGEMENT OF INFORMATION, HIMI 2023, PT I, 2023, 14015 : 389 - 399
  • [22] Multimodal Representation Learning for Recommendation in Internet of Things
    Huang, Zhenhua
    Xu, Xin
    Ni, Juan
    Zhu, Honghao
    Wang, Cheng
    IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (06) : 10675 - 10685
  • [23] Robust Multimodal Learning via Representation Decoupling
    Wei, Shicai
    Luo, Yang
    Wang, Yuji
    Luo, Chunbo
    COMPUTER VISION - ECCV 2024, PT XLII, 2025, 15100 : 38 - 54
  • [24] Multimodal deep representation learning for video classification
    Haiman Tian
    Yudong Tao
    Samira Pouyanfar
    Shu-Ching Chen
    Mei-Ling Shyu
    World Wide Web, 2019, 22 : 1325 - 1341
  • [25] Multimodal Representation Learning by Alternating Unimodal Adaptation
    Zhang, Xiaohui
    Yoon, Jaehong
    Bansal, Mohit
    Yao, Huaxiu
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 27446 - 27456
  • [26] Deep Learning-Based Quantitative Blastocyst Assessment
    Zheng, Zhe
    Wang, Youcheng
    Ni, Na
    Tong, Guoqing
    Cheng, Nuo
    Yin, Ping
    Chen, Yuanyuan
    Wu, Yingna
    Xie, Guangping
    Yang, Tingting
    2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [27] Learning Comprehensive Multimodal Representation for Cancer Survival Prediction
    Wu, Xingqi
    Shi, Yi
    Liu, Honglei
    Li, Ao
    Wang, Minghui
    2022 5TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND NATURAL LANGUAGE PROCESSING, MLNLP 2022, 2022, : 332 - 336
  • [28] TriSAT: Trimodal Representation Learning for Multimodal Sentiment Analysis
    Huan, Ruohong
    Zhong, Guowei
    Chen, Peng
    Liang, Ronghua
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 4105 - 4120
  • [29] Deep Multimodal Representation Learning from Temporal Data
    Yang, Xitong
    Ramesh, Palghat
    Chitta, Radha
    Madhvanath, Sriganesh
    Bernal, Edgar A.
    Luo, Jiebo
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5066 - 5074
  • [30] Knowledge Base Completion Based on Multimodal Representation Learning
    Wang J.
    Su H.
    Lai X.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2021, 34 (01) : 33 - 43