Multimodal contrastive learning for spatial gene expression prediction using histology images

被引:3
|
作者
Min, Wenwen [1 ]
Shi, Zhiceng [1 ]
Zhang, Jun [1 ]
Wan, Jun [2 ,3 ]
Wang, Changmiao
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, East Outer Ring Rd, Kunming 650500, Yunnan, Peoples R China
[2] Zhongnan Univ Econ & Law, Sch Informat & Engn, 182 South Lake Ave, Wuhan 430073, Hubei, Peoples R China
[3] Shenzhen Res Inst Big Data, Med Big Data, Longxiang Blvd, Shenzhen 518172, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
spatial transcriptomics; histology images; multimodal contrastive learning; transformer encoder; CELL; IDENTIFICATION; ARCHITECTURE; ATLAS;
D O I
10.1093/bib/bbae551
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In recent years, the advent of spatial transcriptomics (ST) technology has unlocked unprecedented opportunities for delving into the complexities of gene expression patterns within intricate biological systems. Despite its transformative potential, the prohibitive cost of ST technology remains a significant barrier to its widespread adoption in large-scale studies. An alternative, more cost-effective strategy involves employing artificial intelligence to predict gene expression levels using readily accessible whole-slide images stained with Hematoxylin and Eosin (H&E). However, existing methods have yet to fully capitalize on multimodal information provided by H&E images and ST data with spatial location. In this paper, we propose mclSTExp, a multimodal contrastive learning with Transformer and Densenet-121 encoder for Spatial Transcriptomics Expression prediction. We conceptualize each spot as a "word", integrating its intrinsic features with spatial context through the self-attention mechanism of a Transformer encoder. This integration is further enriched by incorporating image features via contrastive learning, thereby enhancing the predictive capability of our model. We conducted an extensive evaluation of highly variable genes in two breast cancer datasets and a skin squamous cell carcinoma dataset, and the results demonstrate that mclSTExp exhibits superior performance in predicting spatial gene expression. Moreover, mclSTExp has shown promise in interpreting cancer-specific overexpressed genes, elucidating immune-related genes, and identifying specialized spatial domains annotated by pathologists. Our source code is available at https://github.com/shizhiceng/mclSTExp.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Breast cancer histopathology image-based gene expression prediction using spatial transcriptomics data and deep learning
    Md Mamunur Rahaman
    Ewan K. A. Millar
    Erik Meijering
    Scientific Reports, 13
  • [42] Patient specific tumor growth prediction using multimodal images
    Liu, Yixun
    Sadowski, Samira M.
    Weisbrod, Allison B.
    Kebebew, Electron
    Summers, Ronald M.
    Yao, Jianhua
    MEDICAL IMAGE ANALYSIS, 2014, 18 (03) : 555 - 566
  • [43] Breast cancer histopathology image-based gene expression prediction using spatial transcriptomics data and deep learning
    Rahaman, Md Mamunur
    Millar, Ewan K. A.
    Meijering, Erik
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [44] Uncertainty-Aware Contrastive Learning for Semi-Supervised Classification of Multimodal Remote Sensing Images
    Ding, Kexin
    Lu, Ting
    Li, Shutao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 13
  • [45] AN APPROACH BASED ON CONTRASTIVE LEARNING AND VECTOR QUANTIZATION TO THE UNSUPERVISED LAND-COVER SEGMENTATION OF MULTIMODAL IMAGES
    Chen, Yuxing
    Bruzzone, Lorenzo
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 4811 - 4818
  • [46] ConCL: Concept Contrastive Learning for Dense Prediction Pre-training in Pathology Images
    Yang, Jiawei
    Chen, Hanbo
    Liang, Yuan
    Huang, Junzhou
    He, Lei
    Yao, Jianhua
    COMPUTER VISION, ECCV 2022, PT XXI, 2022, 13681 : 523 - 539
  • [47] Classification for thyroid nodule using ViT with contrastive learning in ultrasound images
    Sun, Jiawei
    Wu, Bobo
    Zhao, Tong
    Gao, Liugang
    Xie, Kai
    Lin, Tao
    Sui, Jianfeng
    Li, Xiaoqin
    Wu, Xiaojin
    Ni, Xinye
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 152
  • [48] Multimodal Deep Learning Crime Prediction Using Tweets
    Tam, Sakirin
    Tanriover, Omer Ozgur
    IEEE ACCESS, 2023, 11 : 93204 - 93214
  • [49] Deep learning-based multimodal integration of histology and genomics improves cancer origin prediction
    Shaban, Muhammad
    Lu, Ming Y.
    Williamson, Drew F. K.
    Chen, Richard J.
    Lipkova, Jana
    Chen, Tiffany Y.
    Mahmood, Faisal
    CANCER RESEARCH, 2023, 83 (02)
  • [50] Bridging Histology to Spatial Transcriptomics: A Pathology Foundation Model-driven Contrast Learning for Predicting Spatial Transcriptomic Profiles from Histology Images
    Huang, Zi Huai
    Xu, Ziyang
    Hu, Pingzhao
    GENETIC EPIDEMIOLOGY, 2024, 48 (07) : 361 - 362