DSCA: A dual-stream network with cross-attention on whole-slide image pyramids for cancer prognosis

被引:11
|
作者
Liu, Pei [1 ]
Fu, Bo [1 ]
Ye, Feng [2 ]
Yang, Rui [1 ]
Ji, Luping [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Xiyuan Ave, Chengdu 611731, Sichuan, Peoples R China
[2] Sichuan Univ, West China Hosp, Inst Clin Pathol, Chengdu 610041, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Whole-slide image; Computational pathology; Cancer prognosis; Survival analysis; Multiple instance learning; SURVIVAL PREDICTION; TRANSFORMER; PROGRESSION; MODEL;
D O I
10.1016/j.eswa.2023.120280
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The cancer prognosis on gigapixel Whole-Slide Images (WSIs) has always been a challenging task. To further enhance WSI visual representations, existing methods have explored image pyramids, instead of single -resolution images, in WSIs. Despite this, they still face two major problems: high computational cost and the unnoticed semantical gap in multi-resolution feature fusion. To tackle these problems, this paper proposes to efficiently exploit WSI pyramids from a new perspective, the dual-stream network with cross-attention (DSCA). Our key idea is to utilize two sub-streams to process the WSI patches with two resolutions, where a square pooling is devised in a high-resolution stream to significantly reduce computational costs, and a cross-attention-based method is proposed to properly handle the fusion of dual-stream features. We validate our DSCA on three publicly-available datasets with a total number of 3,101 WSIs from 1,911 patients. Our experiments and ablation studies verify that (i) the proposed DSCA could outperform existing state-of-the-art methods in cancer prognosis, by an average C-Index improvement of around 4.6%; (ii) our DSCA network is more efficient in computation-it has more learnable parameters (6.31M vs. 860.18K) but less computational costs (2.51G vs. 4.94G), compared to a typical existing multi-resolution network. (iii) the key components of DSCA, dual-stream and cross-attention, indeed contribute to our model's performance, gaining an average C-Index rise of around 2.0% while maintaining a relatively-small computational load. Our DSCA could serve as an alternative and effective tool for WSI-based cancer prognosis. Our source code is available at https://github.com/liupei101/DSCA.
引用
收藏
页数:11
相关论文
共 34 条
  • [1] AI-Generated Image Detection using a Cross-Attention Enhanced Dual-Stream Network
    Xi, Ziyi
    Huang, Wenmin
    Wei, Kangkang
    Luo, Weiqi
    Zheng, Peijia
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1463 - 1470
  • [2] Cascaded Cross-Attention Networks for Data-Efficient Whole-Slide Image Classification Using Transformers
    Khader, Firas
    Kather, Jakob Nikolas
    Han, Tianyu
    Nebelung, Sven
    Kuhl, Christiane
    Stegmaier, Johannes
    Truhn, Daniel
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2023, PT II, 2024, 14349 : 417 - 426
  • [3] Dual-Stream Attention Network for Hyperspectral Image Unmixing
    School of Computer Science and Technology, Ocean University of China, Qingdao
    266100, China
    不详
    266100, China
    arXiv,
  • [4] Dual-Stream Attention Network for Hyperspectral Image Unmixing
    Wang, Yufang
    Wu, Wenmin
    Qi, Lin
    Gao, Feng
    International Geoscience and Remote Sensing Symposium (IGARSS), 2024, : 9438 - 9441
  • [5] Speech Emotion Recognition Using Dual-Stream Representation and Cross-Attention Fusion
    Yu, Shaode
    Meng, Jiajian
    Fan, Wenqing
    Chen, Ye
    Zhu, Bing
    Yu, Hang
    Xie, Yaoqin
    Sun, Qiuirui
    ELECTRONICS, 2024, 13 (11)
  • [6] Intersection-union dual-stream cross-attention Lova-SwinUnet for skin cancer hair segmentation and image repair
    Qin, Juanjuan
    Pei, Dong
    Guo, Qian
    Cai, Xingjuan
    Xie, Liping
    Zhang, Wensheng
    Computers in Biology and Medicine, 2024, 180
  • [7] Dual-Stream Discriminative Attention Network for Cross-Scene Hyperspectral Image Classification
    Wang, Chenglong
    Guo, Yi
    Fu, Jiaojiao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [8] Dual-stream Self-attention Network for Image Captioning
    Wan, Boyang
    Jiang, Wenhui
    Fang, Yuming
    Wen, Wenying
    Liu, Hantao
    2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
  • [9] DSCA-Net: Indoor Head Detection Network Using Dual-Stream Information and Channel Attention
    QIN Pinle
    SHEN Wenxiang
    ZENG Jianchao
    Chinese Journal of Electronics, 2020, 29 (06) : 1102 - 1109
  • [10] Dual-stream Multiple Instance Learning Network for Whole Slide Image Classification with Self-supervised Contrastive Learning
    Li, Bin
    Li, Yin
    Eliceiri, Kevin W.
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14313 - 14323