DSCA: A dual-stream network with cross-attention on whole-slide image pyramids for cancer prognosis

被引:11
|
作者
Liu, Pei [1 ]
Fu, Bo [1 ]
Ye, Feng [2 ]
Yang, Rui [1 ]
Ji, Luping [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Xiyuan Ave, Chengdu 611731, Sichuan, Peoples R China
[2] Sichuan Univ, West China Hosp, Inst Clin Pathol, Chengdu 610041, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Whole-slide image; Computational pathology; Cancer prognosis; Survival analysis; Multiple instance learning; SURVIVAL PREDICTION; TRANSFORMER; PROGRESSION; MODEL;
D O I
10.1016/j.eswa.2023.120280
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The cancer prognosis on gigapixel Whole-Slide Images (WSIs) has always been a challenging task. To further enhance WSI visual representations, existing methods have explored image pyramids, instead of single -resolution images, in WSIs. Despite this, they still face two major problems: high computational cost and the unnoticed semantical gap in multi-resolution feature fusion. To tackle these problems, this paper proposes to efficiently exploit WSI pyramids from a new perspective, the dual-stream network with cross-attention (DSCA). Our key idea is to utilize two sub-streams to process the WSI patches with two resolutions, where a square pooling is devised in a high-resolution stream to significantly reduce computational costs, and a cross-attention-based method is proposed to properly handle the fusion of dual-stream features. We validate our DSCA on three publicly-available datasets with a total number of 3,101 WSIs from 1,911 patients. Our experiments and ablation studies verify that (i) the proposed DSCA could outperform existing state-of-the-art methods in cancer prognosis, by an average C-Index improvement of around 4.6%; (ii) our DSCA network is more efficient in computation-it has more learnable parameters (6.31M vs. 860.18K) but less computational costs (2.51G vs. 4.94G), compared to a typical existing multi-resolution network. (iii) the key components of DSCA, dual-stream and cross-attention, indeed contribute to our model's performance, gaining an average C-Index rise of around 2.0% while maintaining a relatively-small computational load. Our DSCA could serve as an alternative and effective tool for WSI-based cancer prognosis. Our source code is available at https://github.com/liupei101/DSCA.
引用
收藏
页数:11
相关论文
共 34 条
  • [31] HSG-MGAF Net: Heterogeneous subgraph-guided multiscale graph attention fusion network for interpretable prediction of whole-slide image
    Liang, Meiyan
    Jiang, Xing
    Cao, Jie
    Zhang, Shupeng
    Liu, Haishun
    Li, Bo
    Wang, Lin
    Zhang, Cunlin
    Jia, Xiaojun
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 247
  • [32] Weakly Supervised Framework for Cancer Region Detection of Hepatocellular Carcinoma in Whole-Slide Pathologic Images Based on Multiscale Attention Convolutional Neural Network
    Diao, Songhui
    Tian, Yinli
    Hu, Wanming
    Hou, Jiaxin
    Lambo, Ricardo
    Zhang, Zhicheng
    Xie, Yaoqin
    Nie, Xiu
    Zhang, Fa
    Racoceanu, Daniel
    Qin, Wenjian
    AMERICAN JOURNAL OF PATHOLOGY, 2022, 192 (03): : 553 - 563
  • [33] Classification of colorectal cancer consensus molecular subtypes using attention-based multi-instance learning network on whole-slide images
    Xu, Huilin
    Wu, Aoshen
    Ren, He
    Yu, Chenghang
    Liu, Gang
    Liu, Lei
    ACTA HISTOCHEMICA, 2023, 125 (06)
  • [34] CAF-AHGCN: context-aware attention fusion adaptive hypergraph convolutional network for human-interpretable prediction of gigapixel whole-slide image
    Liang, Meiyan
    Jiang, Xing
    Cao, Jie
    Li, Bo
    Wang, Lin
    Chen, Qinghui
    Zhang, Cunlin
    Zhao, Yuejin
    VISUAL COMPUTER, 2024, 40 (12): : 8747 - 8765