DSCA: A dual-stream network with cross-attention on whole-slide image pyramids for cancer prognosis

被引：11

作者：

Liu, Pei ^{[1
]}

Fu, Bo ^{[1
]}

Ye, Feng ^{[2
]}

Yang, Rui ^{[1
]}

Ji, Luping ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Xiyuan Ave, Chengdu 611731, Sichuan, Peoples R China

[2] Sichuan Univ, West China Hosp, Inst Clin Pathol, Chengdu 610041, Sichuan, Peoples R China

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2023年 / 227卷

基金：

中国国家自然科学基金;

关键词：

Whole-slide image; Computational pathology; Cancer prognosis; Survival analysis; Multiple instance learning; SURVIVAL PREDICTION; TRANSFORMER; PROGRESSION; MODEL;

D O I：

10.1016/j.eswa.2023.120280

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The cancer prognosis on gigapixel Whole-Slide Images (WSIs) has always been a challenging task. To further enhance WSI visual representations, existing methods have explored image pyramids, instead of single -resolution images, in WSIs. Despite this, they still face two major problems: high computational cost and the unnoticed semantical gap in multi-resolution feature fusion. To tackle these problems, this paper proposes to efficiently exploit WSI pyramids from a new perspective, the dual-stream network with cross-attention (DSCA). Our key idea is to utilize two sub-streams to process the WSI patches with two resolutions, where a square pooling is devised in a high-resolution stream to significantly reduce computational costs, and a cross-attention-based method is proposed to properly handle the fusion of dual-stream features. We validate our DSCA on three publicly-available datasets with a total number of 3,101 WSIs from 1,911 patients. Our experiments and ablation studies verify that (i) the proposed DSCA could outperform existing state-of-the-art methods in cancer prognosis, by an average C-Index improvement of around 4.6%; (ii) our DSCA network is more efficient in computation-it has more learnable parameters (6.31M vs. 860.18K) but less computational costs (2.51G vs. 4.94G), compared to a typical existing multi-resolution network. (iii) the key components of DSCA, dual-stream and cross-attention, indeed contribute to our model's performance, gaining an average C-Index rise of around 2.0% while maintaining a relatively-small computational load. Our DSCA could serve as an alternative and effective tool for WSI-based cancer prognosis. Our source code is available at https://github.com/liupei101/DSCA.

引用

页数：11

共 34 条

[31] HSG-MGAF Net: Heterogeneous subgraph-guided multiscale graph attention fusion network for interpretable prediction of whole-slide image
Liang, Meiyan
Jiang, Xing
Cao, Jie
Zhang, Shupeng
Liu, Haishun
Li, Bo
Wang, Lin
Zhang, Cunlin
Jia, Xiaojun
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 247
[32] Weakly Supervised Framework for Cancer Region Detection of Hepatocellular Carcinoma in Whole-Slide Pathologic Images Based on Multiscale Attention Convolutional Neural Network
Diao, Songhui
Tian, Yinli
Hu, Wanming
Hou, Jiaxin
Lambo, Ricardo
Zhang, Zhicheng
Xie, Yaoqin
Nie, Xiu
Zhang, Fa
Racoceanu, Daniel
Qin, Wenjian
AMERICAN JOURNAL OF PATHOLOGY, 2022, 192 (03): : 553 - 563
[33] Classification of colorectal cancer consensus molecular subtypes using attention-based multi-instance learning network on whole-slide images
Xu, Huilin
Wu, Aoshen
Ren, He
Yu, Chenghang
Liu, Gang
Liu, Lei
ACTA HISTOCHEMICA, 2023, 125 (06)
[34] CAF-AHGCN: context-aware attention fusion adaptive hypergraph convolutional network for human-interpretable prediction of gigapixel whole-slide image
Liang, Meiyan
Jiang, Xing
Cao, Jie
Li, Bo
Wang, Lin
Chen, Qinghui
Zhang, Cunlin
Zhao, Yuejin
VISUAL COMPUTER, 2024, 40 (12): : 8747 - 8765

← 1 2 3 4 →