Predicting the Visual Attention of Pathologists Evaluating Whole Slide Images of Cancer

Cited by: 2
Authors
Chakraborty, Souradeep [1]
Gupta, Rajarsi [2]
Ma, Ke [9]
Govind, Darshana [5]
Sarder, Pinaki [6]
Choi, Won-Tak [8]
Mahmud, Waqas [2]
Yee, Eric [7]
Allard, Felicia [7]
Knudsen, Beatrice [3]
Zelinsky, Gregory [1, 4]
Saltz, Joel [2]
Samaras, Dimitris [1]
Affiliations
[1] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY 11794 USA
[2] SUNY Stony Brook, Dept Biomed Informat, Stony Brook, NY 11794 USA
[3] Univ Utah, Sch Med, Dept Pathol, Salt Lake City, UT USA
[4] SUNY Stony Brook, Dept Psychol, Stony Brook, NY 11794 USA
[5] Univ Buffalo, Dept Pathol & Anat Sci, Buffalo, NY USA
[6] Univ Florida, Dept Med, Gainesville, FL USA
[7] Univ Arkansas Med Sci, Dept Pathol, Little Rock, AR 72205 USA
[8] Univ Calif San Francisco, Dept Pathol, San Francisco, CA 94140 USA
[9] Snap Inc, Santa Monica, CA USA
Keywords
Visual attention; Digital microscopy; Cognitive pathology
DOI
10.1007/978-3-031-16961-8_2
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This work presents PathAttFormer, a deep learning model that predicts the visual attention of pathologists viewing whole slide images (WSIs) while evaluating cancer. The model has two main components: (1) a patch-wise attention prediction module using a Swin transformer backbone and (2) a self-attention-based attention refinement module that computes pairwise similarity between patches to predict spatially consistent attention heatmaps. We observed a high level of agreement between model predictions and actual viewing behavior, which was collected by capturing panning and zooming movements using a digital microscope interface. Visual attention was analyzed in the evaluation of prostate cancer and gastrointestinal neuroendocrine tumors (GI-NETs), which differ greatly in their diagnostic paradigms and demands on attention. Prostate cancer involves examining WSIs stained with Hematoxylin and Eosin (H&E) to identify distinct growth patterns for Gleason grading. In contrast, GI-NETs require a multi-step approach: identifying tumor regions in H&E WSIs, then grading by quantifying the number of Ki-67 positive tumor cells highlighted with immunohistochemistry (IHC) in a separate image. We collected attention data from pathologists viewing prostate cancer H&E WSIs from The Cancer Genome Atlas (TCGA) and 21 H&E WSIs of GI-NETs with corresponding Ki-67 IHC WSIs. This is the first work to use the Swin transformer architecture to predict visual attention in histopathology images of GI-NETs, an approach that is generalizable to predicting attention in the evaluation of multiple sequential images in real-world diagnostic pathology and IHC applications.
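As a rough illustration of the refinement idea described in the abstract (using pairwise similarity between patches to make per-patch attention scores more consistent), the following is a minimal sketch only. It is not the PathAttFormer implementation: the Swin backbone is omitted, and the function name, the `temperature` parameter, and the use of scaled dot-product similarity are all assumptions made for the example.

```python
import numpy as np


def refine_patch_attention(features, scores, temperature=1.0):
    """Smooth per-patch attention scores using pairwise patch similarity.

    Hypothetical helper for illustration; not taken from the paper.
    features: (N, D) array of patch embeddings (assumed backbone outputs).
    scores:   (N,) array of initial per-patch attention predictions.
    Returns an (N,) array of refined scores.
    """
    features = np.asarray(features, dtype=float)
    scores = np.asarray(scores, dtype=float)
    d = features.shape[1]
    # Scaled dot-product similarity between every pair of patches.
    logits = features @ features.T / (np.sqrt(d) * temperature)
    # Row-wise softmax, shifted by the row maximum for numerical stability.
    logits = logits - logits.max(axis=1, keepdims=True)
    weights = np.exp(logits)
    weights = weights / weights.sum(axis=1, keepdims=True)
    # Each refined score is a similarity-weighted average of all scores.
    return weights @ scores


# Usage with random stand-in data (6 patches, 8-dimensional embeddings).
rng = np.random.default_rng(0)
patch_features = rng.normal(size=(6, 8))
initial_scores = rng.uniform(size=6)
refined_scores = refine_patch_attention(patch_features, initial_scores)
print(refined_scores.shape)  # (6,)
```

Because each row of the weight matrix sums to one, every refined score is a convex combination of the initial scores, so patches with similar embeddings are pulled toward similar attention values.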
Pages: 11-21
Page count: 11