The impact of site-specific digital histology signatures on deep learning model accuracy and bias

被引:121
|
作者
Howard, Frederick M. [1 ]
Dolezal, James [1 ]
Kochanny, Sara [1 ]
Schulte, Jefree [2 ]
Chen, Heather [2 ]
Heij, Lara [3 ,4 ]
Huo, Dezheng [5 ,6 ]
Nanda, Rita [1 ,6 ]
Olopade, Olufunmilayo I. [1 ,6 ]
Kather, Jakob N. [7 ,8 ,9 ]
Cipriani, Nicole [2 ,6 ]
Grossman, Robert L. [1 ,6 ]
Pearson, Alexander T. [1 ,6 ]
机构
[1] Univ Chicago, Dept Med, Sect Hematol Oncol, 5841 S Maryland Ave, Chicago, IL 60637 USA
[2] Univ Chicago, Dept Pathol, 5841 S Maryland Ave, Chicago, IL 60637 USA
[3] Univ Hosp RWTH Aachen, Dept Surg & Transplantat, Aachen, Germany
[4] Univ Hosp RWTH Aachen, Inst Pathol, Aachen, Germany
[5] Univ Chicago, Dept Publ Hlth Sci, Chicago, IL 60637 USA
[6] Univ Chicago Comprehens Canc Ctr, Chicago, IL USA
[7] Univ Hosp RWTH Aachen, Dept Med 3, Aachen, Germany
[8] Univ Leeds, Leeds Inst Med Res St Jamess, Pathol & Data Analyt, Leeds, W Yorkshire, England
[9] Univ Heidelberg Hosp, Natl Ctr Tumor Dis, Med Oncol, Heidelberg, Germany
关键词
COMPREHENSIVE GENOMIC CHARACTERIZATION; OPERATING CHARACTERISTIC CURVES; BREAST-CANCER; MITOSIS DETECTION; HEALTH-CARE; HISTOPATHOLOGY; ANCESTRY; RESOURCE; BIOLOGY; AREAS;
D O I
10.1038/s41467-021-24698-1
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The Cancer Genome Atlas (TCGA) is one of the largest biorepositories of digital histology. Deep learning (DL) models have been trained on TCGA to predict numerous features directly from histology, including survival, gene expression patterns, and driver mutations. However, we demonstrate that these features vary substantially across tissue submitting sites in TCGA for over 3,000 patients with six cancer subtypes. Additionally, we show that histologic image differences between submitting sites can easily be identified with DL. Site detection remains possible despite commonly used color normalization and augmentation methods, and we quantify the image characteristics constituting this site-specific digital histology signature. We demonstrate that these site-specific signatures lead to biased accuracy for prediction of features including survival, genomic mutations, and tumor stage. Furthermore, ethnicity can also be inferred from site-specific signatures, which must be accounted for to ensure equitable application of DL. These site-specific signatures can lead to overoptimistic estimates of model performance, and we propose a quadratic programming method that abrogates this bias by ensuring models are not trained and validated on samples from the same site. Deep learning models have been trained on The Cancer Genome Atlas to predict numerous features directly from histology, including survival, gene expression patterns, and driver mutations. Here, the authors demonstrate that site-specific histologic signatures can lead to biased estimates of accuracy for such models, and propose a method to minimize such bias.
引用
收藏
页数:13
相关论文
共 50 条
  • [11] Deep learning links histology, molecular signatures and prognosis in cancer
    Coudray, Nicolas
    Tsirigos, Aristotelis
    NATURE CANCER, 2020, 1 (08) : 755 - 757
  • [12] Deep learning links histology, molecular signatures and prognosis in cancer
    Nicolas Coudray
    Aristotelis Tsirigos
    Nature Cancer, 2020, 1 : 755 - 757
  • [13] A smartphone application for site-specific pest management based on deep learning and spatial interpolation
    Zhou, Congliang
    Lee, Won Suk
    Zhang, Shuhao
    Liburd, Oscar E.
    Pourreza, Alireza
    Schueller, John K.
    Ampatzidis, Yiannis
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 218
  • [14] Grid-Free MIMO Beam Alignment Through Site-Specific Deep Learning
    Heng, Yuqiang
    Andrews, Jeffrey G.
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (02) : 908 - 921
  • [15] Site-Specific Defect Detection in Composite Using Solitary Waves Based on Deep Learning
    Kim, Tae-Yeon
    Yoon, Sangyoung
    Yeun, Chan Yeob
    Cantwell, Wesley J.
    Cho, Chung-Suk
    EUROPEAN WORKSHOP ON STRUCTURAL HEALTH MONITORING (EWSHM 2022), VOL 3, 2023, : 442 - 451
  • [16] Site-specific impact of polyphenols on the gastrointestinal microbiome
    Ebrahimi, Faezeh
    Subbiah, Vigasini
    Agar, Osman Tuncay
    Legione, Alistair R.
    Suleria, Hafiz A. R.
    CRITICAL REVIEWS IN FOOD SCIENCE AND NUTRITION, 2024,
  • [17] The Economic Impact of Site-Specific Weed Control
    C. Timmermann
    Roland Gerhards
    W. Kühbauch
    Precision Agriculture, 2003, 4 (3) : 249 - 260
  • [18] Digital soil mapping for site-specific management of soils
    Iticha, Birhanu
    Takele, Chalsissa
    GEODERMA, 2019, 351 : 85 - 91
  • [19] Signatures of site-specific reaction of H2 on Cu(100)
    Somers, MF
    McCormack, DA
    Kroes, GJ
    Olsen, RA
    Baerends, EJ
    Mowrey, RC
    JOURNAL OF CHEMICAL PHYSICS, 2002, 117 (14): : 6673 - 6687
  • [20] Predicting site-specific human selective pressure using evolutionary signatures
    Sadri, Javad
    Diallo, Abdoulaye Banire
    Blanchette, Mathieu
    BIOINFORMATICS, 2011, 27 (13) : I266 - I274