BRACS: A Dataset for BReAst Carcinoma Subtyping in H&E Histology Images

被引:40
|
作者
Brancati, Nadia [1 ]
Anniciello, Anna Maria [2 ]
Pati, Pushpak [3 ,4 ]
Riccio, Daniel [1 ,5 ]
Scognamiglio, Giosue [2 ]
Jaume, Guillaume [3 ,6 ]
De Pietro, Giuseppe [1 ]
Di Bonito, Maurizio [2 ]
Foncubierta, Antonio [3 ]
Botti, Gerardo [2 ]
Gabrani, Maria [3 ]
Feroce, Florinda [2 ]
Frucci, Maria [1 ]
机构
[1] ICAR CNR, Inst High Performance Comp & Networking Res Counc, 111 Via Pietro Castellino, I-80131 Naples, Italy
[2] IRCCS, Natl Canc Inst, Fdn Pascale, 53 Via Mariano Semmola, I-80131 Naples, Italy
[3] IBM Res, Saumerstr 4, CH-8803 Zurich, Switzerland
[4] ETH, Ramistr 101, CH-8092 Zurich, Switzerland
[5] Univ Naples Federico II, Dept Elect Engn & Informat Technol, Via Claudio 21, I-80125 Naples, Italy
[6] EPFL Rte Cantonale, CH-1015 Lausanne, Switzerland
关键词
PATHOLOGY;
D O I
10.1093/database/baac093
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Breast cancer is the most commonly diagnosed cancer and registers the highest number of deaths for women. Advances in diagnostic activities combined with large-scale screening policies have significantly lowered the mortality rates for breast cancer patients. However, the manual inspection of tissue slides by pathologists is cumbersome, time-consuming and is subject to significant inter- and intra-observer variability. Recently, the advent of whole-slide scanning systems has empowered the rapid digitization of pathology slides and enabled the development of Artificial Intelligence (AI)-assisted digital workflows. However, AI techniques, especially Deep Learning, require a large amount of high-quality annotated data to learn from. Constructing such task-specific datasets poses several challenges, such as data-acquisition level constraints, time-consuming and expensive annotations and anonymization of patient information. In this paper, we introduce the BReAst Carcinoma Subtyping (BRACS) dataset, a large cohort of annotated Hematoxylin and Eosin (H&E)-stained images to advance AI development in the automatic characterization of breast lesions. BRACS contains 547 Whole-Slide Images (WSIs) and 4539 Regions Of Interest (ROIs) extracted from the WSIs. Each WSI and respective ROIs are annotated by the consensus of three board-certified pathologists into different lesion categories. Specifically, BRACS includes three lesion types, i.e., benign, malignant and atypical, which are further subtyped into seven categories. It is, to the best of our knowledge, the largest annotated dataset for breast cancer subtyping both at WSI and ROI levels. Furthermore, by including the understudied atypical lesions, BRACS offers a unique opportunity for leveraging AI to better understand their characteristics. We encourage AI practitioners to develop and evaluate novel algorithms on the BRACS dataset to further breast cancer diagnosis and patient care.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Identifying neutrophils in H&E staining histology tissue images
    Wang, Jiazhuo, 1600, Springer Verlag (8673):
  • [2] Unsupervised Segmentation of H&E Breast Images
    Hope, Tyna A.
    Yaffe, Martin J.
    MEDICAL IMAGING 2017: DIGITAL PATHOLOGY, 2017, 10140
  • [3] Identifying Neutrophils in H&E Staining Histology Tissue Images
    Wang, Jiazhuo
    MacKenzie, John D.
    Ramachandran, Rageshree
    Chen, Danny Z.
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2014, PT I, 2014, 8673 : 73 - +
  • [4] A Classification Scheme for Lymphocyte Segmentation in H&E Stained Histology Images
    Kuse, Manohar
    Sharma, Tanuj
    Gupta, Sudhir
    RECOGNIZING PATTERNS IN SIGNALS, SPEECH, IMAGES, AND VIDEOS, 2010, 6388 : 235 - 243
  • [5] Tumor Region Localization in H&E Breast Carcinoma Images Using Deep Convolutional Neural Network
    Fauzi, Mohammad F. A.
    Jamaluddin, Mohammad F.
    Lee, Jenny T. H.
    Teoh, Kean H.
    Looi, Lai M.
    2018 IEEE THIRD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, APPLICATIONS AND SYSTEMS (IPAS), 2018, : 61 - 66
  • [6] Automatic Nuclei Segmentation in H&E Stained Breast Cancer Histopathology Images
    Veta, Mitko
    van Diest, Paul J.
    Kornegoor, Robert
    Huisman, Andre
    Viergever, Max A.
    Pluim, Josien P. W.
    PLOS ONE, 2013, 8 (07):
  • [7] Survival Outcome Prediction of Breast Carcinomas on H&E Images Using Artificial Intelligence
    Rynkiewicz, Joseph
    Paul, Julian
    Salhi, Yahia
    Bossard, Celine
    Salhi, Sanae
    Chetritt, Jerome
    LABORATORY INVESTIGATION, 2024, 104 (03) : S250 - S251
  • [8] Towards Virtual H&E Staining of Hyperspectral Lung Histology Images Using Conditional Generative Adversarial Networks
    Bayramoglu, Neslihan
    Kaakinen, Mika
    Eklund, Lauri
    Heikkila, Janne
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 64 - 71
  • [9] Mitosis detection techniques in H&E stained breast cancer pathological images: A comprehensive review
    Pan, Xipeng
    Lu, Yinghua
    Lan, Rushi
    Liu, Zhenbing
    Qin, Zujun
    Wang, Huadeng
    Liu, Zaiyi
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 91
  • [10] Automated Detection of DCIS in Whole-Slide H&E Stained Breast Histopathology Images
    Bejnordi, Babak Ehteshami
    Balkenhol, Maschenka
    Litjens, Geert
    Holland, Roland
    Bult, Peter
    Karssemeijer, Nico
    van der Laak, Jeroen A. W. M.
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2016, 35 (09) : 2141 - 2150