BRACS: A Dataset for BReAst Carcinoma Subtyping in H&E Histology Images

被引:40
|
作者
Brancati, Nadia [1 ]
Anniciello, Anna Maria [2 ]
Pati, Pushpak [3 ,4 ]
Riccio, Daniel [1 ,5 ]
Scognamiglio, Giosue [2 ]
Jaume, Guillaume [3 ,6 ]
De Pietro, Giuseppe [1 ]
Di Bonito, Maurizio [2 ]
Foncubierta, Antonio [3 ]
Botti, Gerardo [2 ]
Gabrani, Maria [3 ]
Feroce, Florinda [2 ]
Frucci, Maria [1 ]
机构
[1] ICAR CNR, Inst High Performance Comp & Networking Res Counc, 111 Via Pietro Castellino, I-80131 Naples, Italy
[2] IRCCS, Natl Canc Inst, Fdn Pascale, 53 Via Mariano Semmola, I-80131 Naples, Italy
[3] IBM Res, Saumerstr 4, CH-8803 Zurich, Switzerland
[4] ETH, Ramistr 101, CH-8092 Zurich, Switzerland
[5] Univ Naples Federico II, Dept Elect Engn & Informat Technol, Via Claudio 21, I-80125 Naples, Italy
[6] EPFL Rte Cantonale, CH-1015 Lausanne, Switzerland
关键词
PATHOLOGY;
D O I
10.1093/database/baac093
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Breast cancer is the most commonly diagnosed cancer and registers the highest number of deaths for women. Advances in diagnostic activities combined with large-scale screening policies have significantly lowered the mortality rates for breast cancer patients. However, the manual inspection of tissue slides by pathologists is cumbersome, time-consuming and is subject to significant inter- and intra-observer variability. Recently, the advent of whole-slide scanning systems has empowered the rapid digitization of pathology slides and enabled the development of Artificial Intelligence (AI)-assisted digital workflows. However, AI techniques, especially Deep Learning, require a large amount of high-quality annotated data to learn from. Constructing such task-specific datasets poses several challenges, such as data-acquisition level constraints, time-consuming and expensive annotations and anonymization of patient information. In this paper, we introduce the BReAst Carcinoma Subtyping (BRACS) dataset, a large cohort of annotated Hematoxylin and Eosin (H&E)-stained images to advance AI development in the automatic characterization of breast lesions. BRACS contains 547 Whole-Slide Images (WSIs) and 4539 Regions Of Interest (ROIs) extracted from the WSIs. Each WSI and respective ROIs are annotated by the consensus of three board-certified pathologists into different lesion categories. Specifically, BRACS includes three lesion types, i.e., benign, malignant and atypical, which are further subtyped into seven categories. It is, to the best of our knowledge, the largest annotated dataset for breast cancer subtyping both at WSI and ROI levels. Furthermore, by including the understudied atypical lesions, BRACS offers a unique opportunity for leveraging AI to better understand their characteristics. We encourage AI practitioners to develop and evaluate novel algorithms on the BRACS dataset to further breast cancer diagnosis and patient care.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Automatic layer segmentation of H&E microscopic images of mice skin
    Hussein, Saif
    Selway, Joanne
    Jassim, Sabah
    Al-Assam, Hisham
    MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2016, 2016, 9869
  • [32] An Optimized Color Space for the Analysis of Digital Images of H&E Slides
    Zarella, Mark
    Breen, David
    Plagov, Andrei
    Garcia, Fernando
    LABORATORY INVESTIGATION, 2015, 95 : 403A - 403A
  • [33] Semantic segmentation to identify bladder layers from H&E Images
    Niazi, Muhammad Khalid Khan
    Yazgan, Enes
    Tavolara, Thomas E.
    Li, Wencheng
    Lee, Cheryl T.
    Parwani, Anil
    Gurcan, Metin N.
    DIAGNOSTIC PATHOLOGY, 2020, 15 (01)
  • [34] Predicting immunotherapy outcomes from H&E images in lung cancer
    Loo, Jessica
    Wang, Yang
    Wong, Pok Fai
    Wulczyn, Ellery
    Lai, Jeremy
    Cimermancic, Peter
    Steiner, David F.
    Weaver, Shamira S.
    CANCER RESEARCH, 2024, 84 (06)
  • [35] Automated assessment of breast cancer lymphatic infiltration by H&E image
    Kuo, Yung-Lung
    Ko, Chien-Chuan
    Lee, Ming-Ji
    THIRD INTERNATIONAL CONFERENCE ON INFORMATION SECURITY AND INTELLIGENT CONTROL (ISIC 2012), 2012, : 202 - 205
  • [36] Comparison of PHH3, Ki-67 and H&E Mitotic Count in Invasive Breast Carcinoma
    Woo, Jennifer
    Moatamed, Neda
    Sullivan, Peggy
    Lu, David
    Apple, Sophia
    LABORATORY INVESTIGATION, 2015, 95 : 74A - 74A
  • [37] Deep Learning Models Differentiate Tumor Grades from H&E Stained Histology Sections
    Khoshdeli, Mina
    Borowsky, Alexander
    Parvin, Bahram
    2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 620 - 623
  • [38] MSIreg: an R package for unsupervised coregistration of mass spectrometry and H&E images
    Lakkimsetty, Sai Srikanth
    Weber, Andreas
    Bemis, Kylie A.
    Stehl, Verena
    Bronsert, Peter
    Foell, Melanie C.
    Vitek, Olga
    BIOINFORMATICS, 2024, 40 (11)
  • [39] Comparison of PHH3, Ki-67 and H&E Mitotic Count in Invasive Breast Carcinoma
    Woo, Jennifer
    Moatamed, Neda
    Sullivan, Peggy
    Lu, David
    Apple, Sophia
    MODERN PATHOLOGY, 2015, 28 : 74A - 74A
  • [40] Deep learning to determine breast cancer estrogen receptor status from nuclear morphometric features in H&E images
    Rawat, Rishi R.
    Ruderman, Daniel
    Agus, David B.
    Macklin, Paul
    CANCER RESEARCH, 2017, 77