BRACS: A Dataset for BReAst Carcinoma Subtyping in H&E Histology Images

被引:40
|
作者
Brancati, Nadia [1 ]
Anniciello, Anna Maria [2 ]
Pati, Pushpak [3 ,4 ]
Riccio, Daniel [1 ,5 ]
Scognamiglio, Giosue [2 ]
Jaume, Guillaume [3 ,6 ]
De Pietro, Giuseppe [1 ]
Di Bonito, Maurizio [2 ]
Foncubierta, Antonio [3 ]
Botti, Gerardo [2 ]
Gabrani, Maria [3 ]
Feroce, Florinda [2 ]
Frucci, Maria [1 ]
机构
[1] ICAR CNR, Inst High Performance Comp & Networking Res Counc, 111 Via Pietro Castellino, I-80131 Naples, Italy
[2] IRCCS, Natl Canc Inst, Fdn Pascale, 53 Via Mariano Semmola, I-80131 Naples, Italy
[3] IBM Res, Saumerstr 4, CH-8803 Zurich, Switzerland
[4] ETH, Ramistr 101, CH-8092 Zurich, Switzerland
[5] Univ Naples Federico II, Dept Elect Engn & Informat Technol, Via Claudio 21, I-80125 Naples, Italy
[6] EPFL Rte Cantonale, CH-1015 Lausanne, Switzerland
关键词
PATHOLOGY;
D O I
10.1093/database/baac093
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Breast cancer is the most commonly diagnosed cancer and registers the highest number of deaths for women. Advances in diagnostic activities combined with large-scale screening policies have significantly lowered the mortality rates for breast cancer patients. However, the manual inspection of tissue slides by pathologists is cumbersome, time-consuming and is subject to significant inter- and intra-observer variability. Recently, the advent of whole-slide scanning systems has empowered the rapid digitization of pathology slides and enabled the development of Artificial Intelligence (AI)-assisted digital workflows. However, AI techniques, especially Deep Learning, require a large amount of high-quality annotated data to learn from. Constructing such task-specific datasets poses several challenges, such as data-acquisition level constraints, time-consuming and expensive annotations and anonymization of patient information. In this paper, we introduce the BReAst Carcinoma Subtyping (BRACS) dataset, a large cohort of annotated Hematoxylin and Eosin (H&E)-stained images to advance AI development in the automatic characterization of breast lesions. BRACS contains 547 Whole-Slide Images (WSIs) and 4539 Regions Of Interest (ROIs) extracted from the WSIs. Each WSI and respective ROIs are annotated by the consensus of three board-certified pathologists into different lesion categories. Specifically, BRACS includes three lesion types, i.e., benign, malignant and atypical, which are further subtyped into seven categories. It is, to the best of our knowledge, the largest annotated dataset for breast cancer subtyping both at WSI and ROI levels. Furthermore, by including the understudied atypical lesions, BRACS offers a unique opportunity for leveraging AI to better understand their characteristics. We encourage AI practitioners to develop and evaluate novel algorithms on the BRACS dataset to further breast cancer diagnosis and patient care.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] MARKER-CONTROLLED WATERSHED SEGMENTATION OF NUCLEI IN H&E STAINED BREAST CANCER BIOPSY IMAGES
    Veta, M.
    Huisman, A.
    Viergever, M. A.
    van Diest, P. J.
    Pluim, J. P. W.
    2011 8TH IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING: FROM NANO TO MACRO, 2011, : 618 - 621
  • [22] Spatially Resolved Gene Expression Prediction from H&E Histology Images via Bi-modal Contrastive Learning
    Xie, Ronald
    Pang, Kuan
    Chung, Sai W.
    Perciani, Catia T.
    MacParland, Sonya A.
    Wang, Bo
    Bader, Gary D.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [23] Deep Learning-Based Breast Cancer Subtype Classification from Whole-Slide Images: Leveraging the BRACS Dataset
    Hernandez, Nerea
    Carrillo-Perez, Francisco
    Ortuno, Francisco M.
    Rojas, Ignacio
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING, PT II, IWBBIO 2024, 2024, 14849 : 200 - 213
  • [24] Classification of Elastic and Collagen Fibers in H&E Stained Hyperspectral Images
    Septiana, Lina
    Suzuki, Hiroyuki
    Ishikawa, Masahiro
    Obi, Takashi
    Kobayashi, Naoki
    Ohyama, Nagaaki
    Wihardjo, Erning
    Andiani, Dini
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 7031 - 7035
  • [25] Quantitative diagnosis of bladder cancer by morphometric analysis of H&E images
    Wu, Binlin
    Nebylitsa, Samantha V.
    Mukherjee, Sushmita
    Jain, Manu
    PHOTONIC THERAPEUTICS AND DIAGNOSTICS XI, 2015, 9303
  • [26] Semantic segmentation to identify bladder layers from H&E Images
    Muhammad Khalid Khan Niazi
    Enes Yazgan
    Thomas E. Tavolara
    Wencheng Li
    Cheryl T. Lee
    Anil Parwani
    Metin N. Gurcan
    Diagnostic Pathology, 15
  • [27] Evaluation of pediatric melanoma by conventional H&E histology and genetic analysis: Case report
    Eng, William
    Pohler, Holly
    Norman, Robert
    JOURNAL OF THE AMERICAN ACADEMY OF DERMATOLOGY, 2017, 76 (06) : AB130 - AB130
  • [28] Classification of Multiple H&E Images via an Ensemble Computational Scheme
    da Costa Longo, Leonardo H.
    Roberto, Guilherme F.
    Tosta, Thaina A. A.
    de Faria, Paulo R.
    Loyola, Adriano M.
    Cardoso, Sergio V.
    Silva, Adriano B.
    do Nascimento, Marcelo Z.
    Neves, Leandro A.
    ENTROPY, 2024, 26 (01)
  • [29] An Optimized Color Space for the Analysis of Digital Images of H&E Slides
    Zarella, Mark
    Breen, David
    Plagov, Andrei
    Garcia, Fernando
    MODERN PATHOLOGY, 2015, 28 : 403A - 403A
  • [30] InvUnet:Inverse the Unet for Nuclear Segmentation in H&E Stained Images
    Zhang, Lifeng
    Li, Bin
    2020 12TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2020, : 251 - 256