Exploring Large-scale Public Medical Image Datasets

被引:115
|
作者
Oakden-Rayner, Luke [1 ,2 ,3 ]
机构
[1] Australian Inst Machine Learning, North Terrace, Adelaide, SA, Australia
[2] Univ Adelaide, Sch Publ Hlth, North Terrace, Adelaide, SA 5000, Australia
[3] Royal Adelaide Hosp, North Terrace, Adelaide, SA, Australia
关键词
Artificial intelligence; dataset; exploratory analysis; deep learning; quality control;
D O I
10.1016/j.acra.2019.10.006
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Rationale and Objectives: Medical artificial intelligence systems are dependent on well characterized large-scale datasets. Recently released public datasets have been of great interest to the field, but pose specific challenges due to the disconnect they cause between data generation and data usage, potentially limiting the utility of these datasets. Materials and Methods: We visually explore two large public datasets, to determine how accurate the provided labels are and whether other subtle problems exist. The ChestXray14 dataset contains 112,120 frontal chest films, and the Musculoskeletal Radiology (MURA) dataset contains 40,561 upper limb radiographs. A subset of around 700 images from both datasets was reviewed by a board-certified radiologist, and the quality of the original labels was determined. Results: The ChestXray14 labels did not accurately reflect the visual content of the images, with positive predictive values mostly between 10% and 30% lower than the values presented in the original documentation. There were other significant problems, with examples of hidden stratification and label disambiguation failure. The MURA labels were more accurate, but the original normal/abnormal labels were inaccurate for the subset of cases with degenerative joint disease, with a sensitivity of 60% and a specificity of 82%. Conclusion: Visual inspection of images is a necessary component of understanding large image datasets. We recommend that teams producing public datasets should perform this important quality control procedure and include a thorough description of their findings, along with an explanation of the data generating procedures and labeling rules, in the documentation for their datasets.
引用
收藏
页码:106 / 112
页数:7
相关论文
共 50 条
  • [21] Map Matching Algorithm for Large-scale Datasets
    Fiedler, David
    Cap, Michal
    Nykl, Jan
    Zilecky, Pavol
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 3, 2022, : 500 - 508
  • [22] Momentum Online LDA for Large-scale Datasets
    Ouyang, Jihong
    Lu, You
    Li, Ximing
    21ST EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2014), 2014, 263 : 1075 - 1076
  • [23] Large-Scale Datasets in Special Education Research
    Griffin, Megan M.
    Steinbrecher, Trisha D.
    USING SECONDARY DATASETS TO UNDERSTAND PERSONS WITH DEVELOPMENTAL DISABILITIES AND THEIR FAMILIES, 2013, 45 : 155 - 183
  • [24] Towards algorithmic analytics for large-scale datasets
    Danilo Bzdok
    Thomas E. Nichols
    Stephen M. Smith
    Nature Machine Intelligence, 2019, 1 : 296 - 306
  • [25] Iterative Classification for Sanitizing Large-Scale Datasets
    Li, Bo
    Vorobeychik, Yevgeniy
    Li, Muqun
    Malin, Bradley
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 841 - 846
  • [26] National differences in image quality assessment: An investigation on three large-scale IQA datasets
    Saupe, Dietmar
    Del Pin, Simon Hviid
    2024 16TH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE, QOMEX 2024, 2024, : 214 - 220
  • [27] Accounting for intensity variation in image analysis of large-scale multiplexed clinical trial datasets
    Frei, Anja L.
    McGuigan, Anthony
    Sinha, Ritik R. A. K.
    Glaire, Mark A.
    Jabbar, Faiz
    Gneo, Luciana
    Tomasevic, Tijana
    Harkin, Andrea
    Iveson, Tim J.
    Saunders, Mark
    Oein, Karin
    Maka, Noori
    Pezella, Francesco
    Campo, Leticia
    Hay, Jennifer
    Edwards, Joanne
    Sansom, Owen J.
    Kelly, Caroline
    Tomlinson, Ian
    Kildal, Wanja
    Kerr, Rachel S.
    Kerr, David J.
    Danielsen, Havard E.
    Domingo, Enric
    TransSCOT Consortium, David N.
    Church, David N.
    Koelzer, Viktor H.
    JOURNAL OF PATHOLOGY CLINICAL RESEARCH, 2023, 9 (06): : 449 - 463
  • [28] Automatic segmentation of large-scale CT image datasets for detailed body composition analysis
    Nouman Ahmad
    Robin Strand
    Björn Sparresäter
    Sambit Tarai
    Elin Lundström
    Göran Bergström
    Håkan Ahlström
    Joel Kullberg
    BMC Bioinformatics, 24
  • [29] Automatic segmentation of large-scale CT image datasets for detailed body composition analysis
    Ahmad, Nouman
    Strand, Robin
    Sparresater, Bjoern
    Tarai, Sambit
    Lundstrom, Elin
    Bergstrom, Goeran
    Ahlstrom, Hakan
    Kullberg, Joel
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [30] 3DMiner: Discovering Shapes from Large-Scale Unannotated Image Datasets
    Cheng, Ta-Ying
    Gadelha, Matheus
    Pirk, Soeren
    Groueix, Thibault
    Mech, Radomir
    Markham, Andrew
    Trigoni, Niki
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 9297 - 9307