Population-wide evaluation of artificial intelligence and radiologist assessment of screening mammograms

被引:4
|
作者
Kuhl, Johanne [1 ]
Elhakim, Mohammad Talal [1 ,2 ]
Stougaard, Sarah Wordenskjold [1 ]
Rasmussen, Benjamin Schnack Brandt [1 ,2 ,3 ]
Nielsen, Mads [4 ]
Gerke, Oke [1 ,5 ]
Larsen, Lisbet Bronsro [2 ]
Graumann, Ole [1 ,6 ,7 ]
机构
[1] Univ Southern Denmark, Dept Clin Res, Klovervaenget 10,2ndfloor, DK-5000 Odense C, Denmark
[2] Odense Univ Hosp, Dept Radiol, Klovervaenget 47,Ground Floor, DK-5000 Odense C, Denmark
[3] Odense Univ Hosp, CAI X Centre Clin Artificial Intelligence, Klovervaenget 8C, DK-5000 Odense C, Denmark
[4] Univ Copenhagen, Dept Comp Sci, Univ Pk 1, DK-2100 Copenhagen, Denmark
[5] Odense Univ Hosp, Dept Nucl Med, Klovervaenget 47, DK-5000 Odense C, Denmark
[6] Aarhus Univ Hosp, Dept Radiol, Palle Juul Jensens Blvd 99, DK-8200 Aarhus N, Denmark
[7] Aarhus Univ, Dept Clin Res, Palle Juul Jensens Blvd 99, DK-8200 Aarhus N, Denmark
关键词
Mammography; Breast cancer; Artificial intelligence; Screening; BREAST-CANCER;
D O I
10.1007/s00330-023-10423-7
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Objectives To validate an AI system for standalone breast cancer detection on an entire screening population in comparison to first-reading breast radiologists.Materials and methods All mammography screenings performed between August 4, 2014, and August 15, 2018, in the Region of Southern Denmark with follow-up within 24 months were eligible. Screenings were assessed as normal or abnormal by breast radiologists through double reading with arbitration. For an AI decision of normal or abnormal, two AI-score cut-off points were applied by matching at mean sensitivity (AI(sens)) and specificity (AI(spec)) of first readers. Accuracy measures were sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and recall rate (RR).Results The sample included 249,402 screenings (149,495 women) and 2033 breast cancers (72.6% screen-detected cancers, 27.4% interval cancers). AI(sens) had lower specificity (97.5% vs 97.7%; p < 0.0001) and PPV (17.5% vs 18.7%; p = 0.01) and a higher RR (3.0% vs 2.8%; p < 0.0001) than first readers. AI(spec) was comparable to first readers in terms of all accuracy measures. Both AI(sens) and AI(spec) detected significantly fewer screen-detected cancers (1166 (AI(sens)), 1156 (AI(spec)) vs 1252; p < 0.0001) but found more interval cancers compared to first readers (126 (AI(sens)), 117 (AI(spec)) vs 39; p < 0.0001) with varying types of cancers detected across multiple subgroups.Conclusion Standalone AI can detect breast cancer at an accuracy level equivalent to the standard of first readers when the AI threshold point was matched at first reader specificity. However, AI and first readers detected a different composition of cancers.
引用
收藏
页码:3935 / 3946
页数:12
相关论文
共 50 条
  • [1] Evaluation of Combined Artificial Intelligence and Radiologist Assessment to Interpret Screening Mammograms
    Schaffter, Thomas
    Buist, Diana S. M.
    Lee, Christoph, I
    Nikulin, Yaroslav
    Ribli, Dezso
    Guan, Yuanfang
    Lotter, William
    Jie, Zequn
    Du, Hao
    Wang, Sijia
    Feng, Jiashi
    Feng, Mengling
    Kim, Hyo-Eun
    Albiol, Francisco
    Albiol, Alberto
    Morrell, Stephen
    Wojna, Zbigniew
    Ahsen, Mehmet Eren
    Asif, Umar
    Yepes, Antonio Jimeno
    Yohanandan, Shivanthan
    Rabinovici-Cohen, Simona
    Yi, Darvin
    Hoff, Bruce
    Yu, Thomas
    Neto, Elias Chaibub
    Rubin, Daniel L.
    Lindholm, Peter
    Margolies, Laurie R.
    McBride, Russell Bailey
    Rothstein, Joseph H.
    Sieh, Weiva
    Ben-Ari, Rami
    Harrer, Stefan
    Trister, Andrew
    Friend, Stephen
    Norman, Thea
    Sahiner, Berkman
    Strand, Fredrik
    Guinney, Justin
    Stolovitzky, Gustavo
    [J]. JAMA NETWORK OPEN, 2020, 3 (03) : E200265
  • [2] Evaluation of combined artificial intelligence and radiologist assessment to interpret screening mammograms (vol 3, e200265, 2020)
    Albiol, Francisco
    Caballero, Luis
    [J]. JAMA NETWORK OPEN, 2020, 3 (03)
  • [3] A validated artificial intelligence-based pipeline for population-wide primary immunodeficiency screening
    Rider, Nicholas L.
    Coffey, Michael
    Kurian, Ashok
    Quinn, Jessica
    Orange, Jordan S.
    Modell, Vicki
    Modell, Fred
    [J]. JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY, 2023, 151 (01) : 272 - 279
  • [4] External Evaluation of 3 Commercial Artificial Intelligence Algorithms for Independent Assessment of Screening Mammograms
    Salim, Mattie
    Wahlin, Erik
    Dembrower, Karin
    Azavedo, Edward
    Foukakis, Theodoros
    Liu, Yue
    Smith, Kevin
    Eklund, Martin
    Strand, Fredrik
    [J]. JAMA ONCOLOGY, 2020, 6 (10) : 1581 - 1588
  • [5] Identifying normal mammograms in a large screening population using artificial intelligence
    Lang, Kristina
    Dustler, Magnus
    Dahlblom, Victor
    Akesson, Anna
    Andersson, Ingvar
    Zackrisson, Sophia
    [J]. EUROPEAN RADIOLOGY, 2021, 31 (03) : 1687 - 1692
  • [6] Identifying normal mammograms in a large screening population using artificial intelligence
    Kristina Lång
    Magnus Dustler
    Victor Dahlblom
    Anna Åkesson
    Ingvar Andersson
    Sophia Zackrisson
    [J]. European Radiology, 2021, 31 : 1687 - 1692
  • [7] Beyond the AJR: " External Evaluation of 3 Commercial Artificial Intelligence Algorithms for Independent Assessment of Screening Mammograms"
    Gastounioti, Aimilia
    Conant, Emily F.
    [J]. AMERICAN JOURNAL OF ROENTGENOLOGY, 2021, 216 (06) : 1436 - 1436
  • [8] Women's attitudes and perspectives on the use of artificial intelligence in the assessment of screening mammograms
    Holen, Asne Sorlien
    Martiniussen, Marit Almenning
    Bergan, Marie Burns
    Moshina, Nataliia
    Hovda, Tone
    Hofvind, Solveig
    [J]. EUROPEAN JOURNAL OF RADIOLOGY, 2024, 175
  • [9] Towards population-wide exposure assessment
    Loomis, Dana
    [J]. OCCUPATIONAL AND ENVIRONMENTAL MEDICINE, 2012, 69 (07) : 455 - 456
  • [10] Population-Wide Screening for Chronic Kidney Disease
    van Mil, Dominique
    Heerspink, Hiddo L.
    Gansevoort, Ron T.
    [J]. ANNALS OF INTERNAL MEDICINE, 2024, 177 (01)