Simulated arbitration of discordance between radiologists and artificial intelligence interpretation of breast cancer screening mammograms

被引:0
|
作者
Marinovich, M. Luke [1 ,2 ]
Lotter, William [3 ,4 ]
Waddell, Andrew [5 ]
Houssami, Nehmat [1 ,2 ]
机构
[1] Univ Sydney, Daffodil Ctr, Joint Venture Canc Council NSW, Sydney, NSW, Australia
[2] Univ Sydney, Sydney Sch Publ Hlth, Fac Med & Hlth, Camperdown, NSW, Australia
[3] Dana Farber Canc Inst, Boston, MA USA
[4] Harvard Med Sch, Boston, MA USA
[5] BreastScreen WA, Perth, WA, Australia
基金
英国医学研究理事会;
关键词
Breast cancer; artificial intelligence; mammography; population screening;
D O I
10.1177/09691413241262960
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Artificial intelligence (AI) algorithms have been retrospectively evaluated as replacement for one radiologist in screening mammography double-reading; however, methods for resolving discordance between radiologists and AI in the absence of 'real-world' arbitration may underestimate cancer detection rate (CDR) and recall. In 108,970 consecutive screens from a population screening program (BreastScreen WA, Western Australia), 20,120 were radiologist/AI discordant without real-world arbitration. Recall probabilities were randomly assigned for these screens in 1000 simulations. Recall thresholds for screen-detected and interval cancers (sensitivity) and no cancer (false-positive proportion, FPP) were varied to calculate mean CDR and recall rate for the entire cohort. Assuming 100% sensitivity, the maximum CDR was 7.30 per 1000 screens. To achieve >95% probability that the mean CDR exceeded the screening program CDR (6.97 per 1000), interval cancer sensitivities >= 63% (at 100% screen-detected sensitivity) and >= 91% (at 80% screen-detected sensitivity) were required. Mean recall rate was relatively constant across sensitivity assumptions, but varied by FPP. FPP > 6.5% resulted in recall rates that exceeded the program estimate (3.38%). CDR improvements depend on a majority of interval cancers being detected in radiologist/AI discordant screens. Such improvements are likely to increase recall, requiring careful monitoring where AI is deployed for screen-reading.
引用
收藏
页码:48 / 52
页数:5
相关论文
共 50 条
  • [1] Variability in the interpretation of screening mammograms by US radiologists
    Shile, PE
    Pilgram, TK
    ACADEMIC RADIOLOGY, 1996, 3 (10) : 879 - 881
  • [2] Performance of an Artificial Intelligence System for Breast Cancer Detection on Screening Mammograms from BreastScreen Norway
    Larsen, Marthe
    Olstad, Camilla F.
    Lee, Christoph I.
    Hovda, Tone
    Hoff, Solveig R.
    Martiniussen, Marit A.
    Mikalsen, Karl Oyvind
    Lund-Hanssen, Hakon
    Solli, Helene S.
    Silberhorn, Marko
    Sulheim, Ase O.
    Auensen, Steinar
    Nygard, Jan F.
    Hofvind, Solveig
    RADIOLOGY-ARTIFICIAL INTELLIGENCE, 2024, 6 (03)
  • [3] Agreement Between Radiologists' Interpretations of Screening Mammograms
    Nishikawa, Robert M.
    Comstock, Christopher E.
    Linver, Michael N.
    Newstead, Gillian M.
    Sandhir, Vinay
    Schmidt, Robert A.
    BREAST IMAGING, IWDM 2016, 2016, 9699 : 3 - 10
  • [4] Anticipating artificial intelligence in mammography screening: views of Swedish breast radiologists
    Hogberg, Charlotte
    Larsson, Stefan
    Lang, Kristina
    BMJ HEALTH & CARE INFORMATICS, 2023, 30 (01)
  • [5] Artificial intelligence to complement rather than replace radiologists in breast screening
    Taylor-Phillips, Sian
    Freeman, Karoline
    LANCET DIGITAL HEALTH, 2022, 4 (07): : E478 - E479
  • [6] Artificial intelligence to complement rather than replace radiologists in breast screening
    Taylor-Phillips, Sian
    Freeman, Karoline
    The Lancet Digital Health, 2022, 4 (07):
  • [7] Artificial Intelligence: Is It Armageddon for Breast Radiologists?
    Chiwome, Lawman
    Okojie, Onosetale M.
    Rahman, A. K. M. Jamiur
    Javed, Faheem
    Hamid, Pousettef
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2020, 12 (06)
  • [8] Artificial intelligence in the interpretation of breast cancer on MRI
    Sheth, Deepa
    Giger, Maryellen L.
    JOURNAL OF MAGNETIC RESONANCE IMAGING, 2020, 51 (05) : 1310 - 1324
  • [9] DISCORDANCE BETWEEN INTERPRETATION OF ERCP FILMS BY RADIOLOGISTS AND GASTROENTEROLOGISTS
    KAZI, S
    SILAS, D
    MCGUIRE, D
    ETZKORN, K
    VENU, RP
    GASTROINTESTINAL ENDOSCOPY, 1995, 41 (04) : 403 - 403
  • [10] Australian women's judgements about using artificial intelligence to read mammograms in breast cancer screening
    Carter, Stacy M.
    Carolan, Lucy
    Aquino, Yves Saint James
    Frazer, Helen
    Rogers, Wendy A.
    Hall, Julie
    Degeling, Chris
    Braunack-Mayer, Annette
    Houssami, Nehmat
    DIGITAL HEALTH, 2023, 9