External Evaluation of 3 Commercial Artificial Intelligence Algorithms for Independent Assessment of Screening Mammograms

被引:141
|
作者
Salim, Mattie [1 ,2 ]
Wahlin, Erik [3 ]
Dembrower, Karin [4 ,5 ]
Azavedo, Edward [1 ,6 ]
Foukakis, Theodoros [1 ,2 ]
Liu, Yue [7 ]
Smith, Kevin [8 ]
Eklund, Martin [9 ]
Strand, Fredrik [1 ,10 ]
机构
[1] Karolinska Inst, Dept Oncol Pathol, Stockholm, Sweden
[2] Karolinska Univ Hosp, Dept Radiol, Stockholm, Sweden
[3] Karolinska Univ Hosp, Dept Med Radiat Phys & Nucl Med, Stockholm, Sweden
[4] Karolinska Inst, Dept Physiol & Pharmacol, Stockholm, Sweden
[5] Capio Sankt Gorans Hosp, Dept Radiol, Stockholm, Sweden
[6] Karolinska Inst, Dept Mol Med & Surg, Stockholm, Sweden
[7] KTH Royal Inst Technol, Div Computat Sci & Technol, Sci Life Lab, Solna, Sweden
[8] KTH Royal Inst Technol, Sci Life Lab Solna, Solna, Sweden
[9] Karolinska Inst, Dept Med Epidemiol & Biostat, Stockholm, Sweden
[10] Karolinska Univ Hosp, Breast Radiol, Stockholm, Sweden
关键词
BREAST-CANCER; ACCURACY; INTERVAL; AI;
D O I
10.1001/jamaoncol.2020.3321
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
ImportanceA computer algorithm that performs at or above the level of radiologists in mammography screening assessment could improve the effectiveness of breast cancer screening. ObjectiveTo perform an external evaluation of 3 commercially available artificial intelligence (AI) computer-aided detection algorithms as independent mammography readers and to assess the screening performance when combined with radiologists. Design, Setting, and ParticipantsThis retrospective case-control study was based on a double-reader population-based mammography screening cohort of women screened at an academic hospital in Stockholm, Sweden, from 2008 to 2015. The study included 8805 women aged 40 to 74 years who underwent mammography screening and who did not have implants or prior breast cancer. The study sample included 739 women who were diagnosed as having breast cancer (positive) and a random sample of 8066 healthy controls (negative for breast cancer). Main Outcomes and MeasuresPositive follow-up findings were determined by pathology-verified diagnosis at screening or within 12 months thereafter. Negative follow-up findings were determined by a 2-year cancer-free follow-up. Three AI computer-aided detection algorithms (AI-1, AI-2, and AI-3), sourced from different vendors, yielded a continuous score for the suspicion of cancer in each mammography examination. For a decision of normal or abnormal, the cut point was defined by the mean specificity of the first-reader radiologists (96.6%). ResultsThe median age of study participants was 60 years (interquartile range, 50-66 years) for 739 women who received a diagnosis of breast cancer and 54 years (interquartile range, 47-63 years) for 8066 healthy controls. The cases positive for cancer comprised 618 (84%) screen detected and 121 (16%) clinically detected within 12 months of the screening examination. The area under the receiver operating curve for cancer detection was 0.956 (95% CI, 0.948-0.965) for AI-1, 0.922 (95% CI, 0.910-0.934) for AI-2, and 0.920 (95% CI, 0.909-0.931) for AI-3. At the specificity of the radiologists, the sensitivities were 81.9% for AI-1, 67.0% for AI-2, 67.4% for AI-3, 77.4% for first-reader radiologist, and 80.1% for second-reader radiologist. Combining AI-1 with first-reader radiologists achieved 88.6% sensitivity at 93.0% specificity (abnormal defined by either of the 2 making an abnormal assessment). No other examined combination of AI algorithms and radiologists surpassed this sensitivity level. Conclusions and RelevanceTo our knowledge, this study is the first independent evaluation of several AI computer-aided detection algorithms for screening mammography. The results of this study indicated that a commercially available AI computer-aided detection algorithm can assess screening mammograms with a sufficient diagnostic performance to be further evaluated as an independent reader in prospective clinical trials. Combining the first readers with the best algorithm identified more cases positive for cancer than combining the first readers with second readers. This case-control study of women screened at an academic hospital in Stockholm, Sweden, evaluates 3 commercially available artificial intelligence algorithms to assess whether they perform independently as well or better than radiologists in mammography screening assessment or improve the performance of radiologists.
引用
收藏
页码:1581 / 1588
页数:8
相关论文
共 50 条
  • [1] Beyond the AJR: " External Evaluation of 3 Commercial Artificial Intelligence Algorithms for Independent Assessment of Screening Mammograms"
    Gastounioti, Aimilia
    Conant, Emily F.
    [J]. AMERICAN JOURNAL OF ROENTGENOLOGY, 2021, 216 (06) : 1436 - 1436
  • [2] Evaluation of Combined Artificial Intelligence and Radiologist Assessment to Interpret Screening Mammograms
    Schaffter, Thomas
    Buist, Diana S. M.
    Lee, Christoph, I
    Nikulin, Yaroslav
    Ribli, Dezso
    Guan, Yuanfang
    Lotter, William
    Jie, Zequn
    Du, Hao
    Wang, Sijia
    Feng, Jiashi
    Feng, Mengling
    Kim, Hyo-Eun
    Albiol, Francisco
    Albiol, Alberto
    Morrell, Stephen
    Wojna, Zbigniew
    Ahsen, Mehmet Eren
    Asif, Umar
    Yepes, Antonio Jimeno
    Yohanandan, Shivanthan
    Rabinovici-Cohen, Simona
    Yi, Darvin
    Hoff, Bruce
    Yu, Thomas
    Neto, Elias Chaibub
    Rubin, Daniel L.
    Lindholm, Peter
    Margolies, Laurie R.
    McBride, Russell Bailey
    Rothstein, Joseph H.
    Sieh, Weiva
    Ben-Ari, Rami
    Harrer, Stefan
    Trister, Andrew
    Friend, Stephen
    Norman, Thea
    Sahiner, Berkman
    Strand, Fredrik
    Guinney, Justin
    Stolovitzky, Gustavo
    [J]. JAMA NETWORK OPEN, 2020, 3 (03) : E200265
  • [3] Artificial Intelligence to Support Independent Assessment of Screening Mammograms-The Time Has Come
    Lehman, Constance Dobbins
    [J]. JAMA ONCOLOGY, 2020, 6 (10) : 1588 - 1589
  • [4] Population-wide evaluation of artificial intelligence and radiologist assessment of screening mammograms
    Kuhl, Johanne
    Elhakim, Mohammad Talal
    Stougaard, Sarah Wordenskjold
    Rasmussen, Benjamin Schnack Brandt
    Nielsen, Mads
    Gerke, Oke
    Larsen, Lisbet Bronsro
    Graumann, Ole
    [J]. EUROPEAN RADIOLOGY, 2024, 34 (06) : 3935 - 3946
  • [5] Independent External Validation of Artificial Intelligence Algorithms for Automated Interpretation of Screening Mammography: A Systematic Review
    Anderson, Anna W.
    Marinovich, M. Luke
    Houssami, Nehmat
    Lowry, Kathryn P.
    Elmore, Joann G.
    Buist, Diana S. M.
    Hofvind, Solveig
    Lee, Christoph I.
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF RADIOLOGY, 2022, 19 (02) : 259 - 273
  • [6] Evaluation of combined artificial intelligence and radiologist assessment to interpret screening mammograms (vol 3, e200265, 2020)
    Albiol, Francisco
    Caballero, Luis
    [J]. JAMA NETWORK OPEN, 2020, 3 (03)
  • [7] Women's attitudes and perspectives on the use of artificial intelligence in the assessment of screening mammograms
    Holen, Asne Sorlien
    Martiniussen, Marit Almenning
    Bergan, Marie Burns
    Moshina, Nataliia
    Hovda, Tone
    Hofvind, Solveig
    [J]. EUROPEAN JOURNAL OF RADIOLOGY, 2024, 175
  • [8] Multi-vendor evaluation of artificial intelligence as an independent reader for double reading in breast cancer screening on 275,900 mammograms
    Sharma, Nisha
    Ng, Annie Y.
    James, Jonathan J.
    Khara, Galvin
    Ambrozay, Eva
    Austin, Christopher C.
    Forrai, Gabor
    Fox, Georgia
    Glocker, Ben
    Heindl, Andreas
    Karpati, Edit
    Rijken, Tobias M.
    Venkataraman, Vignesh
    Yearsley, Joseph E.
    Kecskemethy, Peter D.
    [J]. BMC CANCER, 2023, 23 (01)
  • [9] Multi-vendor evaluation of artificial intelligence as an independent reader for double reading in breast cancer screening on 275,900 mammograms
    Nisha Sharma
    Annie Y. Ng
    Jonathan J. James
    Galvin Khara
    Éva Ambrózay
    Christopher C. Austin
    Gábor Forrai
    Georgia Fox
    Ben Glocker
    Andreas Heindl
    Edit Karpati
    Tobias M. Rijken
    Vignesh Venkataraman
    Joseph E. Yearsley
    Peter D. Kecskemethy
    [J]. BMC Cancer, 23
  • [10] Identifying normal mammograms in a large screening population using artificial intelligence
    Lang, Kristina
    Dustler, Magnus
    Dahlblom, Victor
    Akesson, Anna
    Andersson, Ingvar
    Zackrisson, Sophia
    [J]. EUROPEAN RADIOLOGY, 2021, 31 (03) : 1687 - 1692