Artificial Intelligence Evaluation of 122 969 Mammography Examinations from a Population-based Screening Program

被引:58
|
作者
Larsen, Marthe [1 ]
Aglen, Camilla F. [1 ]
Lee, Christoph, I [5 ,6 ]
Hoff, Solveig R. [7 ,8 ]
Lund-Hanssen, Hakon [9 ]
Lang, Kristina [10 ,11 ]
Nygard, Jan F. [2 ]
Ursin, Giske [3 ]
Hofvind, Solveig [1 ,4 ]
机构
[1] Sect Breast Canc Screening, POB 5313, N-0304 Oslo, Norway
[2] Dept Register Informat, POB 5313, N-0304 Oslo, Norway
[3] Canc Registry Norway, POB 5313, N-0304 Oslo, Norway
[4] Arctic Univ Norway, Fac Hlth Sci, Dept Hlth & Care Sci, Tromso, Norway
[5] Univ Washington, Sch Med, Dept Radiol, Seattle, WA 98195 USA
[6] Univ Washington, Sch Publ Hlth, Dept Hlth Syst & Populat Hlth, Seattle, WA 98195 USA
[7] Alesund Hosp, More & Romsdal Hosp Trust, Dept Radiol, Alesund, Norway
[8] Natl Univ Sci & Technol, Fac Med & Hlth Sci, Dept Circulat & Med Imaging, Trondheim, Norway
[9] St Olavs Univ Hosp, Dept Radiol & Nucl Med, Trondheim, Norway
[10] Lund Univ, Dept Translat Med, Lund, Sweden
[11] Skane Univ Hosp, Unilabs Mammog Unit, Malmo, Sweden
关键词
CANCER; PERFORMANCE; INTERVAL; FILM;
D O I
10.1148/radiol.212381
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background: Artificial intelligence (AI) has shown promising results for cancer detection with mammographic screening. However, evidence related to the use of AI in real screening settings remain sparse. Purpose: To compare the performance of a commercially available AI system with routine, independent double reading with consensus as performed in a population-based screening program. Furthermore, the histopathologic characteristics of tumors with different AI scores were explored. Materials and Methods: In this retrospective study, 122 969 screening examinations from 47 877 women performed at four screening units in BreastScreen Norway from October 2009 to December 2018 were included. The data set included 752 screen-detected cancers (6.1 per 1000 examinations) and 205 interval cancers (1.7 per 1000 examinations). Each examination had an AI score between 1 and 10, where 1 indicated low risk of breast cancer and 10 indicated high risk. Threshold 1, threshold 2, and threshold 3 were used to assess the performance of the AI system as a binary decision tool (selected vs not selected). Threshold 1 was set at an AI score of 10, threshold 2 was set to yield a selection rate similar to the consensus rate (8.8%), and threshold 3 was set to yield a selection rate similar to an average individual radiologist (5.8%). Descriptive statistics were used to summarize screening outcomes. Results: A total of 653 of 752 screen-detected cancers (86.8%) and 92 of 205 interval cancers (44.9%) were given a score of 10 by the AI system (threshold 1). Using threshold 3, 80.1% of the screen-detected cancers (602 of 752) and 30.7% of the interval cancers (63 of 205) were selected. Screen-detected cancer with AI scores not selected using the thresholds had favorable histopathologic characteristics compared to those selected; opposite results were observed for interval cancer. Conclusion: The proportion of screen-detected cancers not selected by the artificial intelligence (AI) system at the three evaluated thresholds was less than 20%. The overall performance of the AI system was promising according to cancer detection. (C) RSNA, 2022
引用
收藏
页码:502 / 511
页数:10
相关论文
共 50 条
  • [1] Possible strategies for use of artificial intelligence in screen-reading of mammograms, based on retrospective data from 122,969 screening examinations
    Larsen, Marthe
    Aglen, Camilla F.
    Hoff, Solveig R.
    Lund-Hanssen, Hakon
    Hofvind, Solveig
    EUROPEAN RADIOLOGY, 2022, 32 (12) : 8238 - 8246
  • [2] Possible strategies for use of artificial intelligence in screen-reading of mammograms, based on retrospective data from 122,969 screening examinations
    Marthe Larsen
    Camilla F. Aglen
    Solveig R. Hoff
    Håkon Lund-Hanssen
    Solveig Hofvind
    European Radiology, 2022, 32 : 8238 - 8246
  • [3] A POPULATION-BASED SCREENING MAMMOGRAPHY PROGRAM IN SASKATCHEWAN
    EDOUARD, L
    EBERTS, L
    BALDWIN, J
    AMERICAN JOURNAL OF PUBLIC HEALTH, 1994, 84 (06) : 1030 - 1031
  • [4] Combined screening with mammography and ultrasound in a population-based screening program
    Buchberger, Wolfgang
    Geiger-Gritsch, Sabine
    Knapp, Rudolf
    Gautsch, Kurt
    Oberaigner, Willi
    EUROPEAN JOURNAL OF RADIOLOGY, 2018, 101 : 24 - 29
  • [5] Screening outcome for consecutive examinations with digital breast tomosynthesis versus standard digital mammography in a population-based screening program
    Tone Hovda
    Siri H. B. Brandal
    Sofie Sebuødegård
    Åsne S. Holen
    Hilde Bjørndal
    Per Skaane
    Solveig Hofvind
    European Radiology, 2019, 29 : 6991 - 6999
  • [6] Screening outcome for consecutive examinations with digital breast tomosynthesis versus standard digital mammography in a population-based screening program
    Hovda, Tone
    Brandal, Siri H. B.
    Sebuadegard, Sofie
    Holen, Asne S.
    Bjorndal, Hilde
    Skaane, Per
    Hofvind, Solveig
    EUROPEAN RADIOLOGY, 2019, 29 (12) : 6991 - 6999
  • [7] Impact of real-life use of artificial intelligence as support for human reading in a population-based breast cancer screening program with mammography and tomosynthesis
    Elias-Cabot, Esperanza
    Romero-Martin, Sara
    Raya-Povedano, Jose Luis
    Brehl, A. -K.
    Alvarez-Benito, Marina
    EUROPEAN RADIOLOGY, 2024, 34 (06) : 3958 - 3966
  • [8] Potential Impact of an Artificial Intelligence-based Mammography Triage Algorithm on Performance and Workload in a Population-based Screening Sample
    Watanabe, Alyssa T.
    Vu, Hoanh
    Chim, Chi Y.
    Litt, Andrew W.
    Retson, Tara
    Mayo, Ray C.
    JOURNAL OF BREAST IMAGING, 2024,
  • [9] Sensitivity of screening mammography by density and texture: a cohort study from a population-based screening program in Denmark
    von Euler-Chelpin, My
    Lillholm, Martin
    Vejborg, Ilse
    Nielsen, Mads
    Lynge, Elsebeth
    BREAST CANCER RESEARCH, 2019, 21 (01)
  • [10] Sensitivity of screening mammography by density and texture: a cohort study from a population-based screening program in Denmark
    My von Euler-Chelpin
    Martin Lillholm
    Ilse Vejborg
    Mads Nielsen
    Elsebeth Lynge
    Breast Cancer Research, 21