Comparison of the validity and reliability of two image classification systems for the assessment of mammogram quality

Cited by: 28
Authors
Moreira, C
Svoboda, K
Poulos, A
Taylor, R
Page, A
Rickard, M
Affiliations
[1] Cumberland Hosp, State Coordinat Unit, BreastScreen NSW, N Parramatta, NSW, Australia
[2] Univ Sydney, Sch Publ Hlth, Sydney, NSW 2006, Australia
[3] Univ Sydney, Sch Med Radiat Sci, Sydney, NSW 2006, Australia
DOI
10.1258/0969141053279149
Chinese Library Classification (CLC) number
R1 [Preventive medicine, hygiene];
Discipline classification code
1004 ; 120402 ;
Abstract
Objective: To compare the reliability and validity of two classification systems used to evaluate the quality of mammograms: PGMI ('perfect', 'good', 'moderate' and 'inadequate') and EAR ('excellent', 'acceptable' and 'repeat'). Setting: New South Wales (Australia) population-based mammography screening programme (BreastScreen NSW). Methods: Thirty sets of mammograms were rated by 21 radiographers and an expert panel. PGMI and EAR criteria were used to assign ratings to the medio-lateral oblique (MLO) and cranio-caudal (CC) views for each set of films. Inter-observer reliability and criterion validity (compared with expert panel ratings) were assessed using mean weighted observed agreement and kappa statistics. Results: Reliability: Kappa values for both classification systems were low (0.01-0.17). PGMI produced significantly higher values than EAR. Agreement between raters was higher using PGMI than EAR for the MLO view (77% versus 74%, P < 0.05), but was similar for the CC view. Dichotomized ratings ('acceptable' or 'needs repeating') did not improve reliability estimates. Validity: Kappa values between raters and the reference standard were low for both classification systems (0.05-0.15). Agreement between raters and the reference standard was higher using PGMI than EAR for the MLO view (74% versus 63%), but was similar for the CC view. Dichotomized ratings of the MLO view showed slightly higher observer agreement. Conclusions: Both PGMI and EAR have poor reliability and validity in evaluating mammogram quality. EAR is not a suitable alternative to PGMI, which must be improved if it is to be useful.
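The two statistics named in the Methods, weighted observed agreement and kappa, can be illustrated with a minimal sketch. It assumes a single pair of raters and linear disagreement weights; the abstract does not state which weighting scheme was used or how the 21 raters were combined, so both choices are assumptions here. The function name and the ratings below are hypothetical and are not data from the study.

```python
from collections import Counter


def weighted_agreement_and_kappa(ratings_a, ratings_b, categories):
    """Return (weighted observed agreement, weighted Cohen's kappa).

    `categories` lists the ordinal scale from best to worst, e.g. the
    four PGMI grades. Linear weights are an assumption of this sketch.
    """
    k = len(categories)
    n = len(ratings_a)
    if k < 2:
        raise ValueError("need at least two categories")
    if n == 0 or n != len(ratings_b):
        raise ValueError("rating lists must be non-empty and equal length")

    index = {c: i for i, c in enumerate(categories)}

    def weight(i, j):
        # 1 for identical grades, falling linearly to 0 at the extremes.
        return 1 - abs(i - j) / (k - 1)

    # Weighted observed agreement: mean weight over all rated films.
    p_o = sum(weight(index[a], index[b])
              for a, b in zip(ratings_a, ratings_b)) / n

    # Agreement expected by chance, from each rater's marginal counts.
    count_a = Counter(index[a] for a in ratings_a)
    count_b = Counter(index[b] for b in ratings_b)
    p_e = sum(weight(i, j) * count_a[i] * count_b[j]
              for i in range(k) for j in range(k)) / (n * n)

    if p_e == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return p_o, (p_o - p_e) / (1 - p_e)


if __name__ == "__main__":
    # Hypothetical ratings of six films by two raters (not study data).
    pgmi = ["perfect", "good", "moderate", "inadequate"]
    rater_1 = ["good", "good", "moderate", "perfect", "good", "inadequate"]
    rater_2 = ["good", "moderate", "moderate", "good", "good", "moderate"]
    agreement, kappa = weighted_agreement_and_kappa(rater_1, rater_2, pgmi)
    print(f"weighted agreement = {agreement:.2f}, weighted kappa = {kappa:.2f}")
```

Because kappa subtracts the agreement expected by chance, observed agreement in the 60-80% range can coexist with kappa values close to zero, as reported in the Results, when most films fall into one or two grades.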
Pages: 38-42
Number of pages: 5