Comparison of the validity and reliability of two image classification systems for the assessment of mammogram quality

被引：28

作者：

Moreira, C

Svoboda, K

Poulos, A

Taylor, R

Page, A

Rickard, M

机构：

[1] Cumberand Hosp, State Coordinat Unit, BreastScreen NSW, N Parramatta, NSW, Australia

[2] Univ Sydney, Sch Publ Hlth, Sydney, NSW 2006, Australia

[3] Univ Sydney, Sch Med Radiat Sci, Sydney, NSW 2006, Australia

来源：

JOURNAL OF MEDICAL SCREENING | 2005年 / 12卷 / 01期

关键词：

D O I：

10.1258/0969141053279149

中图分类号：

R1 [预防医学、卫生学];

学科分类号：

1004 ; 120402 ;

摘要：

Objective: To compare the reliability and validity of two classification systems used to evaluate the quality of mammograms: PGMI ('perfect', 'good', 'moderate' and 'inadequate') and EAR ('excellent', 'acceptable' and 'repeat'). Setting: New South Wales (Australia) population-based mammography screening programme (BreastScreen NSW). Methods: Thirty sets of mammograms were rated by 21 radiographers and an expert panel. PGMI and EAR criteria were used to assign ratings to the medio-lateral oblique (MLO) and cranio-caudal (CC) views for each set of films. Inter-observer reliability and criterion validity (compared with expert panel ratings) were assessed using mean weighted observed agreement and kappa statistics. Results: Reliability: Kappa values for both classification systems were low (0.01-0.17). PGMI produced significantly higher values than EAR. Agreement between raters was higher using PGMI than EAR for the MLO view (77% versus 74%, P < 0.05), but was similar for the CC view. Dichotomized, ratings ('acceptable' or 'needs repeating') did not improve reliability estimates. Validity: Kappa values between raters and the reference standard were low for both classification systems (0.05-0.15). Agreement between raters and the reference standard was higher using PGMI than EAR for the MLO view (74% versus 63%), but was similar for the CC view. Dichotomized ratings of the MLO view showed slightly higher observer agreement. Conclusions: Both PGMI and EAR have poor reliability and validity in evaluating mammogram quality. EAR is not a suitable alternative to PGMI, which must be improved if it is to be useful.

引用

页码：38 / 42

页数：5

共 50 条

[1] Assessment of the validity and reliability of three systems of medical record screening for quality of care assessment
Camacho, LAB
Rubin, HR
MEDICAL CARE, 1998, 36 (05) : 748 - 751
[2] A comparison of the validity and reliability of established bone stock loss classification systems and the proposal of a novel classification system
Parry, Michael C.
Whitehouse, Michael R.
Mehendale, Sanchit A.
Smith, Lindsay K.
Webb, Jason C.
Spencer, Robert F.
Blom, Ashley W.
HIP INTERNATIONAL, 2010, 20 (01) : 50 - 55
[3] Assessment of validity and reliability of the feedback quality instrument
Amirzadeh, Sahar
Rasouli, Davood
Dargahi, Helen
BMC RESEARCH NOTES, 2024, 17 (01)
[4] A Comprehensive Strategy for Mammogram Image Classification using Learning Classifier Systems
Siddique, Abubakar
Iqbal, Muhammad
Browne, Will N.
2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 2201 - 2208
[5] Ultraviolet photography in vitiligo: image quality, validity and reliability
Uitentuis, S. E.
Heilmann, M. N.
Verdaasdonk, R. M.
Bae, J. M.
Luiten, R. M.
Wolkerstorfer, A.
Bekkenk, M. W.
JOURNAL OF THE EUROPEAN ACADEMY OF DERMATOLOGY AND VENEREOLOGY, 2020, 34 (07) : 1590 - 1594
[6] RELIABILITY AND VALIDITY OF WOUND ASSESSMENT WITH DIGITAL IMAGE ANALYSIS
Bloemen, Monica
Boekema, Bouke
Vlig, Marcel
Middelkoop, Esther
WOUND REPAIR AND REGENERATION, 2010, 18 (06) : A78 - A78
[7] Mammography image assessment; validity and reliability of current scheme
Hill, C.
Robinson, L.
RADIOGRAPHY, 2015, 21 (04) : 304 - 307
[8] VALIDITY AND RELIABILITY ISSUES IN ALTERNATIVE PATIENT CLASSIFICATION SYSTEMS
CHARBONNEAU, C
OSTROWSKI, C
POEHNER, ET
LINDSAY, P
PANNIERS, TL
HOUGHTON, P
ALBRIGHT, J
MEDICAL CARE, 1988, 26 (08) : 800 - 813
[9] CLASSIFICATION OF IMAGE DISTORTIONS FOR IMAGE QUALITY ASSESSMENT
Alaql, Omar
Ghazinour, Kambiz
Lu, Cheng Chang
2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI), 2016, : 653 - 658
[10] Comparison of two arthroscopic pump systems based on image quality
G. J. M. Tuijthof
H. van den Boomen
R. J. van Heerwaarden
C. N. van Dijk
Knee Surgery, Sports Traumatology, Arthroscopy, 2008, 16 : 590 - 594

← 1 2 3 4 5 →