Structured Approach for Evaluating Strategies for Cancer Ascertainment Using Large-Scale Electronic Health Record Data

Cited by: 27
Authors
Earles, Ashley [1]
Liu, Lin [2]
Bustamante, Ranier [1]
Coke, Pat [4]
Lynch, Julie [5]
Messer, Karen [2]
Martinez, Maria Elena [2]
Murphy, James D. [2]
Williams, Christina D. [7, 8]
Fisher, Deborah A. [7, 8]
Provenzale, Dawn T. [7, 8]
Gawron, Andrew J. [5, 6]
Kaltenbach, Tonya [2, 3]
Gupta, Samir [1, 2]
Affiliations
[1] VA San Diego Healthcare Syst, San Diego, CA USA
[2] Univ Calif San Diego, San Diego, CA 92103 USA
[3] San Francisco VA Med Ctr, San Francisco, CA USA
[4] Cent Arkansas Vet Healthcare Syst, Little Rock, AR USA
[5] VA Salt Lake City Hlth Care Syst, Salt Lake City, UT USA
[6] Univ Utah, Salt Lake City, UT USA
[7] Durham VA Med Ctr, Durham, NC USA
[8] Duke Univ, Durham, NC USA
DOI
10.1200/CCI.17.00072
Chinese Library Classification number
R73 [Oncology]
Discipline classification code
100214
Abstract
Purpose: Cancer ascertainment using large-scale electronic health records is a challenge. Our aim was to propose and apply a structured approach for evaluating multiple candidate approaches for cancer ascertainment using colorectal cancer (CRC) ascertainment within the US Department of Veterans Affairs (VA) as a use case.

Methods: The proposed approach for evaluating cancer ascertainment strategies includes assessment of individual strategy performance, comparison of agreement across strategies, and review of discordant diagnoses. We applied this approach to compare three strategies for CRC ascertainment within the VA: administrative claims data consisting of International Classification of Diseases, Ninth Revision (ICD9) diagnosis codes; the VA Central Cancer Registry (VACCR); and the newly accessible Oncology Domain, consisting of cases abstracted by local cancer registrars. The study sample consisted of 1,839,043 veterans with index colonoscopy performed from 1999 to 2014. Strategy-specific performance was estimated based on manual record review of 100 candidate CRC cases and 100 colonoscopy controls. Strategies were further compared using Cohen's κ and focused review of discordant CRC diagnoses.

Results: A total of 92,197 individuals met at least one CRC definition. All three strategies had high sensitivity and specificity for incident CRC. However, the ICD9-based strategy demonstrated poor positive predictive value (58%). VACCR and Oncology Domain had almost perfect agreement with each other (κ, 0.87) but only moderate agreement with ICD9-based diagnoses (κ, 0.51 and 0.57, respectively). Among discordant cases reviewed, 15% of ICD9-positive but VACCR- or Oncology Domain-negative cases had incident CRC.

Conclusion: Evaluating novel strategies for identifying cancer requires a structured approach, including validation against manual record review, agreement among candidate strategies, and focused review of discordant findings. Without careful assessment of ascertainment methods, analyses may be subject to bias and limited in clinical impact.

© 2018 by American Society of Clinical Oncology
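
The three evaluation steps named in the abstract (strategy-specific performance against manual review, pairwise agreement via Cohen's κ, and identification of discordant cases for focused review) can be sketched in Python. This is only a minimal illustration under stated assumptions: the function names and the eight-record toy data are hypothetical and are not taken from the study. Real inputs would be per-patient CRC flags from each strategy plus the manual record review result.

```python
from typing import Dict, List, Sequence

def performance(predicted: Sequence[bool], reference: Sequence[bool]) -> Dict[str, float]:
    """Sensitivity, specificity, and PPV of one strategy against a reference standard.

    Assumes every denominator is non-zero (at least one reference positive,
    one reference negative, and one predicted positive).
    """
    pairs = list(zip(predicted, reference))
    tp = sum(1 for p, r in pairs if p and r)
    fp = sum(1 for p, r in pairs if p and not r)
    fn = sum(1 for p, r in pairs if not p and r)
    tn = sum(1 for p, r in pairs if not p and not r)
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
    }

def cohen_kappa(a: Sequence[bool], b: Sequence[bool]) -> float:
    """Cohen's kappa for two binary classifications of the same records."""
    n = len(a)
    observed = sum(1 for x, y in zip(a, b) if x == y) / n
    rate_a = sum(1 for x in a if x) / n
    rate_b = sum(1 for y in b if y) / n
    # Agreement expected by chance: both positive or both negative.
    expected = rate_a * rate_b + (1 - rate_a) * (1 - rate_b)
    return (observed - expected) / (1 - expected)

def discordant(a: Sequence[bool], b: Sequence[bool]) -> List[int]:
    """Indices of records on which the two strategies disagree."""
    return [i for i, (x, y) in enumerate(zip(a, b)) if x != y]

# Hypothetical toy data (NOT study data): True means "flagged as CRC".
reference = [True, True, True, False, False, False, False, False]   # manual review
strategy_a = [True, True, True, True, True, False, False, False]    # over-calls cases
strategy_b = [True, True, False, False, False, False, False, False] # misses one case

perf_a = performance(strategy_a, reference)
perf_b = performance(strategy_b, reference)
kappa_ab = cohen_kappa(strategy_a, strategy_b)
to_review = discordant(strategy_a, strategy_b)
```

On this toy data, `strategy_a` has perfect sensitivity but a PPV of 0.6, `kappa_ab` is about 0.33, and `to_review` lists records 2, 3, and 4. It shows in miniature how a strategy can look sensitive yet still yield many false positives, and why the discordant records are the ones worth reviewing by hand.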
Pages: 1-12
Number of pages: 12