Development and Validation of Coding Algorithms to Identify Patients with Incident Non-Small Cell Lung Cancer in United States Healthcare Claims Data

被引:2
|
作者
Beyrer, Julie [1 ,3 ]
Nelson, David R. [1 ]
Sheffield, Kristin M. [1 ]
Huang, Yu-Jing [1 ]
Lau, Yiu-Keung [1 ]
Hincapie, Ana L. [2 ]
机构
[1] Eli Lilly & Co, Indianapolis, IN USA
[2] Univ Cincinnati James L Winkle, Coll Pharm, Cincinnati, OH USA
[3] Eli Lilly & Co, Lilly Corp Ctr, Indianapolis, IN 46285 USA
来源
CLINICAL EPIDEMIOLOGY | 2023年 / 15卷
关键词
algorithm; machine learning; medicare claims; non-small cell lung cancer; positive predictive value; sensitivity; validation;
D O I
10.2147/CLEP.S389824
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Purpose: We sought to develop and validate an incident non-small cell lung cancer (NSCLC) algorithm for United States (US) healthcare claims data. Diagnoses and procedures, but not medications, were incorporated to support longer-term relevance and reliability. Methods: Patients with newly diagnosed NSCLC per Surveillance, Epidemiology, and End Results (SEER) served as cases. Controls included newly diagnosed small-cell lung cancer and other lung cancers, and two 5% random samples for other cancer and without cancer. Algorithms derived from logistic regression and machine learning methods used the entire sample (Approach A) or started with a previous algorithm for those with lung cancer (Approach B). Sensitivity, specificity, positive predictive values (PPV), negative predictive values, and F-scores (compared for 1000 bootstrap samples) were calculated. Misclassification was evaluated by calculating the odds of selection by the algorithm among true positives and true negatives.Results: The best performing algorithm utilized neural networks (Approach B). A 10-variable point-score algorithm was derived from logistic regression (Approach B); sensitivity was 77.69% and PPV = 67.61% (F-score = 72.30%). This algorithm was less sensitive for patients >= 80 years old, with Medicare follow-up time <3 months, or missing SEER data on stage, laterality, or site and less specific for patients with SEER primary site of main bronchus, SEER summary stage 2000 regional by direct extension only, or pre-index chronic pulmonary disease.Conclusion: Our study developed and validated a practical, 10-variable, point-based algorithm for identifying incident NSCLC cases in a US claims database based on a previously validated incident lung cancer algorithm.
引用
收藏
页码:73 / 89
页数:17
相关论文
共 50 条
  • [1] Development and validation of coding algorithms to identify patients with incident lung cancer in United States healthcare claims data
    Beyrer, Julie
    Nelson, David R.
    Sheffield, Kristin M.
    Huang, Yu-Jing
    Ellington, Tim
    Hincapie, Ana L.
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2020, 29 (11) : 1465 - 1479
  • [2] Validation of an Updated Algorithm to Identify Patients With Incident Non-Small Cell Lung Cancer in Administrative Claims Databases
    Patel, Sandip Pravin
    Wang, Rongrong
    Zhou, Summera Qiheng
    Sheinson, Daniel
    Johnson, Ann
    Lee, Janet Shin
    [J]. JCO CLINICAL CANCER INFORMATICS, 2024, 8 : e2300165
  • [3] Validation of Claims Algorithms for Progression to Metastatic Cancer in Patients with Breast, Non-Small Cell Lung, and Colorectal Cancer
    Simeone, Jason C.
    Nordstrom, Beth L.
    Malley, Karen G.
    Fraeman, Kathy H.
    Klippel, Zandra
    Durst, Mark
    Page, John H.
    Xu, Hairong
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2015, 24 : 511 - 511
  • [4] Validation of Claims Algorithms for Progression to Metastatic Cancer in Patients with Breast, Non-small Cell Lung, and Colorectal Cancer
    Nordstrom, Beth L.
    Simeone, Jason C.
    Malley, Karen G.
    Freeman, Kathy H.
    Klippel, Zandra
    Durst, Mark
    Page, John H.
    Xu, Hairong
    [J]. FRONTIERS IN ONCOLOGY, 2016, 6
  • [5] Validation of an updated algorithm to identify non-small cell lung cancer (NSCLC) patients in administrative claims databases
    Patel, Sandip Pravin
    Wang, Rongrong
    Zhou, Summera Qiheng
    Sheinson, Daniel
    Johnson, Ann
    Lee, Janet
    [J]. CANCER RESEARCH, 2023, 83 (07)
  • [6] Drug wastage and costs to the healthcare system in the care of patients with non-small cell lung cancer in the United States
    Hess, Lisa M.
    Cui, Zhanglin Lin
    Li, Xiaohong Ivy
    Oton, Ana B.
    Shortenhaus, Scott
    Watson, Ian A.
    [J]. JOURNAL OF MEDICAL ECONOMICS, 2018, 21 (08) : 755 - 761
  • [7] ASSESSING THE HEALTHCARE COST OF DIABETIC NON-SMALL CELL LUNG CANCER PATIENTS COMPARED TO NON-SMALL CELL LUNG CANCER PATIENTS
    Ruban, C.
    Blanchette, C. M.
    Howden, R.
    Kowalkowski, M.
    Marino, J.
    Saunders, W.
    [J]. VALUE IN HEALTH, 2017, 20 (09) : A428 - A428
  • [8] Trends and characteristics of young non-small cell lung cancer patients in the United States
    Thomas, Anish
    Chen, Yuanbin
    Yu, Tinghui
    Jakopovic, Marko
    Giaccone, Giuseppe
    [J]. FRONTIERS IN ONCOLOGY, 2015, 5
  • [9] Development of an algorithm to identify small cell lung cancer patients in claims databases
    Danese, Mark D.
    Balasubramanian, Akhila
    Bebb, D. Gwyn
    Pundole, Xerxes
    [J]. FRONTIERS IN ONCOLOGY, 2024, 14
  • [10] Mutational landscape of non-small cell lung cancer in the United States
    Elsayed, M. A.
    Allam, A. R.
    KhalafAllah, M. T.
    Aboshady, O.
    Gouda, M. A.
    [J]. ANNALS OF ONCOLOGY, 2022, 33 : S1604 - S1605