A claims-based, machine-learning algorithm to identify patients with pulmonary arterial hypertension

被引:5
|
作者
Hyde, Bethany [1 ,5 ]
Paoli, Carly J. [2 ]
Panjabi, Sumeet [2 ]
Bettencourt, Katherine C. [3 ]
Lynum, Karimah S. Bell S. [3 ]
Selej, Mona [4 ]
机构
[1] Janssen Business Technol Commercial Data Insights, Titusville, NJ USA
[2] Janssen Sci Affairs Inc, Titusville, NJ USA
[3] Actelion Pharmaceut US Inc, Titusville, NJ USA
[4] Janssen R&D Data Sci, South San Francisco, CA USA
[5] Janssen Business Technol Commercial Data Insights, Titusville, NJ 08560 USA
关键词
early diagnosis; rare disease; real-world evidence; ARTIFICIAL-INTELLIGENCE; MANAGEMENT; SURVIVAL; DIAGNOSIS; TIME;
D O I
10.1002/pul2.12237
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Many patients with pulmonary arterial hypertension (PAH) experience substantial delays in diagnosis, which is associated with worse outcomes and higher costs. Tools for diagnosing PAH sooner may lead to earlier treatment, which may delay disease progression and adverse outcomes including hospitalization and death. We developed a machine-learning (ML) algorithm to identify patients at risk for PAH earlier in their symptom journey and distinguish them from patients with similar early symptoms not at risk for developing PAH. Our supervised ML model analyzed retrospective, de-identified data from the US-based Optum (R) Clinformatics (R) Data Mart claims database (January 2015 to December 2019). Propensity score matched PAH and non-PAH (control) cohorts were established based on observed differences. Random forest models were used to classify patients as PAH or non-PAH at diagnosis and at 6 months prediagnosis. The PAH and non-PAH cohorts included 1339 and 4222 patients, respectively. At 6 months prediagnosis, the model performed well in distinguishing PAH and non-PAH patients, with area under the curve of the receiver operating characteristic of 0.84, recall (sensitivity) of 0.73, and precision of 0.50. Key features distinguishing PAH from non-PAH cohorts were a longer time between first symptom and the prediagnosis model date (i.e., 6 months before diagnosis); more diagnostic and prescription claims, circulatory claims, and imaging procedures, leading to higher overall healthcare resource utilization; and more hospitalizations. Our model distinguishes between patients with and without PAH at 6 months before diagnosis and illustrates the feasibility of using routine claims data to identify patients at a population level who might benefit from PAH-specific screening and/or earlier specialist referral.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Validation of a US health insurance claims-based algorithm to identify acute exacerbations of chronic obstructive pulmonary disease
    Mapel, Douglas W.
    Sama, Susan R.
    Roblin, Douglas W.
    Roberts, Melissa H.
    Bobbili, Priyanka J.
    Cheng, Wendy Y.
    Certa, Julia M.
    Sundaresan, Devi
    Whiting, Thomas S.
    Nguyen, Catherine
    Thompson-Leduc, Philippe
    Brown, Jennifer L.
    Van Dyke, Melissa K.
    Rothnie, Kieran J.
    Duh, Mei Sheng
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2020, 29 : 394 - 394
  • [32] Development and Validation of a Claims-Based Algorithm to Identify Fibrostenotic Crohn's Disease
    Zhang, Yiran
    Mountford, William K.
    Thompson, Jennifer Su
    Zhang, Ling
    Carlyle, Maureen
    White, John
    Walker, Valery
    Rieder, Florian
    AMERICAN JOURNAL OF GASTROENTEROLOGY, 2024, 119 (10S): : S896 - S897
  • [33] A CLAIMS-BASED ALGORITHM TO IDENTIFY INADEQUATE RESPONSE TO IMMUNOMODULATORY THERAPIES AMONG PATIENTS WITH SELECTED AUTOIMMUNE DISEASES
    Grabner, M.
    Hunter, T.
    Teng, C. C.
    Isenberg, K.
    Burge, R. T.
    Birt, J.
    Naegeli, A.
    Shan, M.
    Curtis, J. R.
    VALUE IN HEALTH, 2021, 24 : S153 - S154
  • [34] A predictive algorithm to identify ever smoking in medical claims-based epidemiologic studies
    Faust, Irene
    Warden, Mark
    Camacho-Soto, Alejandra
    Racette, Brad A.
    Nielsen, Susan Searles
    ANNALS OF EPIDEMIOLOGY, 2023, 85 : 59 - +
  • [35] Utility of claims-based data to identify critical limb ischemia patients
    Bekwelem, Wobo
    Smith, Lindsay G.
    Hirsch, Alan T.
    Oldenburg, Niki C.
    Winden, Tamara J.
    Keo, Hong H.
    Duval, Sue
    VASCULAR MEDICINE, 2012, 17 (03) : 199 - 199
  • [36] Validation of a claims-based algorithm for myocarditis/ pericarditis in pediatric patients
    Amend, Kandace L.
    Song, Jennifer
    Wong, Hui-Lee
    Shoaibi, Azadeh
    Forshee, Richard A.
    Anderson, Steven A.
    Seeger, John D.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2023, 32 : 597 - 597
  • [37] Using cognitive interviews to develop a conceptual claims-based algorithm to identify patients with neuromyelitis optica spectrum disorder
    Exuzides, A.
    Yermilov, I.
    Campos, C.
    Gibbs, S.
    Broder, M.
    Cohan, S.
    Greenberg, B.
    Levy, M.
    MULTIPLE SCLEROSIS JOURNAL, 2021, 27 (2_SUPPL) : 167 - 168
  • [38] Two component claims-based algorithm to identify heart failure ejection fraction phenotypes
    Hunt, Phillip R.
    Andersson-Sundell, Karolina
    Chen, Hung-Ta
    Khordoc, Cindy
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2022, 31 : 555 - 556
  • [39] Validity of claims-based algorithms to identify patients with test-positive influenza
    Benack, Kirk
    Nyandege, Abner
    Nonnenmacher, Edward R.
    Jan, Saira
    Setoguchi, Soko
    Gerhard, Tobias
    Strom, Brian L.
    Horton, Daniel B.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2021, 30 : 395 - 395
  • [40] A MACHINE-LEARNING ELECTRONIC HEALTH RECORD-BASED ALGORITHM TO IDENTIFY PRESENCE OF CIRRHOSIS IN PATIENTS WITH PRIMARY BILIARY CHOLANGITIS
    Lu, Mei
    Bowlus, Christopher L.
    Lindor, Keith D.
    Rodriguez, Carla V.
    Romanelli, Robert J.
    Haller, Irina V.
    Anderson, Heather
    VanWormer, Jeffrey J.
    Boscarino, Joseph
    Schmidt, Mark A.
    Daida, Yihe G.
    Sahota, Amandeep
    Vincent, Jennifer
    Li, Jia
    Trudeau, Sheri
    Rupp, Loralee B.
    Gordon, Stuart C.
    HEPATOLOGY, 2019, 70 : 786A - 787A