A claims-based, machine-learning algorithm to identify patients with pulmonary arterial hypertension

Cited by: 3
Authors
Hyde, Bethany [1 ,5 ]
Paoli, Carly J. [2 ]
Panjabi, Sumeet [2 ]
Bettencourt, Katherine C. [3 ]
Lynum, Karimah S. Bell S. [3 ]
Selej, Mona [4 ]
Affiliations
[1] Janssen Business Technol Commercial Data Insights, Titusville, NJ USA
[2] Janssen Sci Affairs Inc, Titusville, NJ USA
[3] Actelion Pharmaceut US Inc, Titusville, NJ USA
[4] Janssen R&D Data Sci, South San Francisco, CA USA
[5] Janssen Business Technol Commercial Data Insights, Titusville, NJ 08560 USA
Keywords
early diagnosis; rare disease; real-world evidence; artificial intelligence; management; survival; diagnosis; time
DOI
10.1002/pul2.12237
Chinese Library Classification (CLC) number
R5 [Internal Medicine]
Subject classification code
1002; 100201
Abstract
Many patients with pulmonary arterial hypertension (PAH) experience substantial delays in diagnosis, which are associated with worse outcomes and higher costs. Tools for diagnosing PAH sooner may lead to earlier treatment, which may delay disease progression and adverse outcomes, including hospitalization and death. We developed a machine-learning (ML) algorithm to identify patients at risk for PAH earlier in their symptom journey and to distinguish them from patients with similar early symptoms who are not at risk of developing PAH. Our supervised ML model analyzed retrospective, de-identified data from the US-based Optum® Clinformatics® Data Mart claims database (January 2015 to December 2019). Propensity score-matched PAH and non-PAH (control) cohorts were established based on observed differences. Random forest models were used to classify patients as PAH or non-PAH at diagnosis and at 6 months prediagnosis. The PAH and non-PAH cohorts included 1339 and 4222 patients, respectively. At 6 months prediagnosis, the model performed well in distinguishing PAH from non-PAH patients, with an area under the receiver operating characteristic curve of 0.84, recall (sensitivity) of 0.73, and precision of 0.50. Key features distinguishing the PAH cohort from the non-PAH cohort were a longer time between first symptom and the prediagnosis model date (i.e., 6 months before diagnosis); more diagnostic and prescription claims, circulatory claims, and imaging procedures, leading to higher overall healthcare resource utilization; and more hospitalizations. Our model distinguishes between patients with and without PAH at 6 months before diagnosis and illustrates the feasibility of using routine claims data to identify, at a population level, patients who might benefit from PAH-specific screening and/or earlier specialist referral.
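To make the reported recall and precision concrete, the following minimal sketch backs out the confusion-matrix counts they would imply for a hypothetical group of 1000 true PAH patients. The group size and the function names are assumptions chosen for illustration; they are not taken from the paper, and the abstract does not say which evaluation set the metrics were computed on.

```python
def implied_confusion_counts(recall, precision, n_positive):
    """Back out the TP, FN and FP counts implied by recall and precision.

    n_positive is the number of true cases; here it is a hypothetical
    value, not a figure reported in the abstract.
    """
    true_pos = recall * n_positive        # recall = TP / (TP + FN)
    false_neg = n_positive - true_pos
    predicted_pos = true_pos / precision  # precision = TP / (TP + FP)
    false_pos = predicted_pos - true_pos
    return true_pos, false_neg, false_pos


def f1_score(recall, precision):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Recall and precision as reported at 6 months prediagnosis;
# 1000 true PAH patients is an assumed, illustrative cohort size.
tp, fn, fp = implied_confusion_counts(recall=0.73, precision=0.50, n_positive=1000)
print(f"flagged true cases: {tp:.0f}, missed cases: {fn:.0f}, false alarms: {fp:.0f}")
print(f"F1: {f1_score(0.73, 0.50):.2f}")
```

Under these assumptions, a precision of 0.50 means that about one in two flagged patients would be a true PAH case, which fits the abstract's framing of the model as a way to select patients for screening or referral rather than as a diagnostic test.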
Pages: 14
Related papers
50 in total
  • [1] A Claims-Based, Machine-Learning Algorithm to Identify Patients with Pulmonary Artery Hypertension (PAH)
    Bettencourt, K.
    Hyde, B.
    Paoli, C. J.
    Lynum, K. S.
    Panjabi, S.
    Selej, M.
    [J]. AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2022, 205
  • [2] Development Of A Claims-Based Algorithm To Identify Patients With Chronic Thromboembolic Pulmonary Hypertension
    Teal, S.
    Auger, W.
    Hughes, R. J.
    Lewis, K. S.
    Ramey, D. R.
    Fabian, J.
    [J]. AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2017, 195
  • [3] Claims-Based Algorithms for Identifying Patients With Pulmonary Hypertension: A Comparison of Decision Rules and Machine-Learning Approaches
    Ong, Mei-Sing
    Klann, Jeffrey G.
    Lin, Kueiyu Joshua
    Maron, Bradley A.
    Murphy, Shawn N.
    Natter, Marc D.
    Mandl, Kenneth D.
    [J]. JOURNAL OF THE AMERICAN HEART ASSOCIATION, 2020, 9 (19)
  • [4] Validation of claims-based algorithms for pulmonary arterial hypertension
    Papani, Ravikanth
    Sharma, Gulshan
    Agarwal, Amitesh
    Callahan, Sean J.
    Chan, Winston J.
    Kuo, Yong-Fang
    Shim, Yun M.
    Mihalek, Andrew D.
    Duarte, Alexander G.
    [J]. PULMONARY CIRCULATION, 2018, 8 (02)
  • [5] A machine-learning algorithm using claims data to identify patients with homozygous familial hypercholesterolemia
    Gu, Jing
    Epland, Matthew
    Ma, Xinshuo
    Park, Jina
    Sanchez, Robert J.
    Li, Ying
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01)
  • [6] Validation of a claims-based algorithm to identify patients with chronic thromboembolic pulmonary hypertension using electronic health record data
    Teal, Simon
    Auger, William R.
    Hughes, Rodney J.
    Ramey, Dena Rosen
    Lewis, Kelly S.
    O'Brien, Gerald
    Yaldo, Avin
    Burton, Tanya M.
    Bancroft, Tim
    Seare, Jerry
    Fabian, Joerg
    [J]. PULMONARY CIRCULATION, 2018, 9 (01)
  • [7] Development of a Claims-Based Algorithm to Identify Patients with Chronic Cough
    Bali, V.
    Weaver, J.
    Turzhitsky, V.
    Schelfhout, J.
    Paudel, M.
    Hulbert, E.
    Peterson-Brandt, J.
    Hertzberg, J.
    Kelly, N. R.
    Patel, R. H.
    [J]. AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2021, 203 (09)
  • [8] Development Of Administrative-Claims Based Algorithms To Identify Patients With Pulmonary Arterial Hypertension
    Papani, R.
    Sharma, G.
    Chan, W.
    Kuo, Y-F
    Agarwal, A.
    Duarte, A.
    [J]. AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2017, 195
  • [9] ADMINISTRATIVE CLAIMS-BASED ALGORITHM TO IDENTIFY PATIENTS WITH PRIMARY SCLEROSING CHOLANGITIS
    Kowdley, K., V
    Levy, C.
    Kachru, N.
    Kaushik, A.
    Grossman, A.
    Wong, A. C.
    Veeranki, P.
    Bowlus, C.
    [J]. VALUE IN HEALTH, 2023, 26 (06) : S179 - S179
  • [10] USE OF MACHINE-LEARNING MODELS TO IDENTIFY CLINICAL FEATURES IN PATIENTS WITH PULMONARY ARTERIAL HYPERTENSION ASSOCIATED WITH A FUTURE CLINICAL WORSENING EVENT
    Dubrock, Hilary M.
    Tobore, Tobore
    Germack, Hayley
    Tang, Xiaoqin
    Carpenter, Corinne
    Carlson, Katherine
    Doddahonnaiah, Deeksha
    Silvert, Eli
    Wagner, Tyler
    [J]. CHEST, 2023, 164 (04) : 5931A - 5932A