Portability of an algorithm to identify rheumatoid arthritis in electronic health records

被引:166
|
作者
Carroll, Robert J. [1 ]
Thompson, Will K. [2 ]
Eyler, Anne E. [3 ]
Mandelin, Arthur M. [4 ]
Cai, Tianxi [5 ]
Zink, Raquel M. [1 ]
Pacheco, Jennifer A. [2 ]
Boomershine, Chad S. [3 ]
Lasko, Thomas A. [1 ]
Xu, Hua [1 ]
Karlson, Elizabeth W. [6 ]
Perez, Raul G. [7 ]
Gainer, Vivian S. [7 ]
Murphy, Shawn N. [7 ,8 ]
Ruderman, Eric M. [4 ]
Pope, Richard M. [4 ]
Plenge, Robert M. [6 ,9 ,10 ]
Kho, Abel Ngo [11 ]
Liao, Katherine P. [6 ]
Denny, Joshua C. [1 ,3 ]
机构
[1] Vanderbilt Univ, Sch Med, Dept Biomed Informat, Eskind Biomed Lib, Nashville, TN 37232 USA
[2] Northwestern Univ, Ctr Genet Med, Evanston, IL USA
[3] Vanderbilt Univ, Sch Med, Dept Med, Nashville, TN 37232 USA
[4] Northwestern Univ, Feinberg Sch Med, Div Rheumatol, Dept Med, Chicago, IL 60611 USA
[5] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[6] Brigham & Womens Hosp, Dept Med, Div Rheumatol Allergy & Immunol, Boston, MA 02115 USA
[7] Partners HealthCare, Res Comp, Charlestown, MA USA
[8] Massachusetts Gen Hosp, Dept Neurol, Boston, MA 02114 USA
[9] Broad Inst, Cambridge, MA USA
[10] Brigham & Womens Hosp, Div Genet, Boston, MA 02115 USA
[11] Northwestern Univ, Feinberg Sch Med, Div Gen Internal Med, Dept Med, Chicago, IL 60611 USA
关键词
MEDICAL-SCHOOL CURRICULUM; GENOME-WIDE ASSOCIATION; EXTRACTION SYSTEM; ICD-9-CM CODES; DISEASE; RISK; IDENTIFICATION; INFORMATICS; ENTERPRISE; DISCOVERY;
D O I
10.1136/amiajnl-2011-000583
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objectives Electronic health records (EHR) can allow for the generation of large cohorts of individuals with given diseases for clinical and genomic research. A rate-limiting step is the development of electronic phenotype selection algorithms to find such cohorts. This study evaluated the portability of a published phenotype algorithm to identify rheumatoid arthritis (RA) patients from EHR records at three institutions with different EHR systems. Materials and Methods Physicians reviewed charts from three institutions to identify patients with RA. Each institution compiled attributes from various sources in the EHR, including codified data and clinical narratives, which were searched using one of two natural language processing (NLP) systems. The performance of the published model was compared with locally retrained models. Results Applying the previously published model from Partners Healthcare to datasets from Northwestern and Vanderbilt Universities, the area under the receiver operating characteristic curve was found to be 92% for Northwestern and 95% for Vanderbilt, compared with 97% at Partners. Retraining the model improved the average sensitivity at a specificity of 97% to 72% from the original 65%. Both the original logistic regression models and locally retrained models were superior to simple billing code count thresholds. Discussion These results show that a previously published algorithm for RA is portable to two external hospitals using different EHR systems, different NLP systems, and different target NLP vocabularies. Retraining the algorithm primarily increased the sensitivity at each site. Conclusion Electronic phenotype algorithms allow rapid identification of case populations in multiple sites with little retraining.
引用
收藏
页码:E162 / E169
页数:8
相关论文
共 50 条
  • [21] Machine Learning Model to Accurately Identify Rheumatoid Arthritis Patients Using Raw Electronic Health Record Data
    Gilvaz, Vinit
    Reginato, Anthony
    Dalal, Deepan
    Crough, Brad
    ARTHRITIS & RHEUMATOLOGY, 2022, 74 : 2766 - 2767
  • [22] Extraction of Rheumatoid Arthritis Disease Activity Measures From Electronic Health Records Using Automated Processing Algorithms
    Cannon, Grant W.
    Rojas, Jorge
    Reimold, Andreas
    Mikuls, Ted R.
    Bergman, Debra
    Sauer, Brian C.
    ACR OPEN RHEUMATOLOGY, 2019, 1 (10) : 632 - 639
  • [23] Validity of an automated algorithm using diagnosis and procedure codes to identify decompensated cirrhosis using electronic health records
    Lu, Mei
    Chacra, Wadih
    Rabin, David
    Rupp, Loralee B.
    Trudeau, Sheri
    Li, Jia
    Gordon, Stuart C.
    CLINICAL EPIDEMIOLOGY, 2017, 9 : 369 - 376
  • [24] Development of a phenotyping algorithm to identify patients with autoimmune disease in electronic health records for future large scale studies
    Tong, M.
    Shang, N.
    Wang, R.
    Lebwohl, B.
    Mehl, K.
    Hripcsak, G.
    Weng, C.
    Kiryluk, K.
    Petukhova, L.
    JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2018, 138 (05) : S54 - S54
  • [25] Validating an ontology-based algorithm to identify patients with Type 2 Diabetes Mellitus in Electronic Health Records
    Rahimi, Alireza
    Liaw, Siaw-Teng
    Taggart, Jane
    Ray, Pradeep
    Yu, Hairong
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2014, 83 (10) : 768 - 778
  • [26] Development and validation of an algorithm to better identify pregnant women with inherited bleeding disorders within electronic health records
    Lim, Ming Y.
    Sivaloganathan, Vasee
    Rodgers, George M.
    Simonsen, Sara E.
    THROMBOSIS RESEARCH, 2025, 246
  • [27] Validation of an algorithm to identify venous thromboembolism (VTE) in health insurance claims data among patients with rheumatoid arthritis
    Kim, Sangmi
    Martin, Carolyn
    White, John C.
    Carlyle, Maureen H.
    Gao, Shiyao
    Salinas, Claudia A.
    Bui, Bonnie
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2022, 31 : 207 - 207
  • [28] Automatic Prediction of Rheumatoid Arthritis Disease Activity from the Electronic Medical Records
    Lin, Chen
    Karlson, Elizabeth W.
    Canhao, Helena
    Miller, Timothy A.
    Dligach, Dmitriy
    Chen, Pei Jun
    Perez, Raul Natanael Guzman
    Shen, Yuanyan
    Weinblatt, Michael E.
    Shadick, Nancy A.
    Plenge, Robert M.
    Savova, Guergana K.
    PLOS ONE, 2013, 8 (08):
  • [29] A hybrid model to identify fall occurrence from electronic health records
    Fu, Sunyang
    Thorsteinsdottir, Bjoerg
    Zhang, Xin
    Lopes, Guilherme S.
    Pagali, Sandeep R.
    LeBrasseur, Nathan K.
    Wen, Andrew
    Liu, Hongfang
    Rocca, Walter A.
    Olson, Janet E.
    St Sauver, Jennifer
    Sohn, Sunghwan
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2022, 162
  • [30] USING ELECTRONIC HEALTH RECORDS TO IDENTIFY MEDICAL COMORBIDITIES OF MOOD DISORDERS
    Kember, Rachel
    Verma, Shefali
    Verma, Anurag
    Ritchie, Marylyn
    Damrauer, Scott
    Rader, Daniel
    Merikangas, Alison
    EUROPEAN NEUROPSYCHOPHARMACOLOGY, 2019, 29 : S13 - S14