Text Mining of Electronic Health Records Can Accurately Identify and Characterize Patients With Systemic Lupus Erythematosus

被引:12
|
作者
Brunekreef, Tammo E. [1 ]
Otten, Henny G. [1 ]
van den Bosch, Suzanne C. [1 ]
Hoefer, Imo E. [1 ]
van Laar, Jacob M. [1 ]
Limper, Maarten [1 ]
Haitjema, Saskia [1 ]
机构
[1] Univ Utrecht, Univ Med Ctr Utrecht, Utrecht, Netherlands
关键词
INFORMATION;
D O I
10.1002/acr2.11211
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
ObjectiveElectronic health records (EHR) are increasingly being recognized as a major source of data reusable for medical research and quality monitoring, although patient identification and assessment of symptoms (characterization) remain challenging, especially in complex diseases such as systemic lupus erythematosus (SLE). Current coding systems are unable to assess information recorded in the physician's free-text notes. This study shows that text mining can be used as a reliable alternative. MethodsIn a multidisciplinary research team of data scientists and medical experts, a text mining algorithm on 4607 patient records was developed to assess the diagnosis of 14 different immune-mediated inflammatory diseases and the presence of 18 different symptoms in the EHR. The text mining algorithm included key words in the EHR, while mining the context for exclusion phrases. The accuracy of the text mining algorithm was assessed by manually checking the EHR of 100 random patients suspected of having SLE for diagnoses and symptoms and comparing the outcome with the outcome of the text mining algorithm. ResultsAfter evaluation of 100 patient records, the text mining algorithm had a sensitivity of 96.4% and a specificity of 93.3% in assessing the presence of SLE. The algorithm detected potentially life-threatening symptoms (nephritis, pleuritis) with good sensitivity (80%-82%) and high specificity (97%-97%). ConclusionWe present a text mining algorithm that can accurately identify and characterize patients with SLE using routinely collected data from the EHR. Our study shows that using text mining, data from the EHR can be reused in research and quality control.
引用
收藏
页码:65 / 71
页数:7
相关论文
共 50 条
  • [1] Using Electronic Health Record Algorithms to Accurately Identify Patients with Systemic Lupus Erythematosus
    Barnado, April
    Denny, Joshua C.
    Crofford, Leslie J.
    ARTHRITIS & RHEUMATOLOGY, 2015, 67
  • [2] Developing Electronic Health Record Algorithms That Accurately Identify Patients With Systemic Lupus Erythematosus
    Barnado, April
    Casey, Carolyn
    Carroll, Robert J.
    Wheless, Lee
    Denny, Joshua C.
    Crofford, Leslie J.
    ARTHRITIS CARE & RESEARCH, 2017, 69 (05) : 687 - 693
  • [3] Births to Women with Systemic Lupus Erythematosus Can be Identified Accurately in the Electronic Health Record
    Blaske, Ashley
    Eudy, Amanda M.
    Oates, Jim C.
    Clowse, Megan E. B.
    Barnado, April
    ARTHRITIS & RHEUMATOLOGY, 2018, 70
  • [4] Application of Text Mining Methods to Identify Lupus Nephritis from Electronic Health Records
    Gianfrancesco, Milena
    Tamang, Suzanne
    Schmajuk, Gabriela
    Yazdany, Jinoos
    ARTHRITIS & RHEUMATOLOGY, 2020, 72
  • [5] Text Mining Electronic Health Records to Identify Hospital Adverse Events
    Gerdes, Lars Ulrik
    Hardahl, Christian
    MEDINFO 2013: PROCEEDINGS OF THE 14TH WORLD CONGRESS ON MEDICAL AND HEALTH INFORMATICS, PTS 1 AND 2, 2013, 192 : 1145 - 1145
  • [6] Leveraging Electronic Health Records to Identify and Characterize Patients with Low Vision
    Swenor, Bonnielin K.
    Guo, Xinxing
    Boland, Michael, V
    Goldstein, Judith E.
    OPHTHALMIC EPIDEMIOLOGY, 2019, 26 (02) : 132 - 139
  • [7] Evaluation of structured data from electronic health records to identify clinical classification criteria attributes for systemic lupus erythematosus
    Walunas, Theresa L.
    Ghosh, Anika S.
    Pacheco, Jennifer A.
    Mitrovic, Vesna
    Wu, Andy
    Jackson, Kathryn L.
    Schusler, Ryan
    Chung, Anh
    Erickson, Daniel
    Mancera-Cuevas, Karen
    Luo, Yuan
    Kho, Abel N.
    Ramsey-Goldman, Rosalind
    LUPUS SCIENCE & MEDICINE, 2021, 8 (01):
  • [8] Utilizing Electronic Health Records to Identify Clinical Features of ANA-Positive Patients Imparting High Risk for Progression to Systemic Lupus Erythematosus
    Markus, Havell
    Khunsriraksakul, Chachrit
    Foulke, Galen
    Carrel, Laura
    Olsen, Nancy
    Liu, Dajiang
    ARTHRITIS & RHEUMATOLOGY, 2024, 76 : 3119 - 3121
  • [9] Microarray analysis of autoantibodies can identify future Systemic Lupus Erythematosus patients
    Brunekreef, Tammo E.
    Reteig, Leon C.
    Limper, Maarten
    Haitjema, Saskia
    Dias, Jorge
    Mathsson-Alm, Linda
    van Laar, Jacob M.
    Otten, Henny G.
    HUMAN IMMUNOLOGY, 2022, 83 (06) : 509 - 514
  • [10] Using Electronic Health Record Algorithms to Accurately Identify Patients with Systemic Sclerosis
    Jamian, Lia
    Crofford, Leslie
    Barnado, April
    ARTHRITIS & RHEUMATOLOGY, 2017, 69