Text Mining of Electronic Health Records Can Accurately Identify and Characterize Patients With Systemic Lupus Erythematosus

被引:12
|
作者
Brunekreef, Tammo E. [1 ]
Otten, Henny G. [1 ]
van den Bosch, Suzanne C. [1 ]
Hoefer, Imo E. [1 ]
van Laar, Jacob M. [1 ]
Limper, Maarten [1 ]
Haitjema, Saskia [1 ]
机构
[1] Univ Utrecht, Univ Med Ctr Utrecht, Utrecht, Netherlands
关键词
INFORMATION;
D O I
10.1002/acr2.11211
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
ObjectiveElectronic health records (EHR) are increasingly being recognized as a major source of data reusable for medical research and quality monitoring, although patient identification and assessment of symptoms (characterization) remain challenging, especially in complex diseases such as systemic lupus erythematosus (SLE). Current coding systems are unable to assess information recorded in the physician's free-text notes. This study shows that text mining can be used as a reliable alternative. MethodsIn a multidisciplinary research team of data scientists and medical experts, a text mining algorithm on 4607 patient records was developed to assess the diagnosis of 14 different immune-mediated inflammatory diseases and the presence of 18 different symptoms in the EHR. The text mining algorithm included key words in the EHR, while mining the context for exclusion phrases. The accuracy of the text mining algorithm was assessed by manually checking the EHR of 100 random patients suspected of having SLE for diagnoses and symptoms and comparing the outcome with the outcome of the text mining algorithm. ResultsAfter evaluation of 100 patient records, the text mining algorithm had a sensitivity of 96.4% and a specificity of 93.3% in assessing the presence of SLE. The algorithm detected potentially life-threatening symptoms (nephritis, pleuritis) with good sensitivity (80%-82%) and high specificity (97%-97%). ConclusionWe present a text mining algorithm that can accurately identify and characterize patients with SLE using routinely collected data from the EHR. Our study shows that using text mining, data from the EHR can be reused in research and quality control.
引用
收藏
页码:65 / 71
页数:7
相关论文
共 50 条
  • [31] Trends of Pregnancy Outcomes in a Large Electronic Health Record Cohort of Systemic Lupus Erythematosus Patients
    Barnado, April
    Camai, Alex
    Wheless, Lee
    ARTHRITIS & RHEUMATOLOGY, 2020, 72
  • [32] Natural Language Processing to Identify Lupus Nephritis Phenotype in Electronic Health Records
    Deng, Yu
    Pacheco, Jennifer
    Chung, Anh
    Mao, Chengsheng
    Smith, Joshua
    Zhao, Juan
    Wei, Wei-Qi
    Barnado, April
    Weng, Chunhua
    Liu, Cong
    Gordon, Adam
    Yu, Jingzhi
    Tedla, Yacob
    Kho, Abel
    Ramsey-Goldman, Rosalind
    Walunas, Theresa
    Luo, Yuan
    ARTHRITIS & RHEUMATOLOGY, 2021, 73 : 666 - 667
  • [33] Natural language processing to identify lupus nephritis phenotype in electronic health records
    Deng, Yu
    Pacheco, Jennifer A.
    Ghosh, Anika
    Chung, Anh
    Mao, Chengsheng
    Smith, Joshua C.
    Zhao, Juan
    Wei, Wei-Qi
    Barnado, April
    Dorn, Chad
    Weng, Chunhua
    Liu, Cong
    Cordon, Adam
    Yu, Jingzhi
    Tedla, Yacob
    Kho, Abel
    Ramsey-Goldman, Rosalind
    Walunas, Theresa
    Luo, Yuan
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 22 (SUPPL 2)
  • [34] Natural language processing to identify lupus nephritis phenotype in electronic health records
    Yu Deng
    Jennifer A. Pacheco
    Anika Ghosh
    Anh Chung
    Chengsheng Mao
    Joshua C. Smith
    Juan Zhao
    Wei-Qi Wei
    April Barnado
    Chad Dorn
    Chunhua Weng
    Cong Liu
    Adam Cordon
    Jingzhi Yu
    Yacob Tedla
    Abel Kho
    Rosalind Ramsey-Goldman
    Theresa Walunas
    Yuan Luo
    BMC Medical Informatics and Decision Making, 22
  • [35] Citywide quality of health information system through text mining of electronic health records
    Anastasia A. Funkner
    Michil P. Egorov
    Sergey A. Fokin
    Gennady M. Orlov
    Sergey V. Kovalchuk
    Applied Network Science, 6
  • [36] Citywide quality of health information system through text mining of electronic health records
    Funkner, Anastasia A.
    Egorov, Michil P.
    Fokin, Sergey A.
    Orlov, Gennady M.
    Kovalchuk, Sergey, V
    APPLIED NETWORK SCIENCE, 2021, 6 (01)
  • [37] Rule-based and machine learning algorithms identify patients with systemic sclerosis accurately in the electronic health record
    Jamian, Lia
    Wheless, Lee
    Crofford, Leslie J.
    Barnado, April
    ARTHRITIS RESEARCH & THERAPY, 2019, 21 (01)
  • [38] Rule-based and machine learning algorithms identify patients with systemic sclerosis accurately in the electronic health record
    Lia Jamian
    Lee Wheless
    Leslie J. Crofford
    April Barnado
    Arthritis Research & Therapy, 21
  • [39] Hydroxychloroquine Levels Identify Four Distinct Subsets of Systemic Lupus Erythematosus Patients
    Petri, Michelle
    Fang, Hong
    Clarke, William
    ARTHRITIS AND RHEUMATISM, 2013, 65 : S770 - S770
  • [40] Development and Validation of an Algorithm to Accurately Identify Atopic Eczema Patients in Primary Care Electronic Health Records from the UK
    Abuabara, Katrina
    Magyari, Alexa M.
    Hoffstad, Ole
    Jabbar-Lopez, Zarif K.
    Smeeth, Liam
    Williams, Hywel C.
    Gelfand, Joel M.
    Margolis, David J.
    Langan, Sinead M.
    JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2017, 137 (08) : 1655 - 1662