SemEHR: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research

Cited by: 61
Authors
Wu, Honghan [1, 2]
Toti, Giulia [3]
Morley, Katherine I. [3, 4]
Ibrahim, Zina M. [1, 5]
Folarin, Amos [1, 5]
Jackson, Richard [1]
Kartoglu, Ismail [6]
Agrawal, Asha [7]
Stringer, Clive [7]
Gale, Darren [7]
Gorrell, Genevieve [8]
Roberts, Angus [8]
Broadbent, Matthew [9]
Stewart, Robert [9, 10]
Dobson, Richard J. B. [1, 5]
Affiliations
[1] Kings Coll London, Inst Psychiat Psychol & Neurosci, Dept Biostat & Hlth Informat, London SE5 8AF, England
[2] Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing, Jiangsu, Peoples R China
[3] Kings Coll London, Natl Addict Ctr, Inst Psychiat Psychol & Neurosci, London, England
[4] Univ Melbourne, Ctr Epidemiol & Biostat, Melbourne Sch Populat & Global Hlth, Melbourne, Vic, Australia
[5] UCL, Farr Inst Hlth Informat Res, London, England
[6] InterDigital Europe, London, England
[7] Kings Coll Hosp NHS Fdn Trust, London, England
[8] Univ Sheffield, Dept Comp Sci, Sheffield, S Yorkshire, England
[9] South London & Maudsley NHS Fdn Trust, London, England
[10] Kings Coll London, Inst Psychiat Psychol & Neurosci, Psychol Med, London, England
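Funding
EU Horizon 2020; UK Engineering and Physical Sciences Research Council; UK Medical Research Council; UK Economic and Social Research Council; Wellcome Trust (UK)
Keywords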
secondary use of EHR; information extraction; NLP; semantic search; ontology; FHIR; patient recruitment; HEALTH;
DOI
10.1093/jamia/ocx160
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Objective: Unlocking the data contained within both structured and unstructured components of electronic health records (EHRs) has the potential to provide a step change in data available for secondary research use, generation of actionable medical insights, hospital management, and trial recruitment. To achieve this, we implemented SemEHR, an open source semantic search and analytics tool for EHRs.
Methods: SemEHR implements a generic information extraction (IE) and retrieval infrastructure by identifying contextualized mentions of a wide range of biomedical concepts within EHRs. Natural language processing annotations are further assembled at the patient level and extended with EHR-specific knowledge to generate a timeline for each patient. The semantic data are serviced via ontology-based search and analytics interfaces.
Results: SemEHR has been deployed at a number of UK hospitals, including the Clinical Record Interactive Search, an anonymized replica of the EHR of the UK South London and Maudsley National Health Service Foundation Trust, one of Europe's largest providers of mental health services. In 2 Clinical Record Interactive Search-based studies, SemEHR achieved 93% (hepatitis C) and 99% (HIV) F-measure results in identifying true positive patients. At King's College Hospital in London, as part of the CogStack program (github.com/cogstack), SemEHR is being used to recruit patients into the UK Department of Health 100 000 Genomes Project (genomicsengland.co.uk). The validation study suggests that the tool can validate previously recruited cases and is very fast at searching phenotypes; time for recruitment criteria checking was reduced from days to minutes. Validated on open intensive care EHR data, Medical Information Mart for Intensive Care III, the vital signs extracted by SemEHR can achieve around 97% accuracy.
Conclusion: Results from the multiple case studies demonstrate SemEHR's efficiency: weeks or months of work can be done within hours or minutes in some cases. SemEHR provides a more comprehensive view of patients, bringing in more and unexpected insight compared to study-oriented bespoke IE systems. SemEHR is open source, available at https://github.com/CogStack/SemEHR.
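Illustrative note: the abstract describes assembling mention-level NLP annotations into a per-patient timeline and reporting a patient-level F-measure. The Python sketch below shows one way such a step could look. It is not SemEHR's actual code or API: the `Mention` schema, the function names, and the sample data are all hypothetical, and the F-measure is assumed to be the balanced F1 score.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Mention:
    """One concept mention found in a clinical note (hypothetical schema)."""
    patient_id: str
    doc_date: date
    concept: str       # ontology concept label or identifier
    negated: bool      # contextual flag, e.g. "no evidence of ..."
    historical: bool   # contextual flag: mention refers to past history


def build_timelines(mentions):
    """Group mentions by patient and sort each group chronologically."""
    timelines = defaultdict(list)
    for m in mentions:
        timelines[m.patient_id].append(m)
    for events in timelines.values():
        events.sort(key=lambda m: m.doc_date)
    return dict(timelines)


def patients_with(timelines, concept):
    """Patients with at least one affirmed (non-negated) mention of concept."""
    return {
        pid
        for pid, events in timelines.items()
        if any(m.concept == concept and not m.negated for m in events)
    }


def f_measure(predicted, actual):
    """Balanced F-measure (F1) between predicted and true patient sets."""
    tp = len(predicted & actual)
    if tp == 0:
        return 0.0
    precision = tp / len(predicted)
    recall = tp / len(actual)
    return 2 * precision * recall / (precision + recall)


# Made-up sample data, for illustration only.
mentions = [
    Mention("p1", date(2015, 3, 2), "hepatitis C", False, False),
    Mention("p1", date(2014, 1, 9), "HIV", True, False),
    Mention("p2", date(2016, 7, 1), "hepatitis C", True, False),
    Mention("p3", date(2013, 5, 5), "hepatitis C", False, True),
]
timelines = build_timelines(mentions)
found = patients_with(timelines, "hepatitis C")
```

In this toy data, patient p2 is excluded because the only mention is negated, which is the kind of distinction that contextualized mentions make possible. A real pipeline would presumably also weigh the historical flag and other context when deciding whether a patient counts as a positive case.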
Pages: 530-537
Page count: 8