Tool-supported Interactive Correction and Semantic Annotation of Narrative Clinical Reports

被引:4
|
作者
Zvara, Karel [1 ,2 ]
Tomeckova, Marie [2 ]
Peleska, Jan [2 ]
Svatek, Vojtech [3 ]
Zvarova, Jana [1 ,2 ]
机构
[1] Charles Univ Prague, Fac Med 1, Inst Hyg & Epidemiol, Studnickova 7, Prague 12800 2, Czech Republic
[2] EuroMISE Mentor Assoc, Prague, Czech Republic
[3] Univ Econ, Fac Informat & Stat, Dept Informat & Knowledge Engn, Prague, Czech Republic
关键词
Narrative clinical report; tokens; structured information; classification systems; nomenclatures; electronic health record; PATIENT RECORDS; ABBREVIATIONS; DOCUMENTS;
D O I
10.3414/ME16-01-0083
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objectives: Our main objective is to design a method of, and supporting software for, interactive correction and semantic annotation of narrative clinical reports, which would allow for their easier and less erroneous processing outside their original context: first, by physicians unfamiliar with the original language (and possibly also the source specialty), and second, by tools requiring structured information, such as decision-support systems. Our additional goal is to gain insights into the process of narrative report creation, including the errors and ambiguities arising therein, and also into the process of report annotation by clinical terms. Finally, we also aim to provide a dataset of ground-truth transformations (specific for Czech as the source language), set up by expert physicians, which can be reused in the future for subsequent analytical studies and for training automated transformation procedures. Methods: A three-phase preprocessing method has been developed to support secondary use of narrative clinical reports in electronic health record. Narrative clinical reports are narrative texts of healthcare documentation often stored in electronic health records. In the first phase a narrative clinical report is tokenized. In the second phase the tokenized clinical report is normalized. The normalized clinical report is easily readable for health professionals with the knowledge of the language used in the narrative clinical report. In the third phase the normalized clinical report is enriched with extracted structured information. The final result of the third phase is a semi-structured normalized clinical report where the extracted clinical terms are matched to codebook terms. Software tools for interactive correction, expansion and semantic annotation of narrative clinical reports has been developed and the three-phase preprocessing method validated in the cardiology area. Results: The three-phase preprocessing method was validated on 49 anonymous Czech narrative clinical reports in the field of cardiology. Descriptive statistics from the database of accomplished transformations has been calculated. Two cardiologists participated in the annotation phase. The first cardiologist annotated 1500 clinical terms found in 49 narrative clinical reports to code book terms using the classification systems ICD 10, SNOMED CT, LOINC and LEKY. The second cardiologist validated annotations of the first cardiologist. The correct clinical terms and the codebook terms have been stored in a database. Conclusions: We extracted structured information from Czech narrative clinical reports by the proposed three-phase preprocessing method and linked it to electronic health records. The software tool, although generic, is tailored for Czech as the specific language of electronic health record pool under study. This will provide a potential etalon for porting this approach to dozens of other less spoken languages. Structured information can support medical decision making, quality assurance tasks and further medical research.
引用
收藏
页码:217 / 229
页数:13
相关论文
共 4 条
  • [1] A tool-supported design framework for safety critical interactive systems
    Bastide, R
    Navarre, D
    Palanque, P
    [J]. INTERACTING WITH COMPUTERS, 2003, 15 (03) : 309 - 328
  • [2] A Statistics and UMLS-based Tool for Assisted Semantic Annotation of Brazilian Clinical Documents
    Oliveira, Lucas E. S.
    Gebeluca, Caroline P.
    Silva, Adalniza M. P.
    Moro, Claudia M. C.
    Hasan, Sadid A.
    Farri, Oladimeji
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 1072 - 1078
  • [3] Fassieh®, a Semi-Automatic Visual Interactive Tool for Morphological, PoS-Tags, Phonetic, and Semantic Annotation of Arabic Text Corpora
    Attia, Mohamed
    Rashwan, Mohsen A. A.
    Al-Badrashiny, Mohamed A. S. A. A.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (05): : 916 - 925
  • [4] Effects of a computer-supported interactive tailored patient assessment tool on patient care, symptom distress, and patients' need for symptom management support: a randomized clinical trial
    Ruland, Cornelia M.
    Holte, Harald H.
    Roislien, Jo
    Heaven, Cathy
    Hamilton, Glenys A.
    Kristiansen, Jorn
    Sandbaek, Heidi
    Kvaloy, Stein O.
    Hasund, Line
    Ellison, Misoo C.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (04) : 403 - 410