Validation of clinical problems using a UMLS-based semantic parser

被引:0
|
作者
Goldberg, HS [1 ]
Hsu, C [1 ]
Law, V [1 ]
Safran, C [1 ]
机构
[1] Harvard Univ, Sch Med, Beth Israel Deaconess Med Ctr, Ctr Clin Comp, Boston, MA 02115 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The capture and symbolization of data from the clinical problem list facilitates the creation of high-fidelity, patient resumes for use in aggregate analysis and decision support. We report on the development of a UMLS-based semantic parser and present a preliminary evaluation of the parser in the recognition and validation of disease-related clinical problems. We randomly sampled 20% of the 26,858 unique non-dictionary clinical problems entered into OMR (Online Medical Record) between 1989 and August, 1997, and eliminated a series of qualified problem labels, e.g. history-of, to obtain a dataset of 4122 problem labels. Within this dataset, the authors identified 2810 labels (68.2%) as referring to a broad range of disease-related processes. The parser correctly recognized and validated 1398 of the 2810 disease-related labels (49.8+/-1.9%) and correctly excluded 1220 of 1312 non-disease-related labels (93.0+/-1.4%). 812 of the 1181 match failures (68.8%) were caused by terms either absent from UMLS or modifiers not accepted by the parser; 369 match failures (31.2%) were caused by labels having patterns not recognized by the parser. By enriching the UMLS lexicon with terms commonly found in provider-entered labels, it appears that performance of the parser can be significantly enhanced over a few subsequent iterations. This initial evaluation provides a foundation from which to make principled additions to the UMLS lexicon locally for use in symbolizing clinical data; further research is necessary to determine applicability to other health care settings.
引用
收藏
页码:805 / 809
页数:5
相关论文
共 50 条
  • [1] A Statistics and UMLS-based Tool for Assisted Semantic Annotation of Brazilian Clinical Documents
    Oliveira, Lucas E. S.
    Gebeluca, Caroline P.
    Silva, Adalniza M. P.
    Moro, Claudia M. C.
    Hasan, Sadid A.
    Farri, Oladimeji
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 1072 - 1078
  • [2] UMLS-based access to CPR data
    van Mulligen, EM
    [J]. MEDINFO '98 - 9TH WORLD CONGRESS ON MEDICAL INFORMATICS, PTS 1 AND 2, 1998, 52 : 166 - 170
  • [3] Information retrieval using UMLS-based structured queries
    Fagan, LM
    Berrios, DC
    Chan, A
    Cucina, R
    Datta, A
    Shah, M
    Surendran, S
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2001, : 902 - 902
  • [4] UMLS-based access to CPR data
    van Mulligen, EM
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 1999, 53 (2-3) : 125 - 131
  • [5] UMLS-based access to CPR data
    Van Mulligen, Erik M.
    [J]. International Journal of Medical Informatics, 53 (2-3): : 125 - 131
  • [6] Automated UMLS-Based Comparison of Medical Forms
    Dugas, Martin
    Fritz, Fleur
    Krumm, Rainer
    Breil, Bernhard
    [J]. PLOS ONE, 2013, 8 (07):
  • [7] UMLS-based data augmentation for natural language processing of clinical research literature
    Kang, Tian
    Perotte, Adler
    Tang, Youlan
    Ta, Casey
    Weng, Chunhua
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 28 (04) : 812 - 823
  • [8] A UMLS-based method for integrating information databases into an Intranet
    Volot, F
    Joubert, M
    Fieschi, M
    Fieschi, D
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1997, : 495 - 499
  • [9] Using UMLS-based re-weighting terms as a query expansion strategy
    Zhu, Weizhong
    Xu, Xuheng
    Hu, Xiaohua
    Song, Il-Yeol
    Allen, Robert B.
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, 2006, : 217 - +
  • [10] Entity normalization in a Spanish medical corpus using a UMLS-based lexicon: findings and limitations
    Baez, Pablo
    Campillos-Llanos, Leonardo
    Nunez, Fredy
    Dunstan, Jocelyn
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2024,