Construction of cardiovascular information extraction corpus based on electronic medical records

Cited by: 1
Authors
Chang, Hongyang [1]
Zan, Hongying [1,2]
Zhang, Shuai [1]
Zhao, Bingfei [1]
Zhang, Kunli [1,2]
Affiliations
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
Keywords
cardiovascular disease; corpus construction; electronic medical record; RECOGNITION
DOI
10.3934/mbe.2023596
Chinese Library Classification (CLC) number
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
Cardiovascular disease has a significant impact on both society and patients, which makes knowledge-based research, such as work on knowledge graphs and automated question answering, necessary. However, existing research on corpus construction for cardiovascular disease is relatively limited, which has hindered further knowledge-based research on this disease. Electronic medical records contain patient data that span the entire diagnosis and treatment process and include a large amount of reliable medical information. Therefore, we collected electronic medical record data related to cardiovascular disease, combined the data with relevant work experience, and developed a standard for labeling cardiovascular electronic medical record entities and entity relations. Using a rule-based semi-automatic method to build a sentence-level labeling result dictionary, we constructed a cardiovascular electronic medical record entity and entity relation labeling corpus (CVDEMRC). The CVDEMRC contains 7,691 entities and 11,185 entity relation triples. The consistency examination yielded 93.51% for entity annotations and 84.02% for entity relation annotations, indicating good annotation consistency. The CVDEMRC constructed in this study is expected to provide a database for information extraction research related to cardiovascular diseases.
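The abstract does not spell out how the sentence-level labeling result dictionary is built or applied, so the following is only a minimal sketch of one plausible reading: labels from sentences that were already annotated by hand are stored under the sentence text and reused when the same sentence recurs, while unseen sentences are flagged for manual annotation. The function names, the data layout, and the `symptom` label are hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical data layout (an assumption, not the paper's format).
Entity = Tuple[int, int, str]   # (start offset, end offset, entity type)
Triple = Tuple[str, str, str]   # (head text, relation type, tail text)
Labels = Dict[str, list]        # {"entities": [...], "triples": [...]}


def build_label_dictionary(
    annotated: List[Tuple[str, List[Entity], List[Triple]]]
) -> Dict[str, Labels]:
    """Store the labels of each manually annotated sentence under its text."""
    dictionary: Dict[str, Labels] = {}
    for sentence, entities, triples in annotated:
        dictionary[sentence.strip()] = {
            "entities": list(entities),
            "triples": list(triples),
        }
    return dictionary


def pre_annotate(sentences: List[str], dictionary: Dict[str, Labels]) -> List[dict]:
    """Reuse stored labels for known sentences; flag the rest for manual work."""
    results = []
    for sentence in sentences:
        labels: Optional[Labels] = dictionary.get(sentence.strip())
        results.append(
            {
                "sentence": sentence,
                "labels": labels,
                "needs_manual": labels is None,
            }
        )
    return results


if __name__ == "__main__":
    # Illustrative English sentence; offsets 15..25 cover "chest pain".
    seed = [("Patient denies chest pain.", [(15, 25, "symptom")], [])]
    lookup = build_label_dictionary(seed)
    for row in pre_annotate(
        ["Patient denies chest pain.", "Blood pressure is stable."], lookup
    ):
        print(row["needs_manual"], row["sentence"])
```

Exact-match lookup is the simplest rule that fits the phrase "sentence-level"; the actual method may normalize text or apply further rules before reuse.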
Pages: 13379-13397
Number of pages: 19