Constructing TCM Knowledge Graph with Multi-Source Heterogeneous Data

被引:0
|
作者
Zhai, Dongsheng [1 ]
Lou, Ying [1 ]
Kan, Huimin [1 ]
He, Xijun [1 ]
Liang, Guoqiang [1 ]
Ma, Zifei [1 ]
机构
[1] College of Economics and Management, Beijing University of Technology, Beijing,100124, China
基金
中国国家自然科学基金;
关键词
Data mining - Deep learning - Graphic methods - Knowledge graph - Online systems - Patents and inventions - Semantics;
D O I
10.11925/infotech.2096-3467.2022.0893
中图分类号
学科分类号
摘要
[Objective] This paper constructs a knowledge graph for Traditional Chinese Medicine(TCM) with multi-source heterogeneous data. It supports research innovation in TCM.[Methods] First, we obtained the TCM patents from the IncoPat database. We retrieved the targets and disease data from the Traditional Chinese Medicine Systems Pharmacology Database and Analysis Platform(TCMSP) and Online Mendelian Inheritance in Man (OMIM). Then, we extracted the entity and relationship of TMC patents with the deep learning information joint extraction model. We also used string matching and dictionaries to finish the data specification and entity alignment. Third, we constructed the TCM knowledge graph based on the ontology structure we designed. Finally, we analyzed the optimization of TCM prescriptions with the frequency analysis and Apriori algorithm. [Results] The ontology structure designed in this paper contains 31 entity types and 48 semantic relationships, covering specific entities such as solutions and technical effects in TCM patents. We examined the effectiveness of the knowledge graph and the efficiency of optimizing prescriptions with the diabetic nephropathy data. [Limitations] It took us a long time to manually annotate some samples to extract textual information. [Conclusions] The knowledge graph constructed in this paper provides data support for TCM research. It also benefits prescription optimization and realizes multivariate research in TCM. © 2023 Data Analysis and Knowledge Discovery. All rights reserved.
引用
收藏
页码:146 / 158
相关论文
共 50 条
  • [1] Constructing the Power Knowledge graph by Multi-source Electricity Data
    Jiang, Guoyi
    Su, Linhua
    Liu, Haibo
    Cao, Yang
    Sun, Rui
    Diao, Fengxin
    [J]. PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON COMPUTER, INFORMATION AND TELECOMMUNICATION SYSTEMS (CITS), 2020, : 111 - 115
  • [2] Knowledge Graph Constructing and Applying for Neurosurgery Based on Multi-Source Heterogeneous Database
    Wang, Boran
    Zhou, Xuezhong
    Wei, Wei
    Wang, Rui
    Liu, Yiming
    Tian, Haoyu
    Dai, Xinyu
    [J]. Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2024, 44 (08): : 879 - 886
  • [3] A solution and practice for combining multi-source heterogeneous data to construct enterprise knowledge graph
    Yan, Chenwei
    Fang, Xinyue
    Huang, Xiaotong
    Guo, Chenyi
    Wu, Ji
    [J]. FRONTIERS IN BIG DATA, 2023, 6
  • [4] Construction and application of Chinese breast cancer knowledge graph based on multi-source heterogeneous data
    An, Bo
    [J]. MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (04) : 6776 - 6799
  • [5] Construction of Knowledge Graph of Multi-Source Heterogeneous Distribution Network Systems
    Qin, Dandan
    Zheng, Gaofeng
    Liu, Li
    Li, Longyue
    Wang, Xing
    Zhang, Shujuan
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 158 - 162
  • [6] COgKGE: A Knowledge Graph Embedding Toolkit and Benchmark for Representing Multi-source and Heterogeneous Knowledge
    Jin, Zhuoran
    Men, Tianyi
    Yuan, Hongbang
    He, Zhitao
    Sui, Dianbo
    Wang, Chenhao
    Xue, Zhipeng
    Chen, Yubo
    Zhao, Jun
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, 2022, : 166 - 173
  • [7] Modeling of Multi-Modal Knowledge Graph for Assembly Process of Wind Turbines with Multi-Source Heterogeneous Data
    Hu, Zhiqiang
    Liu, Mingfei
    Li, Qi
    Li, Xinyu
    Bao, Jinsong
    [J]. Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2024, 58 (08): : 1249 - 1263
  • [8] Urban Flow Pattern Mining Based on Multi-Source Heterogeneous Data Fusion and Knowledge Graph Embedding
    Liu, Jia
    Li, Tianrui
    Ji, Shenggong
    Xie, Peng
    Du, Shengdong
    Teng, Fei
    Zhang, Junbo
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 2133 - 2146
  • [9] Knowledge Graph Construction in Logistics Based on Multi-source Data Fusion
    Gao, Xinyu
    Zhang, Li
    Zhang, Wenping
    Chen, Haoxuan
    [J]. PROCEEDINGS OF TEPEN 2022, 2023, 129 : 792 - 802
  • [10] Multi-source Inductive Knowledge Graph Transfer
    Hao, Junheng
    Tang, Lu-An
    Sun, Yizhou
    Chen, Zhengzhang
    Chen, Haifeng
    Rhee, Junghwan
    Li, Zhichuan
    Wang, Wei
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT II, 2023, 13714 : 155 - 171