Semantic Data Integration Techniques for Transforming Big Biomedical Data into Actionable Knowledge

被引:4
|
作者
Vidal, Maria-Esther [1 ]
Jozashoori, Samaneh [2 ]
Sakor, Ahmad [2 ]
机构
[1] TIB Leibniz Informat Ctr Sci & Technol, Hannover, Germany
[2] Leibniz Univ Hannover, L3S Inst, Hannover, Germany
关键词
D O I
10.1109/CBMS.2019.00116
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
FAIR principles and the Open Data initiatives have motivated the publication of large volumes of data. Specifically, in the biomedical domain, the size of the data has increased exponentially in the last decade, and with the advances in the technologies to collect and generate data, a faster growth rate is expected for the next years. The available collections of data are characterized by the dominant dimensions of big data, i.e., they are not only large in volume, but they can be also heterogeneous and present quality issues. These data complexity problems impact on the typical tasks of data management, and particularly, in the task of integrating big biomedical data sources. We tackle the problem of big data integration and present a knowledge-driven framework able to extract and integrate data collected from structured and unstructured data sources. The proposed framework resorts to Natural Language Processing techniques to extract knowledge from unstructured data and short text. Furthermore, ontologies and controlled vocabularies, e.g., UMLS, are utilized to annotate the extracted entities and relations with terms from the ontology or controlled vocabulary. The annotated data is integrated into a knowledge graph. A unified schema is used to describe the meaning of the integrated data as well as the main properties and relations. As proof of concept, we show the results of applying the proposed framework to integrate clinical records from lung cancer patients with data extracted from open data sources like Drugbank and PubMed. The created knowledge graph enables the discovery of interactions between drugs in the treatments prescribed to lung cancer patients.
引用
收藏
页码:563 / 566
页数:4
相关论文
共 50 条
  • [1] A SEMANTIC DATA MODEL FOR INTEGRATION OF DATA AND KNOWLEDGE
    HOUTSMA, MAW
    APERS, PMG
    [J]. COMPUTING AND INFORMATION, 1989, : 327 - 333
  • [2] From Data to Actionable Knowledge: Big Data Challenges in the Web of Things
    Barnaghi, Payam
    Sheth, Amit
    Henson, Cory
    [J]. IEEE INTELLIGENT SYSTEMS, 2013, 28 (06) : 6 - 11
  • [3] Transforming biomedical and structural data into information and knowledge
    Zimmerman, M.
    Cooper, D.
    Cymborowski, M.
    Konina, K.
    Grabowski, M.
    MacLean, E.
    Minor, W.
    [J]. ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 2014, 70 : C492 - C492
  • [4] BioFactory: Semantic Integration of Biomedical Data and Applications
    Wang, Charles C. N.
    Hecht, David A.
    Sheu, Phillip
    Yamaguchi, Hiroshi
    Tsai, Jeffrey J. P.
    [J]. FIRST IEEE INTERNATIONAL CONFERENCE ON INTERNET OPERATING SYSTEMS AND NEW APPLICATIONS (ICIOS 2012), 2012, : 17 - 21
  • [5] Dissecting the Complex Architecture of Schizophrenia: Transforming Data Into Actionable Knowledge
    Zwir, Igor
    Arnedo, Javier
    Mesa, Alberto
    Mamah, Daniel
    de Erausquin, Gabriel A.
    Cloninger, C. R.
    [J]. NEUROPSYCHOPHARMACOLOGY, 2016, 41 : S561 - S562
  • [6] From biomedical big data to knowledge and action
    Mei Chen
    [J]. Machine Vision and Applications, 2018, 29 : 1209 - 1210
  • [7] From biomedical big data to knowledge and action
    Chen, Mei
    [J]. MACHINE VISION AND APPLICATIONS, 2018, 29 (08) : 1209 - 1210
  • [8] Big Data, Actionable Information, Scientific Knowledge and the Goal of Control
    Gray, Chris Hables
    [J]. TEKNOKULTURA: REVISTA DE CULTURA DIGITAL Y MOVIMIENTOS SOCIALES, 2014, 11 (03): : 529 - 554
  • [9] Transforming big data into knowledge: the role of knowledge management practice
    Chierici, Roberto
    Mazzucchelli, Alice
    Garcia-Perez, Alexeis
    Vrontis, Demetris
    [J]. MANAGEMENT DECISION, 2019, 57 (08) : 1902 - 1922
  • [10] Ocean knowledge representation through integration of big data employing semantic web technologies
    Anitha Velu
    Menakadevi Thangavelu
    [J]. Earth Science Informatics, 2022, 15 : 1563 - 1585