BioGraph: Data Model for Linking and Querying Diverse Biological Metadata

Cited by: 3
Authors
Veljkovic, Aleksandar N. [1]
Orlov, Yuriy L. [2, 3, 4]
Mitic, Nenad S. [1]
Affiliations
[1] Univ Belgrade, Fac Math, Studentski Trg 16, Belgrade 11158, Serbia
[2] IM Sechenov First Moscow State Med Univ, Sechenov Univ, Digital Hlth Inst, Minist Hlth Russian Federat, Moscow 119991, Russia
[3] Inst Cytol & Genet SB RAS, Novosibirsk 630090, Russia
[4] Peoples Friendship Univ Russia, Agrarian & Technol Inst, Moscow 117198, Russia
Funding
Russian Science Foundation
Keywords
gene network; associations with the diseases; connecting biological data; BioGraph; metadata; query data properties;
DOI
10.3390/ijms24086954
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Studying the association of gene function, diseases, and regulatory gene network reconstruction demands data compatibility. Data from different databases follow distinct schemas and are accessible in heterogeneous ways. Although the experiments differ, data may still be related to the same biological entities. Some entities may not be strictly biological, such as geolocations of habitats or paper references, but they provide a broader context for other entities. The same entities from different datasets can share similar properties, which may or may not be found within other datasets. Joint, simultaneous data fetching from multiple data sources is complicated for the end-user or, in many cases, unsupported and inefficient due to differences in data structures and ways of accessing the data. We propose BioGraph, a new model that enables connecting and retrieving information from linked biological data originating from diverse datasets. We have tested the model on metadata collected from five diverse public datasets and successfully constructed a knowledge graph containing more than 17 million model objects, of which 2.5 million are individual biological entity objects. The model enables the selection of complex patterns and retrieval of matched results that can be discovered only by joining the data from multiple sources.
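The idea of retrieving results that exist only when several sources are joined can be illustrated with a small sketch. This is not the BioGraph model or its API: the `Graph` class, the relation names, and the `paper:A`-style identifiers are all hypothetical, chosen only to show two separate record sets being linked through a shared gene entity and then queried with a disease-gene-paper pattern.

```python
from collections import defaultdict

# Hypothetical records from two unrelated sources (illustrative only).
gene_disease = [("TP53", "Li-Fraumeni syndrome"), ("BRCA1", "breast cancer")]
gene_paper = [("TP53", "paper:A"), ("BRCA1", "paper:B"), ("TP53", "paper:C")]


class Graph:
    """Toy labelled graph; an assumption, not the structure used by BioGraph."""

    def __init__(self):
        # (source node, relation) -> set of target nodes
        self.edges = defaultdict(set)

    def link(self, source, relation, target):
        self.edges[(source, relation)].add(target)

    def targets(self, source, relation):
        # .get avoids inserting new keys into the defaultdict while querying
        return sorted(self.edges.get((source, relation), ()))


graph = Graph()
# Both sources refer to the same gene entity, which is what links them.
for gene, disease in gene_disease:
    graph.link(("gene", gene), "associated_with", ("disease", disease))
for gene, paper in gene_paper:
    graph.link(("gene", gene), "mentioned_in", ("paper", paper))


def papers_for_disease(graph, disease):
    """Match the pattern disease <- gene -> paper across both sources."""
    result = set()
    for (source, relation), targets in graph.edges.items():
        if relation == "associated_with" and ("disease", disease) in targets:
            result.update(name for _, name in graph.targets(source, "mentioned_in"))
    return sorted(result)


print(papers_for_disease(graph, "Li-Fraumeni syndrome"))
```

Neither input list alone can answer the query: the disease names live in one source and the paper references in the other, and only the shared gene node connects them.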
Pages: 16