KaBOB: ontology-based semantic integration of biomedical databases

被引:49
|
作者
Livingston, Kevin M. [1 ]
Bada, Michael [1 ]
Baumgartner, William A., Jr. [1 ]
Hunter, Lawrence E. [1 ]
机构
[1] Univ Colorado, Computat Biosci Program, Aurora, CO 80045 USA
来源
BMC BIOINFORMATICS | 2015年 / 16卷
关键词
Knowledge representation and reasoning; Semantic data integration; Biomedical; Databases; Open biomedical ontologies; Semantic web; OWL; RDF; COMMUNITY STANDARD; KNOWLEDGE; ACCESS; FORMAT;
D O I
10.1186/s12859-015-0559-3
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The ability to query many independent biological databases using a common ontology-based semantic model would facilitate deeper integration and more effective utilization of these diverse and rapidly growing resources. Despite ongoing work moving toward shared data formats and linked identifiers, significant problems persist in semantic data integration in order to establish shared identity and shared meaning across heterogeneous biomedical data sources. Results: We present five processes for semantic data integration that, when applied collectively, solve seven key problems. These processes include making explicit the differences between biomedical concepts and database records, aggregating sets of identifiers denoting the same biomedical concepts across data sources, and using declaratively represented forward-chaining rules to take information that is variably represented in source databases and integrating it into a consistent biomedical representation. We demonstrate these processes and solutions by presenting KaBOB (the Knowledge Base Of Biomedicine), a knowledge base of semantically integrated data from 18 prominent biomedical databases using common representations grounded in Open Biomedical Ontologies. An instance of KaBOB with data about humans and seven major model organisms can be built using on the order of 500 million RDF triples. All source code for building KaBOB is available under an open-source license. Conclusions: KaBOB is an integrated knowledge base of biomedical data representationally based in prominent, actively maintained Open Biomedical Ontologies, thus enabling queries of the underlying data in terms of biomedical concepts (e. g., genes and gene products, interactions and processes) rather than features of source specific data schemas or file formats. KaBOB resolves many of the issues that routinely plague biomedical researchers intending to work with data from multiple data sources and provides a platform for ongoing data integration and development and for formal reasoning over a wealth of integrated biomedical data.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Ontology-based Semantic Integration Scheme for Medical Image Grid
    Jin, Hai
    Sun, Aobing
    Zheng, Ran
    He, Ruhan
    Zhang, Qin
    [J]. CCGRID 2007: SEVENTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, 2007, : 127 - +
  • [22] An ontology-based approach for semantic conflict resolution in database integration
    Liu, Qiang
    Huang, Tao
    Liu, Shao-Hua
    Zhong, Hua
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2007, 22 (02) : 218 - 227
  • [23] From Relational Databases to Ontology-Based Databases
    Kamal, Hamaz
    Fouzia, Benchikha
    [J]. ICEIS: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 1, 2013, : 289 - 297
  • [24] Ontology-based semantic clustering
    Batet, Montserrat
    [J]. AI COMMUNICATIONS, 2011, 24 (03) : 291 - 292
  • [25] Ontology-based Semantic Annotation in Semantic Query
    Wu, Chengwen
    Jin, Kezhong
    Huang, Changcheng
    Liu, Wenbin
    [J]. ACC 2009: ETP/IITA WORLD CONGRESS IN APPLIED COMPUTING, COMPUTER SCIENCE, AND COMPUTER ENGINEERING, 2009, : 280 - 283
  • [26] Ontology-based agent community for information integration system in semantic web
    Nyunt, Pa Pa
    Theint, Ni Lar
    [J]. AMS 2007: FIRST ASIA INTERNATIONAL CONFERENCE ON MODELLING & SIMULATION ASIA MODELLING SYMPOSIUM, PROCEEDINGS, 2007, : 106 - +
  • [27] A framework for unifying ontology-based semantic similarity measures: A study in the biomedical domain
    Harispe, Sébastien
    Sánchez, David
    Ranwez, Sylvie
    Janaqi, Stefan
    Montmain, Jacky
    [J]. Journal of Biomedical Informatics, 2014, 48 : 38 - 53
  • [28] Ontology-Based Query Interface in a System for Semantic Integration of XML Data
    Pankowski, Tadeusz
    [J]. AGENT AND MULTI-AGENT SYSTEMS: TECHNOLOGIES AND APPLICATIONS, PROCEEDINGS, 2009, 5559 : 834 - 843
  • [29] An ontology-based architecture for implementing semantic integration of supply chain management
    Ye, Yan
    Yang, Dong
    Jiang, Zhibin
    Tong, Lixin
    [J]. INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2008, 21 (01) : 1 - 18
  • [30] Ontology-Based Data Integration for Semantic Interoperability in Air Traffic Management
    Egami, Shusaku
    Lu, Xiaodong
    Koga, Tadashi
    Sumiya, Yasuto
    [J]. 2020 IEEE 14TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2020), 2020, : 295 - 302