Computational framework to support integration of biomolecular and clinical data within a translational approach

被引:14
|
作者
Brandao Miyoshi, Newton Shydeo [1 ]
Pinheiro, Daniel Guariz [2 ]
Silva, Wilson Araujo, Jr. [3 ]
Felipe, Joaquim Cezar [1 ]
机构
[1] Univ Sao Paulo, Fac Philosophy Sci & Languages Ribeirao Preto, Dept Comp & Math, Sao Paulo, Brazil
[2] Univ Sao Paulo, Fac Philosophy Sci & Languages Ribeirao Preto, Dept Biol, Sao Paulo, Brazil
[3] Univ Sao Paulo, Fac Med Ribeirao Preto, Dept Genet, Sao Paulo, Brazil
来源
BMC BIOINFORMATICS | 2013年 / 14卷
基金
巴西圣保罗研究基金会;
关键词
GENOME; SYSTEM; RESOURCE; INFORMATION;
D O I
10.1186/1471-2105-14-180
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The use of the knowledge produced by sciences to promote human health is the main goal of translational medicine. To make it feasible we need computational methods to handle the large amount of information that arises from bench to bedside and to deal with its heterogeneity. A computational challenge that must be faced is to promote the integration of clinical, socio-demographic and biological data. In this effort, ontologies play an essential role as a powerful artifact for knowledge representation. Chado is a modular ontology-oriented database model that gained popularity due to its robustness and flexibility as a generic platform to store biological data; however it lacks supporting representation of clinical and socio-demographic information. Results: We have implemented an extension of Chado - the Clinical Module - to allow the representation of this kind of information. Our approach consists of a framework for data integration through the use of a common reference ontology. The design of this framework has four levels: data level, to store the data; semantic level, to integrate and standardize the data by the use of ontologies; application level, to manage clinical databases, ontologies and data integration process; and web interface level, to allow interaction between the user and the system. The clinical module was built based on the Entity-Attribute-Value (EAV) model. We also proposed a methodology to migrate data from legacy clinical databases to the integrative framework. A Chado instance was initialized using a relational database management system. The Clinical Module was implemented and the framework was loaded using data from a factual clinical research database. Clinical and demographic data as well as biomaterial data were obtained from patients with tumors of head and neck. We implemented the IPTrans tool that is a complete environment for data migration, which comprises: the construction of a model to describe the legacy clinical data, based on an ontology; the Extraction, Transformation and Load (ETL) process to extract the data from the source clinical database and load it in the Clinical Module of Chado; the development of a web tool and a Bridge Layer to adapt the web tool to Chado, as well as other applications. Conclusions: Open-source computational solutions currently available for translational science does not have a model to represent biomolecular information and also are not integrated with the existing bioinformatics tools. On the other hand, existing genomic data models do not represent clinical patient data. A framework was developed to support translational research by integrating biomolecular information coming from different "omics" technologies with patient's clinical and socio-demographic data. This framework should present some features: flexibility, compression and robustness. The experiments accomplished from a use case demonstrated that the proposed system meets requirements of flexibility and robustness, leading to the desired integration. The Clinical Module can be accessed in http://dcm.ffclrp.usp.br/caib/pg=iptrans.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Computational framework to support integration of biomolecular and clinical data within a translational approach
    Newton Shydeo Brandão Miyoshi
    Daniel Guariz Pinheiro
    Wilson Araújo Silva
    Joaquim Cezar Felipe
    [J]. BMC Bioinformatics, 14
  • [2] An Integration Framework for Clinical Decision Support Applications
    Zheng, Xiang
    Zhang, Yin-sheng
    Huang, Zhen-zhen
    Jia, Zheng
    Duan, Hui-long
    Li, Hao-min
    [J]. UBIQUITOUS COMPUTING APPLICATION AND WIRELESS SENSOR, 2015, 331 : 543 - 552
  • [3] Data Integration for Clinical Decision Support
    Jung, Yuchae
    Yoon, Yong Ik
    [J]. 2016 EIGHTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN), 2016, : 164 - 166
  • [4] A computational framework for integration of lipidomics data into metabolic pathways
    Hadadi, Noushin
    Soh, Keng Cher
    Seijo, Marianne
    Zisaki, Aikaterini
    Guan, Xueli
    Wenk, Markus R.
    Hatzimanikatis, Vassily
    [J]. METABOLIC ENGINEERING, 2014, 23 : 1 - 8
  • [5] Managing disparate data generated from translational research activities: Clinical implementation to support data integration and sharing for the IPCP
    Barrett, Jeffrey S.
    Vijayakumar, Kalpana
    Vijayakumar, Sundararajan
    Patel, Dimple
    Narayan, Mahesh
    Jayaraman, Bhuvana
    Cummings, Erin
    Douglas, Steven
    [J]. JOURNAL OF CLINICAL PHARMACOLOGY, 2008, 48 (09): : 1126 - 1126
  • [6] From bytes to bedside: Data integration and computational biology for translational cancer research
    Mathew, Jomol P.
    Taylor, Barry S.
    Bader, Gary D.
    Pyarajan, Saiju
    Antoniotti, Marco
    Chinnaiyan, Arul M.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2007, 3 (02) : 153 - 163
  • [7] An approach of tailorability within a collective activity support framework
    Betbeder, ML
    Taurisson, N
    Tchounikine, P
    [J]. ARTIFICIAL INTELLIGENCE IN EDUCATION: SHAPING THE FUTURE OF LEARNING THROUGH INTELLIGENT TECHNOLOGIES, 2003, 97 : 383 - 385
  • [8] A semantic interoperability approach to support integration of gene expression and clinical data in breast cancer
    Alonso-Calvo, Raul
    Paraiso-Medina, Sergio
    Perez-Rey, David
    Alonso-Oset, Enrique
    van Stiphout, Ruud
    Yu, Sheng
    Taylor, Marian
    Buffa, Francesca
    Fernandez-Lozano, Carlos
    Pazos, Alejandro
    Maojo, Victor
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2017, 87 : 179 - 186
  • [9] Integration of Groundwater Models within an Economical Decision Support System Framework
    Beddek, R.
    ElMahdi, A.
    Barnett, B.
    Kennedy, T.
    [J]. MODSIM 2005: INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION: ADVANCES AND APPLICATIONS FOR MANAGEMENT AND DECISION MAKING: ADVANCES AND APPLICATIONS FOR MANAGEMENT AND DECISION MAKING, 2005, : 608 - 614
  • [10] Understanding enterprise data warehouses to support clinical and translational research
    Campion, Thomas R., Jr.
    Craven, Catherine K.
    Dorr, David A.
    Knosp, Boyd M.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (09) : 1352 - 1358