MetReS: a Metabolic Reconstruction Database for Cloud Computing

Cited by: 1
Authors
Vilaplana, Jordi [1, 2]
Solsona, Francesc [1, 2]
Teixido, Ivan [1, 2]
Mateo, Jordi [1, 2]
Usie, Anabel [3]
Torres, Nestor [4, 5]
Comas, Jorge [4, 5]
Alves, Rui [4, 5]
Affiliations
[1] Univ Lleida, Dept Comp Sci, Lleida, Spain
[2] Univ Lleida, INSPIRES, Lleida, Spain
[3] CEBAL, P-7800 Beja, Portugal
[4] Univ Lleida, Dept Basic Med Sci, Lleida, Spain
[5] Univ Lleida, IRBLleida, Lleida, Spain
Keywords
INTERACTION NETWORKS;
DOI
10.1109/INCoS.2014.31
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
When designing a cloud infrastructure, it is critical to ensure beforehand that the system will be able to offer the desired level of QoS (Quality of Service). Here we focus on efficient, QoS-aware access to a biological database in cloud computing systems. Our group developed two software applications that address important biological problems, Biblio-MetReS and Homol-MetReS. Biblio-MetReS is a data-mining tool that facilitates the reconstruction of molecular networks based on automated text-mining analysis of published scientific literature. Homol-MetReS allows functional (re)annotation of proteomes, to properly identify both the individual proteins involved in the process(es) of interest and their function. Reconstruction of molecular networks is essential to understand how organisms work at the molecular level and has strong implications, for example, for finding targets to treat different types of disease. In addition, the identification and functional annotation of the individual components of the network is crucial to understand what those targets might do in the context of the organism. These two software applications access the same database of organisms with annotated genes, and their efficiency is directly related to the design of this shared database. The database is continuously growing, as hundreds to thousands of new genomes are sequenced and annotated each year. The main goal of the current work was to improve the performance of the existing database and to test whether this improvement would scale to larger data sets and to more complex types of analysis that neither application performs yet. To achieve this goal, different database architectures were designed and analyzed. We started the study with a public relational database, MySQL, which was the database server then used by these applications. Then, due to the large size of the database, Apache Hadoop, a framework used for large-scale data processing, was considered and studied as an alternative. Although Big Data systems are not always a replacement for traditional relational databases, we demonstrated through extensive tests the applicability of Apache Hadoop to a standard biological database containing some of the most frequently used types of information in molecular and systems biology. As this database continues to grow over time, the efficiency advantage of the proposed solution will increase further. Furthermore, this solution makes it possible to extract additional valuable information from the data sets that was not previously being considered.
Pages: 653-658
Number of pages: 6