A performance evaluation of NoSQL databases to manage proteomics data

被引:2
|
作者
Messaoudi, Chaimaa [1 ]
Fissoune, Rachida [1 ]
Badir, Hassan [1 ]
机构
[1] Abdelmalek Essaadi Univ, Natl Sch Appl Sci, BP 1818, Tangier 90000, Morocco
关键词
proteomics; MongoDB; multi-model; Neo4j; OrientDB; polyglot persistence; GRAPH DATABASES; BIOINFORMATICS; MODEL; BIOLOGY; CLOUD; SQL;
D O I
10.1504/IJDMB.2018.095556
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
NoSQL databases have recently been introduced as alternatives to traditional relational database management systems because of their capabilities in terms of storing data and query retrieval. Biological datasets can be modelled using various models, for example, graphs (protein-protein interaction) or documents (protein sequence information). Applications that involve these two data models can be combined into a single unique architecture either using the polyglot persistence approach or using a multi-model approach. This paper evaluates the performance of a polyglot persistence approach versus a multi-model store. The polyglot persistence approach combines a graph-oriented database (Neo4j) and a document-oriented database (MongoDB); and the multi-model system is OrientDB. The comparisons are made following these aspects: importation, single operations, and query performance. OrientDB demonstrates a potential to manage large proteomics dataset for query retrieval and graph importation. However, when updating records, OrientDB was found to be slow. There is no single store that performs better in all cases.
引用
收藏
页码:70 / 89
页数:20
相关论文
共 50 条
  • [1] Performance Analysis in NoSQL Databases, Relational Databases and NoSQL Databases as a Service in the Cloud
    Marrero, Luciano
    Olsowy, Verena
    Tesone, Fernando
    Thomas, Pablo
    Delia, Lisandro
    Pesado, Patricia
    COMPUTER SCIENCE - CACIC 2020, 2021, 1409 : 157 - 170
  • [2] A proposed performance evaluation of NoSQL databases in the field of IoT
    Al-Sakran, Aya
    Qattous, Hazem
    Hijjawi, Mohammad
    2018 8TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (CSIT), 2018, : 32 - 37
  • [3] QUERYING DATA IN NOSQL DATABASES
    Babic, Andrea
    Jaksic, Danijela
    Poscic, Patrizia
    ZBORNIK VELEUCILISTA U RIJECI-JOURNAL OF THE POLYTECHNICS OF RIJEKA, 2019, 7 (01): : 257 - 270
  • [4] Performance Analysis of DML Operations on NoSQL Databases for Streaming Data
    Magdum, Junaid
    Barhate, Rahul
    2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
  • [5] Experimental Performance Analysis of Data Consistency Levels in NoSQL Databases
    Ferreira, Saulo
    Mendonca, Julio
    Andrade, Ermeson
    SOFTWARE-PRACTICE & EXPERIENCE, 2025,
  • [6] Change Data Capture in NoSQL Databases: A Functional and Performance Comparison
    Schmidt, Felipe Mathias
    Geyer, Claudio
    Schaeffer-Filho, Alberto
    Dessloch, Stefan
    Hu, Yong
    2015 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATION (ISCC), 2015, : 562 - 567
  • [7] Performance Evaluation of NoSQL Document Databases: Couchbase, CouchDB, and MongoDB
    Carvalho, Ines
    Sa, Filipe
    Bernardino, Jorge
    ALGORITHMS, 2023, 16 (02)
  • [8] NoSQL Databases for RDF: An Empirical Evaluation
    Cudre-Mauroux, Philippe
    Enchev, Iliya
    Fundatureanu, Sever
    Groth, Paul
    Haque, Albert
    Harth, Andreas
    Keppmann, Felix Leif
    Miranker, Daniel P.
    Sequeda, Juan F.
    Wylot, Marcin
    SEMANTIC WEB - ISWC 2013, PART II, 2013, 8219 : 310 - 325
  • [9] NoSQL Databases for Large Volumes of Data
    Telnarova, Zdenka
    Zacek, Martin
    Smolka, Pavel
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING 2019 (ICCMSE-2019), 2019, 2186
  • [10] NoSQL Databases for Big Data Management
    Gaspar, Drazena
    Mabic, Mirela
    CENTRAL EUROPEAN CONFERENCE ON INFORMATION AND INTELLIGENT SYSTEMS (CECIIS 2016), 2016, : 3 - 10