A Study of Genomic Data Provenance in NoSQL Document-Oriented Database Systems

被引:0
|
作者
Guimaraes, Valeria [1 ]
Hondo, Fernanda [1 ]
Almeida, Rodrigo [1 ]
Vera, Harley [1 ]
Holanda, Maristela [1 ]
Araujo, Aleteia [1 ]
Walter, Maria Emilia [1 ]
Lifschitz, Sergio [2 ]
机构
[1] Univ Brasilia, Dept Comp Sci, Brasilia, DF, Brazil
[2] Pontifical Catholic Univ Rio de Janeiro, Dept Informat, Rio De Janeiro, Brazil
关键词
provenance; workflow; genomics; bioinformatics; data modeling; NoSQL; MongoDB;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
this work considers a scientific experiment as a computational workflow. Provenance models store details of each workflow execution, including produced data, computational tools parameters and their versions, among others. This way, scientists can review details of a particular workflow execution, compare information generated among different executions and plan new ones efficiently. In the bioinformatics domain, particularly in the presence of large volumes of data, persistency of those data generated during the workflow execution is still a research challenge. In this article, we consider a study on provenance data storage for bioinformatics in a document-oriented NoSQL database system. We present data modeling issues and discuss an actual implementation into MongoDB.
引用
收藏
页码:1525 / 1531
页数:7
相关论文
共 50 条
  • [31] A document-oriented approach to the development of knowledge based systems
    Sierra, JL
    Fernández-Manjón, B
    Fernández-Valmayor, A
    Navarro, A
    [J]. CURRENT TOPICS IN ARTIFICIAL INTELLIGENCE, 2004, 3040 : 16 - 25
  • [32] A Data Placement Strategy for Distributed Document-oriented Data Warehouse
    Khalil, Abdelhak
    Belaissaoui, Mustapha
    Toufik, Fouad
    [J]. IAENG International Journal of Computer Science, 2023, 50 (04)
  • [33] Genomic Data Persistency on a NoSQL Database System
    Aniceto, Rodrigo
    Xavier, Rene
    Holanda, Maristela
    Walter, Maria Emilia
    Lifschitz, Sergio
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2014,
  • [34] Document-Oriented Data Warehouses: Complex Hierarchies and Summarizability
    Chevalier, Max
    El Malki, Mohammed
    Kopliku, Arlind
    Teste, Olivier
    Tournier, Ronan
    [J]. ADVANCES IN UBIQUITOUS NETWORKING 2, 2017, 397 : 671 - 683
  • [35] DB-SECaaS: a cloud-based protection system for document-oriented NoSQL databases
    Ghazi, Yumna
    Masood, Rahat
    Rauf, Abid
    Shibli, Muhammad Awais
    Hassan, Osman
    [J]. EURASIP JOURNAL ON INFORMATION SECURITY, 2016,
  • [36] A new approach for the construction of historical databases-NoSQL Document-oriented databases: the example of AtlantoCracies
    Diaz-Ordonez, Manuel
    Baena, Domingo Savio Rodriguez
    Yun-Casalilla, Bartolome
    [J]. DIGITAL SCHOLARSHIP IN THE HUMANITIES, 2023, 38 (03) : 1014 - 1032
  • [37] Research and Implementation of the Document-oriented Database Test Methods for GIS Applications
    Li, Jie
    Liu, Zhuo
    Xing, Chunxiao
    Zhang, Yong
    Li, Chao
    [J]. 2013 10TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA 2013), 2013, : 3 - +
  • [38] Document Oriented NoSQL Databases: An Empirical Study
    Mishra, Omji
    Lodhi, Pooja
    Mehta, Shikha
    [J]. DATA SCIENCE AND ANALYTICS, 2018, 799 : 126 - 136
  • [39] Evaluating Redundancy and Partitioning of Geospatial Data in Document-Oriented Data Warehouses
    Ferro, Marcio
    Lima, Rinaldo
    Fidalgo, Robson
    [J]. BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2019, 2019, 11708 : 221 - 235
  • [40] AStar: A modeling language for document-oriented geospatial data warehouses
    Ferro, Marcio
    Silva, Edson
    Fidalgo, Robson
    [J]. DATA & KNOWLEDGE ENGINEERING, 2023, 145