Experiences in integrated data and research object publishing using GigaDB

被引:12
|
作者
Edmunds S.C. [1 ]
Li P. [1 ]
Hunter C.I. [1 ]
Xiao S.Z. [1 ]
Davidson R.L. [1 ,2 ]
Nogoy N. [1 ]
Goodman L. [1 ]
机构
[1] GigaScience, BGI-Hong Kong Co, Ltd, 16 Dai Fu Street, Tai Po Industrial Estate, NT, Hong Kong SAR
[2] Office for National Statistics, Duffryn, Government Buildings, Cardiff Rd, Newport
基金
欧盟地平线“2020”;
关键词
Computational biology; Data citation; Data publishing; Open-data; Reproducibility;
D O I
10.1007/s00799-016-0174-6
中图分类号
学科分类号
摘要
In the era of computation and data-driven research, traditional methods of disseminating research are no longer fit-for-purpose. New approaches for disseminating data, methods and results are required to maximize knowledge discovery. The “long tail” of small, unstructured datasets is well catered for by a number of general-purpose repositories, but there has been less support for “big data”. Outlined here are our experiences in attempting to tackle the gaps in publishing large-scale, computationally intensive research. GigaScience is an open-access, open-data journal aiming to revolutionize large-scale biological data dissemination, organization and re-use. Through use of the data handling infrastructure of the genomics centre BGI, GigaScience links standard manuscript publication with an integrated database (GigaDB) that hosts all associated data, and provides additional data analysis tools and computing resources. Furthermore, the supporting workflows and methods are also integrated to make published articles more transparent and open. GigaDB has released many new and previously unpublished datasets and data types, including as urgently needed data to tackle infectious disease outbreaks, cancer and the growing food crisis. Other “executable” research objects, such as workflows, virtual machines and software from several GigaScience articles have been archived and shared in reproducible, transparent and usable formats. With data citation producing evidence of, and credit for, its use in the wider research community, GigaScience demonstrates a move towards more executable publications. Here data analyses can be reproduced and built upon by users without coding backgrounds or heavy computational infrastructure in a more democratized manner. © 2016, The Author(s).
引用
收藏
页码:99 / 111
页数:12
相关论文
共 50 条
  • [1] On research data publishing
    Leonardo Candela
    Donatella Castelli
    Paolo Manghi
    Sarah Callaghan
    [J]. International Journal on Digital Libraries, 2017, 18 (2) : 73 - 75
  • [2] Experiences of Undergraduates Publishing Biomechanics Research
    McErlain-Naylor, Stuart A.
    [J]. JOURNAL OF APPLIED BIOMECHANICS, 2020, 36 (05) : 351 - 359
  • [3] Experiences of undergraduates publishing biomechanics research
    McErlain-Naylor, Stuart A.
    [J]. Journal of Applied Biomechanics, 2020, 36 (05): : 351 - 359
  • [4] Experiences using object data management in the real world
    Chaudhri, AB
    [J]. THEORY AND PRACTICE OF OBJECT SYSTEMS, 1999, 5 (04): : 199 - 200
  • [5] Data and information management for integrated research - requirements, experiences and solutions
    Zander, F.
    Kralisch, S.
    Fluegel, W. -A.
    [J]. 20TH INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION (MODSIM2013), 2013, : 2201 - 2206
  • [6] Vocabularies for Publishing Research Data
    Tomoyose, Kazumi
    Simionato Arakaki, Ana Carolina
    [J]. CATALOGING & CLASSIFICATION QUARTERLY, 2022, 60 (01) : 69 - 85
  • [7] An integrated system for publishing environmental observations data
    Horsburgh, Jeffery S.
    Tarboton, David G.
    Piasecki, Michael
    Maidment, David R.
    Zaslavsky, Ilya
    Valentine, David
    Whitenack, Thomas
    [J]. ENVIRONMENTAL MODELLING & SOFTWARE, 2009, 24 (08) : 879 - 888
  • [8] Providing Research Infrastructures with Data Publishing
    Assante, Massimiliano
    Candela, Leonardo
    Manghi, Paolo
    Pagano, Pasquale
    Castelli, Donatella
    [J]. ERCIM NEWS, 2015, (100): : 20 - 21
  • [9] Publishing Earthquake Engineering Research Data
    Pejsa, Stanislav
    Song, Cheng
    [J]. JCDL'13: PROCEEDINGS OF THE 13TH ACM/IEEE-CS JOINT CONFERENCE ON DIGITAL LIBRARIES, 2013, : 417 - 418
  • [10] Linkitup: Semantic Publishing of Research Data
    Hoekstra, Rinke
    Groth, Paul
    Charlaganov, Marat
    [J]. SEMANTIC WEB EVALUATION CHALLENGE, 2014, 475 : 95 - 100