Open data and open code for big science of science studies

被引:43
|
作者
Light, Robert P. [1 ]
Polley, David E. [1 ]
Boerner, Katy [1 ]
机构
[1] Indiana Univ, Sch Informat & Comp, Cyberinfrastruct Network Sci Ctr, Bloomington, IN 47405 USA
基金
美国国家科学基金会;
关键词
Open data; Visualization software; Big data; Scalability; Workflows;
D O I
10.1007/s11192-014-1238-2
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Historically, science of science (Sci2) studies have been performed by single investigators or small teams. As the size and complexity of data sets and analyses scales up, a "Big Science'' approach (Price, Little science, big science, 1963) is required that exploits the expertise and resources of interdisciplinary teams spanning academic, government, and industry boundaries. Big Sci2 studies utilize "big data'', i.e., large, complex, diverse, longitudinal, and/or distributed datasets that might be owned by different stake-holders. They apply a systems science approach to uncover hidden patterns, bursts of activity, correlations, and laws. They make available open data and open code in support of replication of results, iterative refinement of approaches and tools, and education. This paper introduces a database-tool infrastructure that was designed to support big Sci2 studies. The open access Scholarly Database (http://sdb.cns.iu.edu) provides easy access to 26 million paper, patent, grant, and clinical trial records. The open source Sci2 tool (http://sci2.cns.iu.edu) supports temporal, geospatial, topical, and network studies. The scalability of the infrastructure is examined. Results show that temporal analyses scale linearly with the number of records and file size, while the geospatial algorithm showed quadratic growth. The number of edges rather than nodes determined performance for network based algorithms.
引用
收藏
页码:1535 / 1551
页数:17
相关论文
共 50 条
  • [41] OPEN DATA INFRASTRUCTURES: EUROPEAN OPEN SCIENCE CLOUD
    Vevera, V. A.
    Barbu, D.
    [J]. 14TH INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE (INTED2020), 2020, : 5573 - 5577
  • [42] Open science: The open clinical trials data journey
    Rockhold, Frank
    Bromley, Christina
    Wagner, Erin K.
    Buyse, Marc
    [J]. CLINICAL TRIALS, 2019, 16 (05) : 539 - 546
  • [43] Open access to research data as a driver for open science
    Giglia, Elena
    [J]. JLIS.IT, 2015, 6 (02): : 225 - 247
  • [44] FAQ on Open Data and Open Science in the Sport Psychology
    Schoenbrodt, Felix D.
    Scheel, Anne
    [J]. ZEITSCHRIFT FUR SPORTPSYCHOLOGIE, 2017, 24 (04): : 134 - 139
  • [45] Open Molecular Science for the Open Science Cloud
    Lagana, Antonio
    Terstyanszky, Gabor
    Krueger, Jens
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2017, PT III, 2017, 10406 : 29 - 43
  • [46] SMS: a linked open data infrastructure for science and innovation studies
    van den Besselaar, Peter
    Khalili, Ali
    Idrissou, Al
    Loizou, Antonis
    Schlobach, Stefan
    van Harmelen, Frank
    [J]. 21ST INTERNATIONAL CONFERENCE ON SCIENCE AND TECHNOLOGY INDICATORS (STI 2016), 2016, : 106 - 114
  • [47] The Science International Accord on Open Data in a Big Data World and the IUCr's response
    Hackert, Marvin
    Van Meervelt, Luc
    Helliwell, John
    McMahon, Brian
    [J]. ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 2017, 73 : A100 - A100
  • [48] The 13th National Information Day: Open Science, Open Data, Open Access, Bulgarian Open Science Cloud
    Stanchev, Peter
    Karaivanova, Aneta
    Zherkova, Yanita
    Klisarova, Hristiyaniya
    Pavlov, Radoslav
    Simeonov, Georgi
    [J]. DIGITAL PRESENTATION AND PRESERVATION OF CULTURAL AND SCIENTIFIC HERITAGE, 2022, 12 : 309 - 316
  • [49] The 14th National Information Day: Open Science, Open Data, Open Access, Bulgarian Open Science Cloud
    Stanchev, Peter
    Karaivanova, Aneta
    Zherkova, Yanita
    Klisarova, Hristiyaniya
    Iliev, Jordan
    Pavlov, Radoslav
    Simeonov, Georgi
    [J]. DIGITAL PRESENTATION AND PRESERVATION OF CULTURAL AND SCIENTIFIC HERITAGE, 2023, 13 : 343 - 351
  • [50] The 12th National Information Day: Open Science, Open Data, Open Access, Bulgarian Open Science Cloud
    Stanchev, Peter
    Ancheva, Hristiyaniya
    Karaivanova, Aneta
    Pavlov, Radoslav
    Simeonov, George
    [J]. DIGITAL PRESENTATION AND PRESERVATION OF CULTURAL AND SCIENTIFIC HERITAGE, 2021, 11 : 333 - 339