Open data and open code for big science of science studies

被引:0
|
作者
Robert P. Light
David E. Polley
Katy Börner
机构
[1] Indiana University,Cyberinfrastructure for Network Science Center, School of Informatics and Computing
来源
Scientometrics | 2014年 / 101卷
关键词
Open data; Visualization software; Big data; Scalability; Workflows;
D O I
暂无
中图分类号
学科分类号
摘要
Historically, science of science (Sci2) studies have been performed by single investigators or small teams. As the size and complexity of data sets and analyses scales up, a “Big Science” approach (Price, Little science, big science, 1963) is required that exploits the expertise and resources of interdisciplinary teams spanning academic, government, and industry boundaries. Big Sci2 studies utilize “big data”, i.e., large, complex, diverse, longitudinal, and/or distributed datasets that might be owned by different stakeholders. They apply a systems science approach to uncover hidden patterns, bursts of activity, correlations, and laws. They make available open data and open code in support of replication of results, iterative refinement of approaches and tools, and education. This paper introduces a database-tool infrastructure that was designed to support big Sci2 studies. The open access Scholarly Database (http://sdb.cns.iu.edu) provides easy access to 26 million paper, patent, grant, and clinical trial records. The open source Sci2 tool (http://sci2.cns.iu.edu) supports temporal, geospatial, topical, and network studies. The scalability of the infrastructure is examined. Results show that temporal analyses scale linearly with the number of records and file size, while the geospatial algorithm showed quadratic growth. The number of edges rather than nodes determined performance for network based algorithms.
引用
收藏
页码:1535 / 1551
页数:16
相关论文
共 50 条
  • [21] Open Science in Sport and Exercise Sciences? A Tutorial for Preregistration of Research Projects, Open Material, Open Data, and Open Code
    Utesch, Till
    Dreiskaemper, Dennis
    Geukes, Katharina
    ZEITSCHRIFT FUR SPORTPSYCHOLOGIE, 2017, 24 (03): : 92 - 99
  • [22] Open Archives for Open Science at the ESAC Science Data Centre (ESDC)
    Gonzalez-Nunez, J.
    Rios, C.
    Salgado, J.
    Macfarlane, A.
    Barbariai, I.
    Arviset, C.
    Alvarez, R.
    Geiger, B.
    Rourke, L. O.
    Baldwin, E.
    Merin, B.
    Agudo-Agelet, F.
    Mullane, W. O.
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XXV, 2017, 512 : 141 - 144
  • [23] Enabling Interdisciplinary Research in Open Science: Open Science Data Network
    Dang, Vincent-Nam
    Aussenac-Gilles, Nathalie
    Megdiche, Imen
    Ravat, Franck
    RESEARCH CHALLENGES IN INFORMATION SCIENCE, PT I, RCIS 2024, 2024, 513 : 19 - 34
  • [24] The Eleventh National Information Day: Open Science, Open Data, Open Access, Bulgarian Open Science Cloud
    Stanchev, Peter
    Ancheva, Hristiyaniya
    Pavlov, Radoslav
    Simeonov, George
    DIGITAL PRESENTATION AND PRESERVATION OF CULTURAL AND SCIENTIFIC HERITAGE, 2020, 10 : 275 - 281
  • [25] The Tenth National Information Day: Open Science, Open Data, Open Access, Bulgarian Open Science Cloud
    Stanchev, Peter
    Angelieva, Karina
    Zherkova, Yanita
    Pavlov, Radoslav
    Simeonov, George
    DIGITAL PRESENTATION AND PRESERVATION OF CULTURAL AND SCIENTIFIC HERITAGE, 2019, 9 : 403 - 408
  • [26] Open Science, Open Research Data and some Open Questions
    Novotny, Jakub
    HRADEC ECONOMIC DAYS, PT II, 2019, 2019, 9 : 174 - 181
  • [27] BIG DATA, DATA SCIENCE AND THEIR CONTRIBUTIONS TO THE DEVELOPMENT OF THE USE OF OPEN SOURCE INTELLIGENCE
    dos Passos, Danielle Sandler
    SISTEMAS & GESTAO, 2016, 11 (04): : 392 - 396
  • [28] OPEN SCIENCE DATA CATALOGUE
    Schindler, F.
    Pari, S.
    Meissl, S.
    Smith, G.
    Dobrowolska, E.
    Anghelea, A.
    GEOSPATIAL WEEK 2023, VOL. 48-1, 2023, : 997 - 1003
  • [29] Open science and trustworthy data
    Morris, Peter E.
    Fritz, Catherine O.
    PSYCHOLOGIST, 2016, 29 (01) : 4 - 4
  • [30] Open data for better science
    Weijie Zhao
    National Science Review, 2018, 5 (04) : 593 - 597