Reproducibility Starts at the Source: R, Python, and Julia Packages for Retrieving USGS Hydrologic Data

Cited by: 3
Authors
Hodson, Timothy O. [1]
Decicco, Laura A. [2]
Hariharan, Jayaram A. [3]
Stanish, Lee F. [3]
Black, Scott [4]
Horsburgh, Jeffery S. [5]
Affiliations
[1] US Geol Survey, Cent Midwest Water Sci Ctr, Urbana, IL 61801 USA
[2] US Geol Survey, Upper Midwest Water Sci Ctr, Madison, WI 53726 USA
[3] US Geol Survey, Water Mission Area, Reston, VA 20192 USA
[4] Consortium Univ Advancement Hydrol Sci Inc CUAHSI, Arlington, MA 02476 USA
[5] Utah State Univ, Civil & Environm Engn, Logan, UT 84322 USA
Funding
U.S. National Science Foundation
Keywords
packaged workflows; water data; reproducibility; open science; open data; open source; R; Python; Julia; Jupyter; USGS
DOI
10.3390/w15244236
Chinese Library Classification
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Much of modern science takes place in a computational environment, and, increasingly, that environment is programmed using R, Python, or Julia. Furthermore, most scientific data now live on the cloud, so the first step in many workflows is to query a cloud database and load the response into a computational environment for further analysis. Thus, tools that facilitate programmatic data retrieval represent a critical component in reproducible scientific workflows. Earth science is no different in this regard. To fulfill that basic need, we developed R, Python, and Julia packages providing programmatic access to the U.S. Geological Survey's National Water Information System database and the multi-agency Water Quality Portal. Together, these packages create a common interface for retrieving hydrologic data in the Jupyter ecosystem, which is widely used in water research, operations, and teaching. Source code, documentation, and tutorials for the packages are available on GitHub. Users can go there to learn, raise issues, or contribute improvements within a single platform, which helps foster better engagement and collaboration between data providers and their users.
Pages: 10
Related papers
50 total
  • [31] pyActigraphy, an open-source python package for actigraphy data visualisation and analysis
    Hammad, G.
    Reyt, M.
    Beliy, N.
    Baillet, M.
    Deantoni, M.
    Lesoinne, A.
    Muto, V.
    Schmidt, C.
    JOURNAL OF SLEEP RESEARCH, 2020, 29 : 291 - 292
  • [32] Modeling the impact of Python and R packages using dependency and contributor networks
    Gizem Korkmaz
    Claire Kelling
    Carol Robbins
    Sallie Keller
    Social Network Analysis and Mining, 2020, 10
  • [33] TOWARDS AN OPEN SOURCE PYTHON LIBRARY FOR AUTOMATED EXPLORATORY SPATIAL DATA ANALYSIS
    de Kock, Nicholas
    Rautenbach, Victoria
    Fabris-Rotelli, Inger
    XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION IV, 2022, 43-B4 : 91 - 98
  • [34] Python Indian Weather Radar Toolkit (pyiwr): An open-source Python library for processing, analyzing and visualizing weather radar data
    Singh, Nitig
    Tyagi, Vaibhav
    Das, Saurabh
    Sahoo, Udaya Kumar
    Kundu, Shyam Sundar
    JOURNAL OF COMPUTATIONAL SCIENCE, 2024, 81
  • [35] Picasso: A Sparse Learning Library for High Dimensional Data Analysis in R and Python
    Ge, Jason
    Li, Xingguo
    Jiang, Haoming
    Liu, Han
    Zhang, Tong
    Wang, Mengdi
    Zhao, Tuo
    JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
  • [36] Sleep: An Open-Source Python Software for Visualization, Analysis, and Staging of Sleep Data
    Combrisson, Etienne
    Vallat, Raphael
    Eichenlaub, Jean-Baptiste
    O'Reilly, Christian
    Lajnef, Tarek
    Guillot, Aymeric
    Ruby, Perrine M.
    Jerbi, Karim
    FRONTIERS IN NEUROINFORMATICS, 2017, 11
  • [37] A new open source data analysis python script for QSAR study and its validation
    Kurdekar, Vadiraj
    Jadhav, Hemant R.
    MEDICINAL CHEMISTRY RESEARCH, 2015, 24 (04) : 1617 - 1625
  • [38] K Nearest Neighbor OveRsampling approach: An open source python package for data augmentation
    Islam, Ashhadul
    Belhaouari, Samir Brahim
    Rehman, Atiq Ur
    Bensmail, Halima
    SOFTWARE IMPACTS, 2022, 12
  • [39] PyDicer: An open-source python library for conversion and analysis of radiotherapy DICOM data
    Chlap, Phillip
    Al Mouiee, Daniel
    Finnegan, Robert N.
    Cui, Janet
    Chin, Vicky
    Deshpande, Shrikant
    Holloway, Lois
    SOFTWAREX, 2025, 29
  • [40] pyPhotometry: Open source Python based hardware and software for fiber photometry data acquisition
    Akam, Thomas
    Walton, Mark E.
    SCIENTIFIC REPORTS, 2019, 9 (1)