Reproducibility Starts at the Source: R, Python']Python, and Julia Packages for Retrieving USGS Hydrologic Data

被引:3
|
作者
Hodson, Timothy O. [1 ]
Decicco, Laura A. [2 ]
Hariharan, Jayaram A. [3 ]
Stanish, Lee F. [3 ]
Black, Scott [4 ]
Horsburgh, Jeffery S. [5 ]
机构
[1] US Geol Survey, Cent Midwest Water Sci Ctr, Urbana, IL 61801 USA
[2] US Geol Survey, Upper Midwest Water Sci Ctr, Madison, WI 53726 USA
[3] US Geol Survey, Water Mission Area, Reston, VA 20192 USA
[4] Consortium Univ Advancement Hydrol Sci Inc CUAHSI, Arlington, MA 02476 USA
[5] Utah State Univ, Civil & Environm Engn, Logan, UT 84322 USA
基金
美国国家科学基金会;
关键词
packaged workflows; water data; reproducibility; open science; open data; open source; R; !text type='Python']Python[!/text; Julia; Jupyter; USGS; JUPYTER;
D O I
10.3390/w15244236
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Much of modern science takes place in a computational environment, and, increasingly, that environment is programmed using R, Python, or Julia. Furthermore, most scientific data now live on the cloud, so the first step in many workflows is to query a cloud database and load the response into a computational environment for further analysis. Thus, tools that facilitate programmatic data retrieval represent a critical component in reproducible scientific workflows. Earth science is no different in this regard. To fulfill that basic need, we developed R, Python, and Julia packages providing programmatic access to the U.S. Geological Survey's National Water Information System database and the multi-agency Water Quality Portal. Together, these packages create a common interface for retrieving hydrologic data in the Jupyter ecosystem, which is widely used in water research, operations, and teaching. Source code, documentation, and tutorials for the packages are available on GitHub. Users can go there to learn, raise issues, or contribute improvements within a single platform, which helps foster better engagement and collaboration between data providers and their users.
引用
收藏
页数:10
相关论文
共 50 条
  • [11] pyPMU - Open Source Python']Python Package for Synchrophasor Data Transfer
    Sandi, Stevan
    Krstajic, Bozo
    Popovic, Tomo
    2016 24TH TELECOMMUNICATIONS FORUM (TELFOR), 2016, : 861 - 864
  • [12] Big Data Visualization: Allotting by R and Python']Python with GUI Tools
    Fahad, S. K. Ahammad
    Yahya, Abdulsamad Ebrahim
    2018 INTERNATIONAL CONFERENCE ON SMART COMPUTING AND ELECTRONIC ENTERPRISE (ICSCEE), 2018,
  • [13] Cgmquantify: Python']Python and R Software Packages for Comprehensive Analysis of Interstitial Glucose and Glycemic Variability from Continuous Glucose Monitor Data
    Bent, Brinnae
    Henriquez, Maria
    Dunn, Jessilyn
    IEEE OPEN JOURNAL OF ENGINEERING IN MEDICINE AND BIOLOGY, 2021, 2 (02): : 263 - 266
  • [14] Assessing Accuracy: A Study of Lexicon and Rule-Based Packages in R and Python']Python for Sentiment Analysis
    Mahmoudi, Amin
    Jemielniak, Dariusz
    Ciechanowski, Leon
    IEEE ACCESS, 2024, 12 : 20169 - 20180
  • [15] Mining Imperfect Data: With Examples in R and Python']Python, Second Edition
    Stroch, Joel A.
    IEEE CONTROL SYSTEMS MAGAZINE, 2021, 41 (06): : 110 - 110
  • [16] Multi-Source Data Preprocessing Method Research Based on Python']Python
    Liu, Miao
    Ma, Hongli
    Zhang, Yongzhen
    Yue, Feng
    2024 5TH INTERNATIONAL CONFERENCE ON GEOLOGY, MAPPING AND REMOTE SENSING, ICGMRS 2024, 2024, : 221 - 224
  • [17] SigMT: An open-source Python']Python package for magnetotelluric data processing
    Ajithabh, K. S.
    Patro, Prasanta K.
    COMPUTERS & GEOSCIENCES, 2023, 171
  • [18] Razorback, an Open Source Python']Python Library for Robust Processing of Magnetotelluric Data
    Smai, Farid
    Wawrzyniak, Pierre
    FRONTIERS IN EARTH SCIENCE, 2020, 8
  • [19] Atomic data mining numerical methods, source code SQlite with Python']Python
    Khwaldeh, Ali
    Tahat, Amani
    Marti, Jordi
    Tahat, Mofleh
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INTEGRATED INFORMATION (IC-ININFO 2012), 2013, 73 : 232 - 239
  • [20] Nmrglue: an open source Python']Python package for the analysis of multidimensional NMR data
    Helmus, Jonathan J.
    Jaroniec, Christopher P.
    JOURNAL OF BIOMOLECULAR NMR, 2013, 55 (04) : 355 - 367