tableone: An open source Python package for producing summary statistics for research papers

Cited by: 110
Authors
Pollard, Tom J. [1]
Johnson, Alistair E. W. [1]
Raffa, Jesse D. [1]
Mark, Roger G. [1]
Affiliations
[1] MIT, Lab Computat Physiol, 77 Massachusetts Ave, Cambridge, MA 02139 USA
Funding
U.S. National Institutes of Health
Keywords
descriptive statistics; python; quantitative research
DOI
10.1093/jamiaopen/ooy012
Chinese Library Classification (CLC) number
R19 [Health care organizations and services (health services administration)]
Abstract
Objectives: In quantitative research, understanding basic parameters of the study population is key for interpretation of the results. As a result, it is typical for the first table ("Table 1") of a research paper to include summary statistics for the study data. Our objectives are 2-fold. First, we seek to provide a simple, reproducible method for providing summary statistics for research papers in the Python programming language. Second, we seek to use the package to improve the quality of summary statistics reported in research papers.

Materials and Methods: The tableone package is developed following good practice guidelines for scientific computing and all code is made available under a permissive MIT License. A testing framework runs on a continuous integration server, helping to maintain code stability. Issues are tracked openly and public contributions are encouraged.

Results: The tableone software package automatically compiles summary statistics into publishable formats such as CSV, HTML, and LaTeX. An executable Jupyter Notebook demonstrates application of the package to a subset of data from the MIMIC-III database. Tests such as Tukey's rule for outlier detection and Hartigan's Dip Test for modality are computed to highlight potential issues in summarizing the data.

Discussion and Conclusion: We present open source software for researchers to facilitate carrying out reproducible studies in Python, an increasingly popular language in scientific research. The toolkit is intended to mature over time with community feedback and input. Development of a common tool for summarizing data may help to promote good practice when used as a supplement to existing guidelines and recommendations. We encourage use of tableone alongside other methods of descriptive statistics and, in particular, visualization to ensure appropriate data handling. We also suggest seeking guidance from a statistician when using tableone for a research study, especially prior to submitting the study for publication.
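
The abstract mentions Tukey's rule as one of the checks used to flag potential problems in summarized data. As a rough illustration only, the rule can be sketched with the Python standard library. The function name `tukey_outliers`, the fence multiplier default of 1.5, the quartile method, and the sample values are all assumptions made for this sketch; the tableone package's own implementation may differ.

```python
import statistics


def tukey_outliers(values, k=1.5):
    """Return the values lying outside Tukey's fences.

    The fences are [Q1 - k * IQR, Q3 + k * IQR], where IQR = Q3 - Q1.
    Hypothetical helper for illustration; not part of tableone.
    """
    # Quartiles computed with the "inclusive" method (an assumption; other
    # quartile conventions can give slightly different fences).
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]


# Made-up ages with one implausible entry, purely illustrative.
ages = [23, 25, 27, 29, 31, 33, 35, 37, 120]
print(tukey_outliers(ages))  # prints [120]
```

A flagged value is not necessarily an error. In the spirit of the abstract, it is a prompt to inspect the data (for example, with a plot) before reporting a mean and standard deviation.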
Pages: 26-31
Page count: 6