Unveiling the power of R: a comprehensive perspective for laboratory medicine data analysis

被引:0
|
作者
Ma, Chaochao [1 ,3 ]
Qiu, Ling [1 ,2 ]
机构
[1] Peking Union Med Coll & Chinese Acad Med Sci, Peking Union Med Coll Hosp, Dept Lab Med, Beijing 100730, Peoples R China
[2] Peking Union Med Coll & Chinese Acad Med Sci, Peking Union Med Coll Hosp, State Key Lab Complex Severe & Rare Dis, Beijing 100730, Peoples R China
[3] Peking Univ, Sch Publ Hlth, Dept Occupat & Environm Hlth Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
R language; laboratory medicine; data analysis; code; predictive models; WORLD BIG-DATA; REFERENCE INTERVALS; MISSING DATA; OUTLIERS; MODELS;
D O I
10.1515/cclm-2024-1193
中图分类号
R446 [实验室诊断]; R-33 [实验医学、医学实验];
学科分类号
1001 ;
摘要
R language has gained traction in laboratory medicine for its statistical power and dynamic tools like RMarkdown and RShiny. However, there is limited literature summarizing R packages and functions tailored for laboratory medicine, making it difficult for clinical laboratory workers to access these tools. Additionally, varying algorithms across R packages can lead to inconsistencies in published reports. This review addresses these challenges by providing an overview of R's evolution and its key features, followed by a summary of statistical methods implemented in R, including platform comparisons, precision verification, factor analysis, and the establishment of reference intervals (RIs). We also highlight the development and validation of predictive models using techniques such as linear and logistic regression, decision trees, random forests, support vector machines, naive Bayes, K-Nearest Neighbors, k-means clustering, and backpropagation neural networks - all implemented in R. To ensure transparency and reproducibility in research, a checklist is provided for authors publishing papers using R for data analysis in laboratory medicine. In the final section, the potential of R in big data analytics is explored, focusing on standardized reporting through RMarkdown and the creation of user-friendly data visualization platforms with RShiny. Moreover, the integration of large language models (LLMs), such as ChatGPT, is discussed for their benefits in enhancing R programming, automating reporting, and offering insights from data analysis, thus improving the efficiency and accuracy of laboratory data analysis.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Optimized R functions for analysis of ecological community data using the R virtual laboratory (RvLab)
    Varsos, Constantinos
    Patkos, Theodore
    Oulas, Anastasis
    Pavloudi, Christina
    Gougousis, Alexandros
    Ijaz, Umer Zeeshan
    Filiopoulou, Irene
    Pattakos, Nikolaos
    Berghe, Edward Vanden
    Fernandez-Guerra, Antonio
    Faulwetter, Sarah
    Chatzinikolaou, Eva
    Pafilis, Evangelos
    Bekiari, Chryssoula
    Doerr, Martin
    Arvanitidis, Christos
    BIODIVERSITY DATA JOURNAL, 2016, 4
  • [32] R'evolution in laboratory medicine: SFBC and Euromedlab'15
    Goudable, Joelle
    ANNALES DE BIOLOGIE CLINIQUE, 2015, 73 (01) : 5 - 5
  • [33] Paper Smart Cities data analysis with Power BI and R
    Mora-Arciniegas, Maria-Belen
    Tenesaca Luna, Gladys Alicia
    PROCEEDINGS OF THE 2022 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE (EDUCON 2022), 2022, : 1824 - 1828
  • [34] bcRep: R Package for Comprehensive Analysis of B Cell Receptor Repertoire Data
    Bischof, Julia
    Ibrahim, Saleh M.
    PLOS ONE, 2016, 11 (08):
  • [35] Unveiling Insights: Harnessing the Power of the Most-Frequent-Value Method for Sensor Data Analysis
    Golovko, Victor V.
    Kamaev, Oleg
    Sun, Jiansheng
    SENSORS, 2023, 23 (21)
  • [36] Unveiling the power of mitochondrial transfer in cancer progression: a perspective in ovarian cancer
    Wang, Caixia
    Xie, Chuan
    JOURNAL OF OVARIAN RESEARCH, 2024, 17 (01)
  • [37] Unveiling the potential for combined heat and power in Chilean industry - A policy perspective
    Valdes, Javier
    Poque Gonzalez, Axel Bastian
    Masip Macia, Yunesky
    Dorner, Wolfgang
    Camargo, Luis Ramirez
    ENERGY POLICY, 2020, 140 (140)
  • [38] Unveiling the persistence of meteorological drought in Iraq: a comprehensive spatiotemporal analysis
    Hatem, Israa
    Alwan, Imzahim A.
    Ziboon, Abdul Razzak T.
    Kuriqi, Alban
    SUSTAINABLE WATER RESOURCES MANAGEMENT, 2024, 10 (05)
  • [39] Unveiling racial disparities in hepatic neuroendocrine tumors: A comprehensive analysis
    Han, D.
    Zhang, Z.
    Du, H.
    JOURNAL OF NEUROENDOCRINOLOGY, 2024, 36 : 100 - 100
  • [40] Comprehensive Study of Compression and Texture Integration for Digital Imaging and Communications in Medicine Data Analysis
    Shakya, Amit Kumar
    Vidyarthi, Anurag
    TECHNOLOGIES, 2024, 12 (02)