Big Data Analysis Using Modern Statistical and Machine Learning Methods in Medicine

被引:55
|
作者
Yoo, Changwon [1 ]
Ramirez, Luis [1 ]
Liuzzi, Juan [2 ]
机构
[1] Florida Int Univ, Dept Biostat, Miami, FL 33199 USA
[2] Florida Int Univ, Dept Nutr & Dietet, Miami, FL 33199 USA
关键词
Bayesian analysis; Statistical data interpretation; Systems biology; DIMENSIONALITY REDUCTION METHOD; CELLULAR CONTROL PROCESSES; GENE-GENE INTERACTIONS; BAYESIAN NETWORKS; SYSTEM; CANCER; IDENTIFICATION; MATHEMATICS; DISCOVERY; PATHWAYS;
D O I
10.5213/inj.2014.18.2.50
中图分类号
R5 [内科学]; R69 [泌尿科学(泌尿生殖系疾病)];
学科分类号
1002 ; 100201 ;
摘要
In this article we introduce modern statistical machine learning and bioinformatics approaches that have been used in learning statistical relationships from big data in medicine and behavioral science that typically include clinical, genomic (and proteomic) and environmental variables. Every year, data collected from biomedical and behavioral science is getting larger and more complicated. Thus, in medicine, we also need to be aware of this trend and understand the statistical tools that are available to analyze these datasets. Many statistical analyses that are aimed to analyze such big datasets have been introduced recently. However, given many different types of clinical, genomic, and environmental data, it is rather uncommon to see statistical methods that combine knowledge resulting from those different data types. To this extent, we will introduce big data in terms of clinical data, single nucleotide polymorphism and gene expression studies and their interactions with environment. In this article, we will introduce the concept of well-known regression analyses such as linear and logistic regressions that has been widely used in clinical data analyses and modern statistical models such as Bayesian networks that has been introduced to analyze more complicated data. Also we will discuss how to represent the interaction among clinical, genomic, and environmental data in using modern statistical models. We conclude this article with a promising modern statistical method called Bayesian networks that is suitable in analyzing big data sets that consists with different type of large data from clinical, genomic, and environmental data. Such statistical model form big data will provide us with more comprehensive understanding of human physiology and disease.
引用
收藏
页码:50 / 57
页数:8
相关论文
共 50 条
  • [1] ASSESSMENT OF MODERN MACHINE LEARNING METHODS AND CONVENTIONAL STATISTICAL REGRESSION TECHNIQUES IN DIAGNOSIS AND PREDICTION OF OUTCOME AFTER ACUTE STROKE USING BIG DATA
    Woodhouse, L.
    Chen, X.
    Garibaldi, J.
    Havard, D.
    Montgomery, A.
    Quinn, T.
    Sprigg, N.
    Bath, P.
    [J]. INTERNATIONAL JOURNAL OF STROKE, 2021, 16 (3_SUPPL) : 54 - 54
  • [2] Quality Assessment of Data Using Statistical and Machine Learning Methods
    Singh, Prerna
    Suri, Bharti
    [J]. COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 2, 2015, 32 : 89 - 97
  • [3] A Survey of Machine Learning Methods for Big Data
    Ruiz, Zoila
    Salvador, Jaime
    Garcia-Rodriguez, Jose
    [J]. BIOMEDICAL APPLICATIONS BASED ON NATURAL AND ARTIFICIAL COMPUTING, PT II, 2017, 10338 : 259 - 267
  • [4] How Big Data changes Statistical Machine Learning
    Bottou, Leon
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1 - 1
  • [5] A Bayesian perspective of statistical machine learning for big data
    Rajiv Sambasivan
    Sourish Das
    Sujit K. Sahu
    [J]. Computational Statistics, 2020, 35 : 893 - 930
  • [6] A Bayesian perspective of statistical machine learning for big data
    Sambasivan, Rajiv
    Das, Sourish
    Sahu, Sujit K.
    [J]. COMPUTATIONAL STATISTICS, 2020, 35 (03) : 893 - 930
  • [7] Translational Medicine in the Era of Big Data and Machine Learning
    Weintraub, William S.
    Fahed, Akl C.
    Rumsfeld, John S.
    [J]. CIRCULATION RESEARCH, 2018, 123 (11) : 1202 - 1204
  • [8] PRECISION MEDICINE, BIG DATA AND MACHINE LEARNING IN OA
    Krawetz, R. J.
    [J]. OSTEOARTHRITIS AND CARTILAGE, 2018, 26 : S5 - S5
  • [9] Editorial Commentary: Big Data and Machine Learning in Medicine
    Hohmann, Erik
    [J]. ARTHROSCOPY-THE JOURNAL OF ARTHROSCOPIC AND RELATED SURGERY, 2022, 38 (03): : 848 - 849
  • [10] Tension in big data using machine learning: Analysis and applications
    Wang, Huamao
    Yao, Yumei
    Salhi, Said
    [J]. TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2020, 158