Statistical Perspectives on "Big Data"

被引:37
|
作者
Megahed, Fadel M. [1 ]
Jones-Farmer, L. Allison [2 ]
机构
[1] Auburn Univ, Dept Ind & Syst Engn, Auburn, AL 36849 USA
[2] Miami Univ, Dept Informat Syst & Analyt, Oxford, OH 45056 USA
关键词
Analytics; Control charts; Data mining; High-dimensional data; Image-monitoring; Surveillance; Text mining; CONTROL CHART; SUPPORT; SHIFTS;
D O I
10.1007/978-3-319-12355-4_3
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
As our information infrastructure evolves, our ability to store, extract, and analyze data is rapidly changing. Big data is a popular term that is used to describe the large, diverse, complex and/or longitudinal datasets generated from a variety of instruments, sensors and/or computer-based transactions. The term big data refers not only to the size or volume of data, but also to the variety of data and the velocity or speed of data accrual. As the volume, variety, and velocity of data increase, our existing analytical methodologies are stretched to new limits. These changes pose new opportunities for researchers in statistical methodology, including those interested in surveillance and statistical process control methods. Although it is well documented that harnessing big data to make better decisions can serve as a basis for innovative solutions in industry, healthcare, and science, these solutions can be found more easily with sound statistical methodologies. In this paper, we discuss several big data applications to highlight the opportunities and challenges for applied statisticians interested in surveillance and statistical process control. Our goal is to bring the research issues into better focus and encourage methodological developments for big data analysis in these areas.
引用
收藏
页码:29 / 47
页数:19
相关论文
共 50 条
  • [21] Data Visualization and Statistical Literacy for Open and Big Data
    Shanmugam, Ramalingam
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2020,
  • [22] Data Visualization and Statistical Graphics in Big Data Analysis
    Cook, Dianne
    Lee, Eun-Kyung
    Majumder, Mahbubul
    [J]. ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 3, 2016, 3 : 133 - 159
  • [23] Big data and data processing in rheumatology: bioethical perspectives
    Amaranta Manrique de Lara
    Ingris Peláez-Ballestas
    [J]. Clinical Rheumatology, 2020, 39 : 1007 - 1014
  • [24] Big data and data processing in rheumatology: bioethical perspectives
    Manrique de Lara, Amaranta
    Pelaez-Ballestas, Ingris
    [J]. CLINICAL RHEUMATOLOGY, 2020, 39 (04) : 1007 - 1014
  • [25] Causes of deaths data, linkages and big data perspectives
    Rey, Gregoire
    Bounebache, Karim
    Rondet, Claire
    [J]. JOURNAL OF FORENSIC AND LEGAL MEDICINE, 2018, 57 : 37 - 40
  • [26] Data assimilation: Mathematical and statistical perspectives
    Apte, A.
    Jones, C. K. R. T.
    Stuart, A. M.
    Voss, J.
    [J]. INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2008, 56 (08) : 1033 - 1046
  • [27] Study on the use of big data in Statistical Production
    Hypolito, Elizabeth Belo
    da Silva, Andrea Diniz
    Xavier, Antonia
    Chiquito, Atila Kopplin
    Gomes, Lucas Uchoa Moreira
    Peixoto, Isis Goncalves
    de Oliveira, Beatriz Menezes Marques
    Teixeira Junior, Antonio Etevaldo
    Frota, Alvaro de Moraes
    [J]. REVISTA TECNOLOGIA E SOCIEDADE, 2024, 20 (59): : 160 - 177
  • [28] An approach to big data inspired by statistical mechanics
    Cortese, John A.
    [J]. 2016 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2016,
  • [29] Statistical Challenges in "Big Data'' Human Neuroimaging
    Smith, Stephen M.
    Nichols, Thomas E.
    [J]. NEURON, 2018, 97 (02) : 263 - 268
  • [30] A Tool for Statistical Analysis on Network Big Data
    Ordonez, Carlos
    Johnson, Theodore
    Srivastava, Divesh
    Urbanek, Simon
    [J]. 2017 28TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2017, : 32 - 36