Massive datasets

被引:7
|
作者
Kettenring, Jon R. [1 ]
机构
[1] Drew Univ, Res Inst Scientists Emeriti RISE, Madison, NJ 07940 USA
关键词
large datasets; complex data; high dimensions; data reduction; multivariate methods;
D O I
10.1002/wics.15
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Massive datasets are so labeled because of their size and complexity. They do not yield readily to standard statistical analyses. The resulting frustration has served as a spur to researchers to develop better tools. Some progress has been made, but the need for considerably more explains why this line of research remains a top priority. Interdisciplinary teamwork is at least as important as tools and can be the key to cracking the hard challenges that these datasets pose. This review includes background information, examples, and statistical strategies to illustrate the state-of-the- art. (C) 2009 John Wiley & Sons, Inc.
引用
收藏
页码:25 / 32
页数:8
相关论文
共 50 条
  • [1] Mining of Massive Datasets
    Richter, Lothar
    [J]. BIOMETRICS, 2018, 74 (04) : 1520 - 1521
  • [2] Workshop on Massive Datasets
    Wren, Christopher R.
    Ivanov, Yuri A.
    [J]. ICMI'07: PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERFACES, 2007, : 385 - 385
  • [3] DOE and Massive Datasets
    不详
    [J]. JOURNAL OF NUCLEAR MEDICINE, 2012, 53 (06): : 26N - 26N
  • [4] Error correction for massive datasets
    Bruni, R
    [J]. OPTIMIZATION METHODS & SOFTWARE, 2005, 20 (2-3): : 291 - 310
  • [5] PROCESSING MASSIVE DATASETS IN GENOMICS
    Artiguenave, F.
    [J]. GAIA: AT THE FRONTIERS OF ASTROMETRY, 2011, 45 : 95 - 96
  • [6] Regression analysis for massive datasets
    Fan, Tsai-Hung
    Lin, Dennis K. J.
    Cheng, Kuang-Fu
    [J]. DATA & KNOWLEDGE ENGINEERING, 2007, 61 (03) : 554 - 562
  • [7] Adaptive quantile regressions for massive datasets
    Rong Jiang
    Wei-wei Chen
    Xin Liu
    [J]. Statistical Papers, 2021, 62 : 1981 - 1995
  • [8] Cluster analysis of massive datasets in astronomy
    Jang, Woncheol
    Hendry, Martin
    [J]. STATISTICS AND COMPUTING, 2007, 17 (03) : 253 - 262
  • [9] Distributed Subtrajectory Join on Massive Datasets
    Tampakis, Panagiotis
    Doulkeridis, Christos
    Pelekis, Nikos
    Theodoridis, Yannis
    [J]. ACM TRANSACTIONS ON SPATIAL ALGORITHMS AND SYSTEMS, 2020, 6 (02)
  • [10] Coping with high dimensionality in massive datasets
    Kettenring, Jon R.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2011, 3 (02): : 95 - 103