Data Science: the impact of statistics

Cited by: 33
Authors
Weihs, Claus [1]
Ickstadt, Katja [2]
Affiliations
[1] TU Dortmund Univ, Computat Stat, D-44221 Dortmund, Germany
[2] TU Dortmund Univ, Math Stat & Biometr Applicat, D-44221 Dortmund, Germany
Keywords
Structures of data science; Impact of statistics on data science; Fallacies in data science
DOI
10.1007/s41060-018-0102-5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we substantiate our premise that statistics is one of the most important disciplines to provide tools and methods to find structure in and to give deeper insight into data, and the most important discipline to analyze and quantify uncertainty. We give an overview of different proposed structures of Data Science and address the impact of statistics on such steps as data acquisition and enrichment, data exploration, data analysis and modeling, validation, and representation and reporting. We also indicate fallacies that arise when statistical reasoning is neglected.
Pages: 189-194
Page count: 6