Data mining: Statistics and more?

被引:175
|
作者
Hand, DJ [1 ]
机构
[1] Open Univ, Dept Stat, Milton Keynes MK7 6AA, Bucks, England
来源
AMERICAN STATISTICIAN | 1998年 / 52卷 / 02期
关键词
databases; exploratory data analysis; knowledge discovery;
D O I
10.2307/2685468
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Data mining is a new discipline lying at the interface of statistics, database technology, pattern recognition, machine learning, and other areas. It is concerned with the secondary analysis of large databases in order to find previously unsuspected relationships which are of interest or value to the database owners. New problems arise, partly as a consequence of the sheer size of the data sets involved, and partly because of issues of pattern matching. However, since statistics provides the intellectual glue underlying the effort, it is important for statisticians to become involved. There are very real opportunities for statisticians to make significant contributions.
引用
收藏
页码:112 / 118
页数:7
相关论文
共 50 条
  • [1] The Lure of Statistics in Data Mining
    Grover, Lovleen Kumar
    Mehra, Rajni
    [J]. JOURNAL OF STATISTICS EDUCATION, 2008, 16 (01):
  • [2] Data Mining and Statistics — Introduction
    Heike Hofmann
    Antony Unwin
    Adalbert Wilhem
    [J]. Computational Statistics, 2001, 16 : 317 - 321
  • [3] Data mining and statistics - Introduction
    Hofmann, H
    Unwin, A
    Wilhelm, A
    [J]. COMPUTATIONAL STATISTICS, 2001, 16 (03) : 317 - 321
  • [4] Data mining and more
    O'Leary, DE
    [J]. IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 2000, 15 (02): : 2 - 3
  • [5] Importance of Statistics for Data Mining and Data Science
    Ribeiro, Vitor
    Rocha, Andre
    Peixoto, Rui
    Portela, Filipe
    Santos, Manuel Filipe
    [J]. 2017 5TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD WORKSHOPS (FICLOUDW) 2017, 2017, : 156 - 163
  • [6] Data mining, statistical methods mining, and history of statistics
    Parzen, E
    [J]. MINING AND MODELING MASSIVE DATA SETS IN SCIENCE, ENGINEERING, AND BUSINESS WITH A SUBTHEME IN ENVIRONMENTAL STATISTICS, 1997, 29 (01): : 365 - 374
  • [7] Mining California vital statistics data
    Zhang, D
    Ha, QL
    Lu, ML
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 671 - 672
  • [8] Application of data mining in basketball statistics
    Ma, Bin
    Wang, Yingchun
    Li, Zheng
    [J]. APPLIED MATHEMATICS AND NONLINEAR SCIENCES, 2022, 8 (01) : 2179 - 2188
  • [9] Trends need more data - Statistics
    不详
    [J]. EUROPEAN CHEMICAL NEWS, 1997, 67 (1758): : 11 - 11
  • [10] Data mining for official statistics Challenges and opportunities
    Buelens, Bart
    Daas, Piet
    van den Brakel, Jan
    [J]. 12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2012), 2012, : 915 - 915