Data mining and knowledge discovery in databases - An overview

被引:15
|
作者
MacKinnon, MJ
Glick, N
机构
[1] Univ British Columbia, Dept Stat, Vancouver, BC V5Z 1M9, Canada
[2] Univ British Columbia, Dept Hlth Care & Epidemiol, Vancouver, BC V5Z 1M9, Canada
关键词
data mining; knowledge discovery; machine learning; observational study;
D O I
10.1111/1467-842X.00081
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Data mining seeks to extract useful, but previously unknown, information from typically massive collections of non-experimental, sometimes non-traditional data. From the perspective of statisticians, this paper surveys techniques used and contributions from fields such as data warehousing, machine learning from artificial intelligence, and visualization as well as statistics. It concludes that statistical thinking and design of analysis, as exemplified by achievements in clinical epidemiology, may fit well with the emerging activities of data mining and 'knowledge discovery in databases' (DM&KDD).
引用
收藏
页码:255 / 275
页数:21
相关论文
共 50 条