An Introduction to Data Mining

Cited by: 9
Author
Apostolakis, Joannis [1]
Affiliation
[1] Univ Munich, Inst Informat, D-80538 Munich, Germany
Keywords
MULTILAYER FEEDFORWARD NETWORKS; RELATE; 2; SETS; POTENTIALS; ROTATION;
DOI
10.1007/430-2009_1
Chinese Library Classification
O61 [Inorganic Chemistry];
Subject classification codes
070301 ; 081704 ;
Abstract
Data mining aims at the automated discovery of knowledge from typically large repositories of data. In science, this knowledge is most often integrated into a model describing a particular process or natural phenomenon. Requirements with respect to the predictivity and generality of the resulting models are usually significantly higher than in other application domains. Therefore, in the use of data mining in the sciences, and in crystallography in particular, methods from machine learning and statistics play a significantly larger role than in other application areas. In the context of crystallography, data collection, cleaning, and warehousing are aspects of standard data mining that play an important role, whereas the analysis of the data mostly relies on techniques from machine learning and statistical analysis. The purpose of this chapter is to introduce the reader to the concepts from that latter part of the knowledge discovery process and to provide a general intuition for the methods and possibilities of the different tools for learning from databases.
Pages: 1-35
Page count: 35