A step towards the foundations of data mining

被引:10
|
作者
Yao, YY [1 ]
机构
[1] Univ Regina, Dept Comp Sci, Regina, SK S4S 0A2, Canada
关键词
foundations of data mining; rule mining; formal concepts; explanation oriented mining; data mining models;
D O I
10.1117/12.509161
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses some fundamental issues related to the foundations of data mining. It is argued that there is an urgent need for formal and mathematical modeling of data mining. A formal framework provides a solid basis for a systematic study of many fundamental issues, such as representations and interpretations of primitive notions of data mining, data mining algorithms, explanations and applications of data mining results. A multi-level framework is proposed for modeling data mining based on results from many related fields. Formal concepts are adopted as the primitive notion. A concept is jointly defined as a pair consisting of the intension and the extension of the concept, namely, a formula in a certain language and a subset of the universe. An object satisfies the formula of a concept if the object has the properties as specified by the formula, and the object belongs to the extension of the concept. Rules are used to describe relationships between concepts. A rule is expressed in terms of the intensions of the two concepts and is interpreted in terms of the extensions of the concepts. Several different types of rules are investigated. The usefulness and meaningfulness of discovered knowledge are examined using a utility model and an explanation model.
引用
收藏
页码:254 / 263
页数:10
相关论文
共 50 条
  • [21] Towards Integrated Study of Data Management and Data Mining
    Chen, Zhengxin
    3RD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, ITQM 2015, 2015, 55 : 1331 - 1339
  • [22] Statistical Data Analytics. Foundations for Data Mining, Informatics, and Knowledge Discovery
    Lalanne, Christophe
    JOURNAL OF STATISTICAL SOFTWARE, 2016, 69 (B3):
  • [23] FOUNDATIONS OF ADAPTIVE DATA STREAM MINING FOR MOBILE AND EMBEDDED APPLICATIONS
    Gaber, Mohamed Medhat
    2008 CAIRO INTERNATIONAL BIOMEDICAL ENGINEERING CONFERENCE, 2008, : 298 - 303
  • [24] Development of conceptual foundations of medico-statistical data mining
    Muradova, Gulara
    2018 IEEE 12TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT), 2018, : 78 - 82
  • [25] UniMiner: Towards a Unified Framework for Data Mining
    Rehman, Muhammad Habib Ur
    Liew, Chee Sun
    Teh, Ying Wah
    2014 4TH WORLD CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGIES (WICT), 2014, : 134 - 139
  • [26] Sonification: A novel approach towards data mining
    Halim, Zahid
    Baig, Rauf
    Bashir, Shariq
    SECOND INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES 2006, PROCEEDINGS, 2006, : 548 - +
  • [27] Towards a logic query language for data mining
    Giannotti, F
    Manco, G
    Turini, F
    DATABASE SUPPORT FOR DATA MINING APPLICATIONS: DISCOVERING KNOWLEDGE WITH INDUCTIVE QUERIES, 2004, 2682 : 76 - 94
  • [28] Towards a Metaquery Language for Mining the Web of Data
    Lisi, Francesca A.
    DATA ANALYTICS, 2017, 10365 : 90 - 93
  • [29] A fuzzy approach towards inferential data mining
    Rubin, SH
    COMPUTERS & INDUSTRIAL ENGINEERING, 1998, 35 (1-2) : 267 - 270
  • [30] Towards data warehousing and mining of protein unfolding simulation data
    Berrar D.
    Stahl F.
    Silva C.
    Rodrigues J.R.
    Brito R.M.M.
    Dubitzky W.
    Journal of Clinical Monitoring and Computing, 2005, 19 (4-5) : 307 - 317