A step towards the foundations of data mining

被引:10
|
作者
Yao, YY [1 ]
机构
[1] Univ Regina, Dept Comp Sci, Regina, SK S4S 0A2, Canada
关键词
foundations of data mining; rule mining; formal concepts; explanation oriented mining; data mining models;
D O I
10.1117/12.509161
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses some fundamental issues related to the foundations of data mining. It is argued that there is an urgent need for formal and mathematical modeling of data mining. A formal framework provides a solid basis for a systematic study of many fundamental issues, such as representations and interpretations of primitive notions of data mining, data mining algorithms, explanations and applications of data mining results. A multi-level framework is proposed for modeling data mining based on results from many related fields. Formal concepts are adopted as the primitive notion. A concept is jointly defined as a pair consisting of the intension and the extension of the concept, namely, a formula in a certain language and a subset of the universe. An object satisfies the formula of a concept if the object has the properties as specified by the formula, and the object belongs to the extension of the concept. Rules are used to describe relationships between concepts. A rule is expressed in terms of the intensions of the two concepts and is interpreted in terms of the extensions of the concepts. Several different types of rules are investigated. The usefulness and meaningfulness of discovered knowledge are examined using a utility model and an explanation model.
引用
收藏
页码:254 / 263
页数:10
相关论文
共 50 条
  • [31] Data and knowledge mining with big data towards smart production
    Cheng, Ying
    Chen, Ken
    Sun, Hemeng
    Zhang, Yongping
    Tao, Fei
    JOURNAL OF INDUSTRIAL INFORMATION INTEGRATION, 2018, 9 : 1 - 13
  • [32] Scaling the data mining step in knowledge discovery using oceanographic data
    Wooley, B
    Bridges, S
    Hodges, J
    Skjellum, A
    INTELLIGENT PROBLEM SOLVING: METHODOLOGIES AND APPROACHES, PRODEEDINGS, 2000, 1821 : 85 - 92
  • [33] DATABASE COMPUTERS - STEP TOWARDS DATA UTILITIES
    BAUM, RI
    HSIAO, DK
    IEEE TRANSACTIONS ON COMPUTERS, 1976, 25 (12) : 1254 - 1259
  • [34] Data Prospecting-A Step Towards Data Intensive Science
    Ramachandran, Rahul
    Rushing, John
    Lin, Amy
    Conover, Helen
    Li, Xiang
    Graves, Sara
    Nair, U. S.
    Kuo, Kwo-Sen
    Smith, Deborah K.
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2013, 6 (03) : 1233 - 1241
  • [35] Step by Step Towards Energy-Aware Data Warehouse Design
    Bellatreche, Ladjel
    Roukh, Amine
    Bouarar, Selma
    BUSINESS INTELLIGENCE (EBISS 2016), 2017, 280 : 105 - 138
  • [36] Frequent set meta mining: Towards multi-agent data mining
    Albashiri, Kamal Ali
    Coenen, Frans
    Sanderson, Rob
    Leng, Paul
    RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXIV, 2008, : 139 - 151
  • [37] Data augmentation and data mining towards microstructure and property relationship for composites
    Guo, Ziyan
    Liu, Xuhao
    Pan, Zehua
    Zhou, Yexin
    Zhong, Zheng
    Yan, Zilin
    ENGINEERING COMPUTATIONS, 2023, 40 (7/8) : 1617 - 1632
  • [38] A step-by-step protocol based on data mining to explore purinergic signaling in glioblastoma
    Bedeschi, Martina
    Agrawal, Ankita
    Adinolfi, Elena
    Tesei, Anna
    Vouret-Craviari, Valerie
    PURINERGIC SIGNALLING, 2025,
  • [39] Towards an App Based on FIWARE Architecture and Data Mining with Imperfect Data
    Cadenas, Jose M.
    Carmen Garrido, M.
    Villa, Cristina
    INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS: THEORY AND FOUNDATIONS, PT II, 2018, 854 : 75 - 87
  • [40] A Heuristic Data Mining Framework Towards Dynamic Data of Social Media
    Kee, Estelle Xin Ying
    Hong, Jer Lang
    NEURAL INFORMATION PROCESSING, ICONIP 2015, PT IV, 2015, 9492 : 403 - 409