A Representation Based on Essence for the CRISP-DM Methodology

被引:0
|
作者
Vanegas, Claudia Elena Durango [1 ]
Mejia, Juan Camilo Giraldo [2 ]
Agudelo, Fabio Alberto Vargas [2 ]
Duran, Dario Enrique Soto [2 ]
机构
[1] Univ San Buenaventura, Fac Ingn, Medellin, Colombia
[2] Tecnol Antioquia, Fac Ingn, Medellin, Colombia
来源
COMPUTACION Y SISTEMAS | 2023年 / 27卷 / 03期
关键词
CRISP-DM methodology; data mining; representation model; essence;
D O I
10.13053/CyS-27-3-3446
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
CRoss Industry Standard Process for Data Mining (CRISP-DM) is a data mining project development methodology that establishes tasks and levels of abstraction, hierarchically structured to facilitate its implementation through a set of actions that help in making decisions. Essence is a theory that helps identify best practices and essential, common, and universal elements to all endeavor in the software development cycle. In the literature, there are different models of representation of the CRISP-DM methodology, such as verbal model, conceptual model, process understanding model, and ontology. However, it considered that these representation models lack the incorporation of some elements, such as, activities, work products, and roles of the CRISP-DM methodology. In this paper we propose a representation based on Essence of the CRISP-DM methodology, incorporating the essential elements that we believe are missing from existing representations. With the representation in Essence that is proposed, the aim is to improve the understanding of best practices and the essential, common, and universal elements of the CRISP-DM methodology for future implementations in data mining projects. In addition, it seeks to validate that Essence can be used in different of data mining projects.
引用
收藏
页码:675 / 689
页数:15
相关论文
共 50 条
  • [41] Web Scraping Scientific Repositories for Augmented Relevant Literature Search Using CRISP-DM
    Hassanien, Hossam El-Din
    APPLIED SYSTEM INNOVATION, 2019, 2 (04) : 1 - 22
  • [42] Exploring the Relationship Between Data Science and Circular Economy: An Enhanced CRISP-DM Process Model
    Kristoffersen, Eivind
    Aremu, Oluseun Omotola
    Blomsma, Fenna
    Mikalef, Patrick
    Li, Jingyue
    DIGITAL TRANSFORMATION FOR A SUSTAINABLE SOCIETY IN THE 21ST CENTURY, 2019, 11701 : 177 - 189
  • [43] 建立CRISP-DM模型分析移动用户离网情况
    李佳林
    徐亮
    通信企业管理, 2016, (06) : 72 - 74
  • [44] Analyzing and Processing of Supplier Database Based on the Cross-Industry Standard Process for Data Mining (CRISP-DM) Algorithm
    Nodeh, Mohsen Jafari
    Calp, M. Hanefi
    Sahin, Ismail
    ARTIFICIAL INTELLIGENCE AND APPLIED MATHEMATICS IN ENGINEERING PROBLEMS, 2020, 43 : 544 - 558
  • [45] CRISP-DM Twenty Years Later: From Data Mining Processes to Data Science Trajectories
    Martinez-Plumed, Fernando
    Contreras-Ochando, Lidia
    Ferri, Cesar
    Hernandez-Orallo, Jose
    Kull, Meelis
    Lachiche, Nicolas
    Ramirez-Quintana, Maria Jose
    Flach, Peter
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (08) : 3048 - 3061
  • [46] Data-driven analysis of carbon emissions from buildingization under the CRISP-DM framework
    Wang W.
    Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [47] Analysing warranty claims of automobiles - An application description following the CRISP-DM data mining process
    Hipp, J
    Lindner, G
    INTERNET APPLICATIONS, 1999, 1749 : 31 - 40
  • [48] 基于CRISP-DM模型的时序预测Web服务设计与实现
    王慧敏
    陈泽宇
    张驰
    计算机应用与软件, 2011, 28 (01) : 92 - 95
  • [49] Applying the CRISP-DM data mining process in the financial services industry: Elicitation of adaptation requirements
    Plotnikova, Veronika
    Dumas, Marlon
    Milani, Fredrik P.
    DATA & KNOWLEDGE ENGINEERING, 2022, 139
  • [50] Exploring the Performance of Large Language Models for Data Analysis Tasks Through the CRISP-DM Framework
    Musazade, Nurlan
    Mezei, Jozsef
    Wang, Xiaolu
    GOOD PRACTICES AND NEW PERSPECTIVES IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 5, WORLDCIST 2024, 2024, 989 : 56 - 65