A comparative study of dimensionality reduction techniques to enhance trace clustering performances

Citations: 52
Authors
Song, M. [1]
Yang, H. [1]
Siadat, S. H. [1]
Pechenizkiy, M. [2]
Affiliations
[1] Ulsan Natl Inst Sci & Technol, Sch Technol Management, Ulsan 689798, South Korea
[2] Eindhoven Univ Technol, Dept Comp Sci, NL-5612 AZ Eindhoven, Netherlands
Funding
National Research Foundation of Singapore;
Keywords
Process mining; Trace clustering; Singular value decomposition; Random projection; PCA; SINGULAR VALUE DECOMPOSITION; RANDOM PROJECTIONS; PROCESS MODELS; CHECKING; SUPPORT;
DOI
10.1016/j.eswa.2012.12.078
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Process mining techniques have been used to analyze event logs from information systems in order to derive useful patterns. However, in the big data era, real-life event logs are so huge, unstructured, and complex that traditional process mining techniques have difficulty analyzing them. To reduce the complexity of the analysis, trace clustering can be used to group similar traces together and to mine simpler, more structured process models for each cluster locally. However, the high dimensionality of the feature space in which the traces are represented poses problems for trace clustering. In this paper, we study the effect of applying dimensionality reduction (preprocessing) techniques on the performance of trace clustering. In our experimental study we use three popular feature transformation techniques: singular value decomposition (SVD), random projection (RP), and principal components analysis (PCA), together with state-of-the-art trace clustering in process mining. The experimental results on a dataset constructed from a real event log of patient treatment processes in a Dutch hospital show that dimensionality reduction can improve trace clustering performance with respect to computation time and the average fitness of the mined local process models. (C) 2012 Elsevier Ltd. All rights reserved.
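As a rough illustration of the preprocessing step the abstract describes, the sketch below builds one feature vector per trace and reduces its dimensionality with random projection (RP), one of the three techniques named above. It is not the authors' implementation: the activity-count profile, the Gaussian projection matrix, the function names `activity_profile` and `random_projection`, and the toy traces are all assumptions made for the example. SVD and PCA are not shown.

```python
import math
import random


def activity_profile(traces):
    """Map each trace (a list of activity names) to a vector of activity counts.

    This count-based profile is an assumed feature representation.
    """
    activities = sorted({a for t in traces for a in t})
    index = {a: i for i, a in enumerate(activities)}
    vectors = []
    for t in traces:
        v = [0.0] * len(activities)
        for a in t:
            v[index[a]] += 1.0
        vectors.append(v)
    return activities, vectors


def random_projection(vectors, k, seed=0):
    """Project d-dimensional vectors onto k dimensions with a Gaussian matrix."""
    rng = random.Random(seed)
    d = len(vectors[0])
    # Entries have variance 1/k, so squared lengths are preserved in expectation.
    R = [[rng.gauss(0.0, 1.0 / math.sqrt(k)) for _ in range(k)] for _ in range(d)]
    return [
        [sum(v[i] * R[i][j] for i in range(d)) for j in range(k)]
        for v in vectors
    ]


# Hypothetical traces; the hospital log used in the paper is not reproduced here.
traces = [
    ["register", "blood_test", "consult"],
    ["register", "blood_test", "consult"],
    ["register", "x_ray", "surgery", "consult"],
    ["register", "x_ray", "x_ray", "surgery"],
]
activities, profiles = activity_profile(traces)
reduced = random_projection(profiles, k=2)  # 5 features reduced to 2
```

The reduced vectors would then be passed to a clustering algorithm in place of the original profiles. Identical traces still map to identical points, so they remain in the same cluster after projection.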
Pages: 3722-3737
Page count: 16
Related papers
50 in total
  • [31] Comparative study of linear and nonlinear dimensionality reduction for speaker identification
    Errity, Andrew
    McKenna, John
    [J]. PROCEEDINGS OF THE 2007 15TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, 2007, : 587 - +
  • [32] A Variant of the Trace Quotient Formulation for Dimensionality Reduction
    Wang, Peng
    Shen, Chunhua
    Zheng, Hong
    Ren, Zhang
    [J]. COMPUTER VISION - ACCV 2009, PT III, 2010, 5996 : 277 - +
  • [33] Comparative Analysis of Dimensionality Reduction Algorithms, Case Study: PCA
    Agarwal, Sugandha
    Ranjan, Priya
    Ujlayan, Amit
    [J]. PROCEEDINGS OF 2017 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO 2017), 2017, : 255 - 259
  • [34] Dimensionality reduction techniques for blog visualization
    Tsai, Flora S.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (03) : 2766 - 2773
  • [35] Dimensionality reduction techniques for proximity problems
    Indyk, P
    [J]. PROCEEDINGS OF THE ELEVENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2000, : 371 - 378
  • [36] Scalable Supervised Dimensionality Reduction Using Clustering
    Raeder, Troy
    Perlich, Claudia
    Dalessandro, Brian
    Stitelman, Ori
    Provost, Foster
    [J]. 19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), 2013, : 1213 - 1221
  • [37] Patent Document Clustering Using Dimensionality Reduction
    Girthana, K.
    Swamynathan, S.
    [J]. PROGRESS IN ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, VOL 2, 2018, 564 : 167 - 176
  • [38] Dimensionality reduction via genetic value clustering
    Topchy, A
    Punch, W
    [J]. GENETIC AND EVOLUTIONARY COMPUTATION - GECCO 2003, PT II, PROCEEDINGS, 2003, 2724 : 1431 - 1443
  • [39] Word Embedding of Dimensionality Reduction for Document Clustering
    Zhu, Pengyu
    Lang, Qi
    Liu, Xiaodong
    [J]. 2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4371 - 4376
  • [40] A Feature Clustering Approach for Dimensionality Reduction and Classification
    VinayKumar, Kotte
    Srinivasan, R.
    Singh, Elijah Blessing
    [J]. MENDEL 2015: RECENT ADVANCES IN SOFT COMPUTING, 2015, 378 : 257 - 268