A comparative study of dimensionality reduction techniques to enhance trace clustering performances

被引:52
|
作者
Song, M. [1 ]
Yang, H. [1 ]
Siadat, S. H. [1 ]
Pechenizkiy, M. [2 ]
机构
[1] Ulsan Natl Inst Sci & Technol, Sch Technol Management, Ulsan 689798, South Korea
[2] Eindhoven Univ Technol, Dept Comp Sci, NL-5612 AZ Eindhoven, Netherlands
基金
新加坡国家研究基金会;
关键词
Process mining; Trace clustering; Singular value decomposition; Random projection; PCA; SINGULAR VALUE DECOMPOSITION; RANDOM PROJECTIONS; PROCESS MODELS; CHECKING; SUPPORT;
D O I
10.1016/j.eswa.2012.12.078
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Process mining techniques have been used to analyze event logs from information systems in order to derive useful patterns. However, in the big data era, real-life event logs are huge, unstructured, and complex so that traditional process mining techniques have difficulties in the analysis of big logs. To reduce the complexity during the analysis, trace clustering can be used to group similar traces together and to mine more structured and simpler process models for each of the clusters locally. However, a high dimensionality of the feature space in which all the traces are presented poses different problems to trace clustering. In this paper, we study the effect of applying dimensionality reduction (preprocessing) techniques on the performance of trace clustering. In our experimental study we use three popular feature transformation techniques; singular value decomposition (SVD), random projection (RP), and principal components analysis (PCA), and the state-of-the art trace clustering in process mining. The experimental results on the dataset constructed from a real event log recorded from patient treatment processes in a Dutch hospital show that dimensionality reduction can improve trace clustering performance with respect to the computation time and average fitness of the mined local process models. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3722 / 3737
页数:16
相关论文
共 50 条
  • [41] Soft dimensionality reduction for reinforcement data clustering
    Fatemeh Fathinezhad
    Peyman Adibi
    Bijan Shoushtarian
    Hamidreza Baradaran Kashani
    Jocelyn Chanussot
    [J]. World Wide Web, 2023, 26 : 3027 - 3054
  • [42] Dimensionality Reduction for Distance Based Video Clustering
    Thiagarajan, Jayaraman J.
    Ramamurthy, Karthikeyan N.
    Spanias, Andreas
    [J]. ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, 2010, 339 : 270 - 277
  • [43] On Some Fuzzy Clustering Algorithms with Dimensionality Reduction
    Kawamura, Masanori
    Kanzawa, Yuchi
    [J]. 2022 JOINT 12TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS AND 23RD INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (SCIS&ISIS), 2022,
  • [44] Analysis of Unsupervised Dimensionality Reduction Techniques
    Kumar, Ch. Aswani
    [J]. COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2009, 6 (02) : 217 - 227
  • [45] A Review Paper on Dimensionality Reduction Techniques
    Mulla, Faizan Riyaz
    Gupta, Anil Kumar
    [J]. JOURNAL OF PHARMACEUTICAL NEGATIVE RESULTS, 2022, 13 : 1263 - 1272
  • [46] Classification through Hierarchical Clustering and Dimensionality Reduction
    Syrris, Vassilis
    Petridis, Vassilios
    [J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 1598 - 1603
  • [47] Dimensionality reduction by feature clustering for regression problems
    Xu, Rong-Fang
    Lee, Shie-Jue
    [J]. INFORMATION SCIENCES, 2015, 299 : 42 - 57
  • [48] A Computational Framework for Nonlinear Dimensionality Reduction and Clustering
    Wismueller, Axel
    [J]. ADVANCES IN SELF-ORGANIZING MAPS, PROCEEDINGS, 2009, 5629 : 334 - 343
  • [49] Dimensionality reduction techniques for data exploration
    Tsai, Flora S.
    Chan, Kap Luk
    [J]. 2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 1568 - 1572
  • [50] Soft dimensionality reduction for reinforcement data clustering
    Fathinezhad, Fatemeh
    Adibi, Peyman
    Shoushtarian, Bijan
    Baradaran Kashani, Hamidreza
    Chanussot, Jocelyn
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (05): : 3027 - 3054