A new classification of datasets for frequent itemsets

被引:0
|
作者
Frédéric Flouvat
Fabien De Marchi
Jean-Marc Petit
机构
[1] University of New Caledonia,
[2] PPME,undefined
[3] Université de Lyon,undefined
[4] Université Lyon 1,undefined
[5] LIRIS,undefined
[6] UMR5205 CNRS,undefined
[7] Université de Lyon,undefined
[8] INSA-Lyon,undefined
[9] LIRIS,undefined
[10] UMR5205 CNRS,undefined
关键词
Pattern mining; Classification of datasets; Experimental study;
D O I
暂无
中图分类号
学科分类号
摘要
The discovery of frequent patterns is a famous problem in data mining. While plenty of algorithms have been proposed during the last decade, only a few contributions have tried to understand the influence of datasets on the algorithms behavior. Being able to explain why certain algorithms are likely to perform very well or very poorly on some datasets is still an open question. In this setting, we describe a thorough experimental study of datasets with respect to frequent itemsets. We study the distribution of frequent itemsets with respect to itemsets size together with the distribution of three concise representations: frequent closed, frequent free and frequent essential itemsets. For each of them, we also study the distribution of their positive and negative borders whenever possible. The main outcome of these experiments is a new classification of datasets invariant w.r.t. minsup variations and robust to explain efficiency of several implementations.
引用
收藏
页码:1 / 19
页数:18
相关论文
共 50 条
  • [1] A new classification of datasets for frequent itemsets
    Flouvat, Frederic
    De Marchi, Fabien
    Petit, Jean-Marc
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2010, 34 (01) : 1 - 19
  • [2] A thorough experimental study of datasets for frequent itemsets
    Flouvat, F
    De Marchi, F
    Petit, JM
    [J]. FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 162 - 169
  • [3] Algorithms for mining frequent itemsets in static and dynamic datasets
    Hernandez-Leon, R.
    Hernandez-Palancar, J.
    Carrasco-Ochoa, Jesus A.
    Fco Martinez-Trinidad, Jose
    [J]. INTELLIGENT DATA ANALYSIS, 2010, 14 (03) : 419 - 435
  • [4] Comparing Datasets Using Frequent Itemsets: Dependency on the Mining Parameters
    Ntoutsi, Irene
    Theodoridis, Yannis
    [J]. ARTIFICIAL INTELLIGENCE: THEORIES, MODELS AND APPLICATIONS, SETN 2008, 2008, 5138 : 212 - 225
  • [5] A Novel Incremental Algorithm for Frequent Itemsets Mining in Dynamic Datasets
    Hernandez-Leon, Raudel
    Hernandez-Palancar, Jose
    Carrasco-Ochoa, J. A.
    Martinez-Trinidad, J. Fco
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2008, 5197 : 145 - +
  • [6] On the Efficient Representation of Datasets as Graphs to Mine Maximal Frequent Itemsets
    Halim, Zahid
    Ali, Omer
    Khan, Muhammad Ghufran
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (04) : 1674 - 1691
  • [7] Text classification using sentential frequent itemsets
    Liu, Shi-Zhu
    Hu, He-Ping
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2007, 22 (02) : 334 - 337
  • [8] Text Classification Using Sentential Frequent Itemsets
    Shi-Zhu Liu
    He-Ping Hu
    [J]. Journal of Computer Science and Technology, 2007, 22 : 334 - 337
  • [9] Associative classification based on closed frequent itemsets
    [J]. Li, X.-M., 1600, Univ. of Electronic Science and Technology of China (41):
  • [10] A new feature selection method based on frequent and associated itemsets for text classification
    Farghaly, Heba Mamdouh
    Abd El-Hafeez, Tarek
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (25):