A new classification of datasets for frequent itemsets

Authors
Frédéric Flouvat
Fabien De Marchi
Jean-Marc Petit
Affiliations
[1] University of New Caledonia
[2] PPME
[3] Université de Lyon
[4] Université Lyon 1
[5] LIRIS
[6] UMR5205 CNRS
[7] Université de Lyon
[8] INSA-Lyon
[9] LIRIS
[10] UMR5205 CNRS
Keywords
Pattern mining; Classification of datasets; Experimental study
DOI
Not available
Abstract
The discovery of frequent patterns is a well-known problem in data mining. While many algorithms have been proposed over the last decade, only a few contributions have tried to understand how datasets influence the algorithms' behavior. Explaining why certain algorithms are likely to perform very well or very poorly on some datasets is still an open question. In this setting, we describe a thorough experimental study of datasets with respect to frequent itemsets. We study the distribution of frequent itemsets with respect to itemset size, together with the distribution of three concise representations: frequent closed, frequent free, and frequent essential itemsets. For each of them, we also study the distribution of their positive and negative borders whenever possible. The main outcome of these experiments is a new classification of datasets that is invariant w.r.t. minsup variations and robust enough to explain the efficiency of several implementations.
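To make the quantities in the abstract concrete, the following is a minimal brute-force sketch of how a distribution of frequent, closed, and free itemsets by itemset size could be computed, along with the positive border (maximal frequent itemsets). It is not the authors' implementation. The toy transactions, the function names, and the use of an absolute minsup count are all assumptions made for illustration, and essential itemsets and negative borders are left out for brevity.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy transaction database (not taken from the paper).
TRANSACTIONS = [
    {"a", "b", "c"},
    {"a", "b", "c"},
    {"a", "b"},
    {"a"},
    {"c"},
]


def support(itemset, transactions):
    """Count the transactions that contain every item of `itemset`."""
    return sum(1 for t in transactions if itemset <= t)


def frequent_itemsets(transactions, minsup):
    """Brute-force enumeration mapping each frequent itemset to its support.

    `minsup` is an absolute count here (an assumption for this sketch).
    """
    items = sorted(set().union(*transactions))
    result = {}
    for k in range(1, len(items) + 1):
        for combo in combinations(items, k):
            candidate = frozenset(combo)
            s = support(candidate, transactions)
            if s >= minsup:
                result[candidate] = s
    return result


def closed_itemsets(freq):
    """Frequent itemsets that have no proper superset with the same support."""
    return {x for x, s in freq.items()
            if not any(x < y and freq[y] == s for y in freq)}


def free_itemsets(freq, transactions):
    """Frequent itemsets whose support drops relative to every proper subset.

    Support is anti-monotone, so checking the subsets obtained by removing
    one item is sufficient. The empty set is contained in every transaction.
    """
    n = len(transactions)

    def supp(x):
        return freq[x] if x else n

    return {x for x in freq if all(supp(x - {i}) > freq[x] for i in x)}


def positive_border(freq):
    """Maximal frequent itemsets: no proper superset is frequent."""
    return {x for x in freq if not any(x < y for y in freq)}


def size_distribution(itemsets):
    """Number of itemsets of each size."""
    return Counter(len(x) for x in itemsets)


if __name__ == "__main__":
    freq = frequent_itemsets(TRANSACTIONS, minsup=2)
    collections = {
        "frequent": freq,
        "closed": closed_itemsets(freq),
        "free": free_itemsets(freq, TRANSACTIONS),
        "positive border": positive_border(freq),
    }
    for name, itemsets in collections.items():
        print(name, sorted(size_distribution(itemsets).items()))
```

Running the sketch for several minsup values and comparing the shapes of the resulting distributions is one way to get a feel for the kind of per-dataset profile the study describes. Exhaustive enumeration is exponential in the number of items, so this only suits very small examples.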
Pages: 1-19
Number of pages: 18