Frequent Item Mining When Obtaining Support Is Costly

被引:0
|
作者
Lin, Joe Wing-Ho [1 ]
Wong, Raymond Chi-Wing [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Kowloon, Hong Kong, Peoples R China
来源
BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2019 | 2019年 / 11708卷
关键词
Frequent item mining; Random sampling;
D O I
10.1007/978-3-030-27520-4_4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Suppose there are n users and m items, and the preference of each user for the items is revealed only upon probing, which takes time and is therefore costly. How can we quickly discover all the frequent items that are favored individually by at least a given number of users? This new problem not only has strong connections with several well-known problems, such as the frequent item mining problem, it also finds applications in fields such as sponsored search and marketing surveys. Unlike traditional frequent item mining, however, our problem assumes no prior knowledge of users' preferences, and thus obtaining the support of an item becomes costly. Although our problem can be settled naively by probing the preferences of all n users, the number of users is typically enormous, and each probing itself can also incur a prohibitive cost. We present a sampling algorithm that drastically reduces the number of users needed to probe to O(logm)-regardless of the number of users-as long as slight inaccuracy in the output is permitted. For reasonably sized input, our algorithm needs to probe only 0.5% of the users, whereas the naive approach needs to probe all of them.
引用
收藏
页码:37 / 56
页数:20
相关论文
共 50 条
  • [41] A NEW FREQUENT ITEM SET MINING ALGORITHM BASED ON INTERVAL INTERSECTION
    Yungho-Leu
    Utami, Vania
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOL. 2, 2015, : 471 - 477
  • [42] Frequent Item Set Mining of Large Datasets Using CUDA Computing
    Karthik, Peddi
    Banu, J. Saira
    SOFT COMPUTING FOR PROBLEM SOLVING, SOCPROS 2018, VOL 2, 2020, 1057 : 739 - 747
  • [43] Compressing Neural Networks by Applying Frequent Item-Set Mining
    Dou, Zi-Yi
    Huang, Shu-Jian
    Su, Yi-Fan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 696 - 704
  • [44] Data Elimination Based Technique for Mining Frequent Closed Item Set
    Ahuja, Kamlesh
    Jain, Sarika
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON ICT IN BUSINESS INDUSTRY & GOVERNMENT (ICTBIG), 2016,
  • [45] INSULATION - A COSTLY ITEM.
    Ripley, Don
    Naval Architect, 1988,
  • [46] Computing the minimum-support for mining frequent patterns
    Shichao Zhang
    Xindong Wu
    Chengqi Zhang
    Jingli Lu
    Knowledge and Information Systems, 2008, 15 : 233 - 257
  • [47] Computing the minimum-support for mining frequent patterns
    Zhang, Shichao
    Wu, Xindong
    Zhang, Chengqi
    Lu, Jingli
    KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 15 (02) : 233 - 257
  • [48] Index support for frequent itemset mining in a relational DBMS
    Baralis, E
    Cerquitelli, T
    Chiusano, S
    ICDE 2005: 21ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2005, : 754 - 765
  • [49] A Compact Data Structure Based Technique for Mining Frequent Closed Item Sets
    Ahuja, Kamlesh
    Mishra, Durgesh Kumar
    Jain, Sarika
    SMART TRENDS IN INFORMATION TECHNOLOGY AND COMPUTER COMMUNICATIONS, SMARTCOM 2016, 2016, 628 : 503 - 508
  • [50] Maximal Frequent Item Sequences Mining of Datasets with Few Attributes and Large Instances
    Zhou, Lijuan
    Zhang, Zhang
    Li, Shuang
    FRONTIERS OF MANUFACTURING AND DESIGN SCIENCE, PTS 1-4, 2011, 44-47 : 3304 - 3308