Summarizing Uncertain Transaction Databases by Probabilistic Tiles

Authors
Liu, Chunyang [1 ]
Chen, Ling [1 ]
Affiliations
[1] University of Technology Sydney, Centre for Quantum Computation and Intelligent Systems, Sydney, NSW, Australia
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Transaction data mining is ubiquitous across domains and has been researched extensively. In recent years, because uncertainty is inherent in many real-world applications, uncertain data mining has attracted considerable research attention. Among the research problems, summarization is important because it produces concise and informative results that facilitate further analysis. However, few studies have explored how to summarize uncertain transaction data effectively. In this paper, we formulate the problem of summarizing uncertain transaction data as Minimal Probabilistic Tile Cover Mining, which aims to find a high-quality probabilistic tile set covering an uncertain database with minimal cost. We define the concepts of Probabilistic Price and Probabilistic Price Order to evaluate and compare the quality of tiles, and propose a framework to discover the minimal probabilistic tile cover. The bottleneck is checking whether one tile is better than another according to the Probabilistic Price Order, which involves the computation of a joint probability. We prove that this probability can be decomposed into independent terms and calculated efficiently. Several optimization techniques are devised to further improve performance. Experimental results on real-world datasets demonstrate the conciseness of the produced tiles and the effectiveness and efficiency of our approach.
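For readers unfamiliar with the setting, the following is a minimal illustrative sketch of what a tile over an uncertain transaction database might look like. It is not the paper's method: the abstract does not define Probabilistic Price or Probabilistic Price Order, so neither is implemented here. The sketch assumes the common item-level uncertainty model, in which each item in a transaction carries an independent existence probability, and treats a tile as a set of transactions paired with a set of items. The names `tile_full_probability`, `tile_expected_area`, and the toy database are hypothetical.

```python
from math import prod
from typing import Dict, Iterable

# Assumed model (not taken from the paper): each transaction maps an item to
# the probability that the item is really present, and all cells are independent.
UncertainDB = Dict[str, Dict[str, float]]


def tile_full_probability(db: UncertainDB, tids: Iterable[str], items: Iterable[str]) -> float:
    """Probability that every (transaction, item) cell of the tile is present.

    Under the independence assumption the joint probability is simply the
    product of the per-cell probabilities; a missing item counts as 0.
    """
    items = list(items)
    return prod(db[t].get(i, 0.0) for t in tids for i in items)


def tile_expected_area(db: UncertainDB, tids: Iterable[str], items: Iterable[str]) -> float:
    """Expected number of present cells inside the tile (linearity of expectation)."""
    items = list(items)
    return sum(db[t].get(i, 0.0) for t in tids for i in items)


# Hypothetical toy database with three uncertain transactions.
db: UncertainDB = {
    "t1": {"a": 0.9, "b": 0.8},
    "t2": {"a": 0.5, "b": 1.0, "c": 0.4},
    "t3": {"c": 0.7},
}

# A tile covering transactions t1 and t2 on items a and b.
tile_tids, tile_items = ["t1", "t2"], ["a", "b"]
full_p = tile_full_probability(db, tile_tids, tile_items)
exp_area = tile_expected_area(db, tile_tids, tile_items)
```

The product form only illustrates why independence makes a joint probability cheap to evaluate; the decomposition proved in the paper concerns the comparison of two tiles and may differ in its details.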
Pages: 4375-4382
Number of pages: 8