Efficient Summarization Framework for Multi-Attribute Uncertain Data

被引:10
|
作者
Xu, Jie [1 ]
Kalashnikov, Dmitri, V [1 ]
Mehrotra, Sharad [1 ]
机构
[1] Univ Irvine, Dept Comp Sci, Irvine, CA 92697 USA
关键词
summarization; uncertain data; multi-attributes;
D O I
10.1145/2588555.2588580
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies the problem of automatically selecting a small subset of representatives from a set of objects, where objects: (a) are multi-attributed with each attribute corresponding to different aspects of the object and (b) are associated with uncertainty - the problem that has received little attention in the past. Such object set leads to new challenges in modeling information contained in data, defining appropriate criteria for selecting objects, and in devising efficient algorithms for such a selection. We propose a framework that models objects as a set of the corresponding information units and reduces the summarization problem to that of optimizing prob-abilistic coverage. To solve the resulting NP-hard problem, we develop a highly efficient greedy algorithm, which gains its efficiency by leveraging object-level and iteration-level optimization. A comprehensive empirical evaluation over three real datasets demonstrates that the proposed framework significantly outperforms baseline techniques in terms of quality and also scales very well against the size of dataset.
引用
收藏
页码:421 / 432
页数:12
相关论文
共 50 条
  • [1] A framework for efficient multi-attribute movement data analysis
    Fabio Valdés
    Ralf Hartmut Güting
    [J]. The VLDB Journal, 2019, 28 : 427 - 449
  • [2] A framework for efficient multi-attribute movement data analysis
    Valdes, Fabio
    Gueting, Ralf Hartmut
    [J]. VLDB JOURNAL, 2019, 28 (04): : 427 - 449
  • [3] Efficient Similarity Join and Search on Multi-Attribute Data
    Li, Guoliang
    He, Jian
    Deng, Dong
    Li, Jian
    [J]. SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 1137 - 1151
  • [4] Building fuzzy OLAP using multi-attribute summarization
    Kasinadh, D. P. V.
    Krishna, P. Radha
    [J]. ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL I, PROCEEDINGS, 2007, : 370 - 374
  • [5] An adaptively multi-attribute index framework for big IoT data
    Huang, Chih-Yuan
    Chang, Yu-Jui
    [J]. COMPUTERS & GEOSCIENCES, 2021, 155
  • [6] Efficient multi-attribute pattern matching
    Ando, K
    Mizobuchi, S
    Shishibori, M
    Aoe, J
    [J]. INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 1998, 66 (1-2) : 21 - 38
  • [7] Similarity measure for multi-attribute data
    Li, CJ
    Prabhakaran, B
    Zheng, SQ
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1149 - 1152
  • [8] Multi-Attribute Compressive Data Gathering
    Chen, Guangshuo
    Liu, Xiao-Yang
    Kong, Linghe
    Lu, Jia-Liang
    Wu, Min-You
    [J]. 2014 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2014, : 2178 - 2183
  • [9] A Generic Framework for Automated Multi-attribute Negotiation
    Lai, Guoming
    Sycara, Katia
    [J]. GROUP DECISION AND NEGOTIATION, 2009, 18 (02) : 169 - 187
  • [10] MATF: a multi-attribute trust framework for MANETs
    Muhammad Saleem Khan
    Majid Iqbal Khan
    Saif-Ur-Rehman Malik
    Osman Khalid
    Mukhtar Azim
    Nadeem Javaid
    [J]. EURASIP Journal on Wireless Communications and Networking, 2016