Efficient Top-k Dominating Computation on Massive Data

Cited by: 11
Authors
Han, Xixian [1]
Li, Jianzhong [1]
Gao, Hong [1]
Affiliations
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Heilongjiang, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Massive data; TDTS algorithm; table scan; early termination; pruning operation; skyline computation; algorithms
DOI
10.1109/TKDE.2017.2665619
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In many applications, the top-k dominating query is an important operation that returns the k tuples with the highest domination scores from a potentially huge data space. Our analysis shows that the existing algorithms suffer from performance problems when run on massive data. This paper proposes TDTS, a novel table-scan-based algorithm that computes top-k dominating results efficiently. TDTS first presorts the table to enable early termination. The paper presents the early-termination check together with a theoretical analysis of the scan depth, and devises a pruning operation for tuples. The theoretical analysis of the pruning effect shows that the number of tuples maintained by TDTS can be reduced substantially. Extensive experiments on synthetic and real-life data sets show that TDTS significantly outperforms the existing algorithms.
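To make the query in the abstract concrete, the following is a minimal brute-force sketch of a top-k dominating query. It is not the TDTS algorithm: it performs no presorting, early termination, or pruning, and it compares every pair of tuples. The function names `dominates` and `top_k_dominating` are hypothetical, and the sketch assumes the common convention that smaller attribute values are better, which the abstract does not specify.

```python
from heapq import nlargest
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def dominates(p: Sequence[float], q: Sequence[float]) -> bool:
    """Return True if p dominates q.

    Assumption: smaller is better, so p dominates q when p is no worse
    in every attribute and strictly better in at least one.
    """
    no_worse = all(a <= b for a, b in zip(p, q))
    strictly_better = any(a < b for a, b in zip(p, q))
    return no_worse and strictly_better


def top_k_dominating(data: List[Point], k: int) -> List[Tuple[int, Point]]:
    """Naive O(n^2) baseline: score each tuple by how many tuples it dominates.

    Returns up to k (score, tuple) pairs ordered by descending score.
    """
    scored = [(sum(dominates(p, q) for q in data), p) for p in data]
    return nlargest(k, scored, key=lambda pair: pair[0])


if __name__ == "__main__":
    sample = [(1, 1), (2, 2), (3, 1), (1, 3), (3, 3)]
    print(top_k_dominating(sample, 1))
```

The quadratic number of pairwise comparisons in this baseline is the kind of cost that becomes impractical on massive data, which is the setting the paper targets.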
Pages: 1199-1211
Page count: 13
Related papers
50 in total
  • [1] Efficient Top-k Dominating Computation on Massive Data (Extended abstract)
    Han, Xixian
    Li, Jianzhong
    Gao, Hong
    [J]. 2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1771 - 1772
  • [2] Ranking the big sky: efficient top-k skyline computation on massive data
    Xixian Han
    Bailing Wang
    Jianzhong Li
    Hong Gao
    [J]. Knowledge and Information Systems, 2019, 60 : 415 - 446
  • [3] Ranking the big sky: efficient top-k skyline computation on massive data
    Han, Xixian
    Wang, Bailing
    Li, Jianzhong
    Gao, Hong
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 60 (01) : 415 - 446
  • [4] Efficient Top-k Retrieval on Massive Data
    Han, Xixian
    Li, Jianzhong
    Gao, Hong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (10) : 2687 - 2699
  • [5] Efficient Top-k Retrieval on Massive Data
    Han, Xixian
    Li, Jianzhong
    Gao, Hong
    [J]. 2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1496 - 1497
  • [6] TDEP: efficiently processing top-k dominating query on massive data
    Xixian Han
    Jianzhong Li
    Hong Gao
    [J]. Knowledge and Information Systems, 2015, 43 : 689 - 718
  • [7] TDEP: efficiently processing top-k dominating query on massive data
    Han, Xixian
    Li, Jianzhong
    Gao, Hong
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 43 (03) : 689 - 718
  • [8] Top-k Dominating Queries on Incomplete Data
    Miao, Xiaoye
    Gao, Yunjun
    Zheng, Baihua
    Chen, Gang
    Cui, Huiyong
    [J]. 2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1500 - 1501
  • [9] Top-k Dominating Queries on Incomplete Data
    Miao, Xiaoye
    Gao, Yunjun
    Zheng, Baihua
    Chen, Gang
    Cui, Huiyong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (01) : 252 - 266
  • [10] Efficient computation of frequent and top-k elements in data streams
    Metwally, A
    Agrawal, D
    El Abbadi, A
    [J]. DATABASE THEORY - ICDT 2005, PROCEEDINGS, 2005, 3363 : 398 - 412