Entropy-Based Scheduling Policy for Cross Aggregate Ranking Workloads

被引:2
|
作者
Dai, Chengcheng [1 ]
Nutanong, Sarana [1 ]
Chow, Chi-Yin [1 ]
Cheng, Reynold [2 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
[2] Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
关键词
Query processing; knowledge and data engineering tools and techniques; TOP-K QUERIES; CLEANING UNCERTAIN DATA; SEARCH; MAPREDUCE;
D O I
10.1109/TSC.2016.2586062
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many data exploration applications require the ability to identify the top-k results according to a scoring function. We study a class of top-k ranking problems where top-k candidates in a dataset are scored with the assistance of another set. We call this class of workloads cross aggregate ranking. Example computation problems include evaluating the Hausdorff distance between two datasets, finding the medoid or radius within one dataset, and finding the closest or farthest pair between two datasets. In this paper, we propose a parallel and distributed solution to process cross aggregate ranking workloads. Our solution subdivides the aggregate score computation of each candidate into tasks while constantly maintains the tentative top-k results as an uncertain top-k result set. The crux of our proposed approach lies in our entropy-based scheduling technique to determine result-yielding tasks based on their abilities to reduce the uncertainty of the tentative result set. Experimental results show that our proposed approach consistently outperforms the best existing one in two different types of cross aggregate rank workloads using real datasets.
引用
收藏
页码:507 / 520
页数:14
相关论文
共 50 条
  • [1] An Entropy-Based Technique for Conferences Ranking
    Majeed, Fiaz
    Ul Haq, Rana Azhar
    [J]. DATA MANAGEMENT, ANALYTICS AND INNOVATION, ICDMAI 2019, VOL 1, 2020, 1042 : 229 - 239
  • [2] Entropy-based discretization methods for ranking data
    de Sa, Claudio Rebelo
    Soares, Carlos
    Knobbe, Arno
    [J]. INFORMATION SCIENCES, 2016, 329 : 921 - 936
  • [3] Influential nodes ranking in complex networks: An entropy-based approach
    Zareie, Ahmad
    Sheikhahmadi, Amir
    Fatemi, Adel
    [J]. CHAOS SOLITONS & FRACTALS, 2017, 104 : 485 - 494
  • [4] Entropy-based Clustering for Improving Document Re-ranking
    Teng, Chong
    He, Yanxiang
    Ji, Donghong
    zhou, Cheng
    Geng, Yixuan
    Chen, Shu
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 3, 2009, : 662 - +
  • [5] An Entropy-based PSO for DAR task scheduling problem
    Zhang, Haowei
    Xie, Junwei
    Ge, Jiaang
    Lu, Wenlong
    Zong, Binfeng
    [J]. APPLIED SOFT COMPUTING, 2018, 73 : 862 - 873
  • [6] The Structure Entropy-Based Node Importance Ranking Method for Graph Data
    Liu, Shihu
    Gao, Haiyan
    [J]. ENTROPY, 2023, 25 (06)
  • [7] Tackling Information Asymmetry in Networks: A New Entropy-Based Ranking Index
    Paolo Barucca
    Guido Caldarelli
    Tiziano Squartini
    [J]. Journal of Statistical Physics, 2018, 173 : 1028 - 1044
  • [8] Tackling Information Asymmetry in Networks: A New Entropy-Based Ranking Index
    Barucca, Paolo
    Caldarelli, Guido
    Squartini, Tiziano
    [J]. JOURNAL OF STATISTICAL PHYSICS, 2018, 173 (3-4) : 1028 - 1044
  • [9] Entropy-based scheduling of resource-constrained construction projects
    Christodoulou, Symeon
    Ellinas, Georgios
    Aslani, Pooyan
    [J]. AUTOMATION IN CONSTRUCTION, 2009, 18 (07) : 919 - 928
  • [10] Entropy-Based Heuristic for Resource-Constrained Project Scheduling
    Christodoulou, Symeon E.
    [J]. JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2017, 31 (03)