HPPQ: A Parallel Package Queries Processing Approach for Large-Scale Data

被引:0
|
作者
Meihui Shi [1 ]
Derong Shen [1 ]
Tiezheng Nie [1 ]
Yue Kou [1 ]
Ge Yu [1 ]
机构
[1] the College of Computer Science and Engineering,Northeastern University
基金
中国国家自然科学基金;
关键词
package queries; heuristic algorithms; parallel processing; opposition-based learning;
D O I
暂无
中图分类号
TP311.13 [];
学科分类号
1201 ;
摘要
A lot of scholars have focused on developing effective techniques for package queries, and a lot of excellent approaches have been proposed. Unfortunately, most of the existing methods focus on a small volume of data. The rapid increase in data volume means that traditional methods of package queries find it difficult to meet the increasing requirements. To solve this problem, a novel optimization method of package queries(HPPQ) is proposed in this paper. First, the data is preprocessed into regions. Data preprocessing segments the dataset into multiple subsets and the centroid of the subsets is used for package queries, this effectively reduces the volume of candidate results. Furthermore, an efficient heuristic algorithm is proposed(namely IPOL-HS) based on the preprocessing results. This improves the quality of the candidate results in the iterative stage and improves the convergence rate of the heuristic algorithm. Finally, a strategy called HPR is proposed, which relies on a greedy algorithm and parallel processing to accelerate the rate of query. The experimental results show that our method can significantly reduce time consumption compared with existing methods.
引用
收藏
页码:146 / 159
页数:14
相关论文
共 50 条
  • [1] HPPQ: A Parallel Package Queries Processing Approach for Large-Scale Data
    Shi, Meihui
    Shen, Derong
    Nie, Tiezheng
    Kou, Yue
    Yu, Ge
    [J]. BIG DATA MINING AND ANALYTICS, 2018, 1 (02): : 146 - 159
  • [2] Dynamic and fast processing of queries on large-scale RDF data
    Pingpeng Yuan
    Changfeng Xie
    Hai Jin
    Ling Liu
    Guang Yang
    Xuanhua Shi
    [J]. Knowledge and Information Systems, 2014, 41 : 311 - 334
  • [3] Dynamic and fast processing of queries on large-scale RDF data
    Yuan, Pingpeng
    Xie, Changfeng
    Jin, Hai
    Liu, Ling
    Yang, Guang
    Shi, Xuanhua
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (02) : 311 - 334
  • [4] Parallel Strategy for the Large-Scale Data Streams Processing
    Yuan, Ya-Juan
    Ma, Guo-Jie
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND INFORMATION SYSTEMS, 2016, 52 : 232 - 234
  • [5] Extension of Parallel Primitives and Their Applications to Large-Scale Data Processing
    Nakano, Masashi
    Chang, Qiong
    Miyazaki, Jun
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT II, DEXA 2024, 2024, 14911 : 248 - 253
  • [6] Designing Parallel Data Processing for Large-Scale Sensor Orchestration
    Kabac, Milan
    Consel, Charles
    [J]. 2016 INT IEEE CONFERENCES ON UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING AND COMMUNICATIONS, CLOUD AND BIG DATA COMPUTING, INTERNET OF PEOPLE, AND SMART WORLD CONGRESS (UIC/ATC/SCALCOM/CBDCOM/IOP/SMARTWORLD), 2016, : 57 - 65
  • [7] LARGE-SCALE PARALLEL PROCESSING SYSTEMS
    SIEGEL, HJ
    SCHWEDERSKI, T
    MEYER, DG
    HSU, WT
    [J]. MICROPROCESSORS AND MICROSYSTEMS, 1987, 11 (01) : 3 - 20
  • [8] A data parallel approach for large-scale Gaussian process modeling
    Choudhury, A
    Nair, PB
    Keane, AJ
    [J]. PROCEEDINGS OF THE SECOND SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2002, : 95 - 111
  • [9] Parallel Approach and Platform for Large-scale Web Data Extraction
    Shen, Yi
    Shi, Shengsheng
    Wang, Haitao
    Wei, Wu
    Yuan, Chunfeng
    Huang, Yihua
    [J]. 2013 INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2013, : 192 - 196
  • [10] Distributed frameworks and parallel algorithms for processing large-scale geographic data
    Hawick, KA
    Coddington, PD
    James, HA
    [J]. PARALLEL COMPUTING, 2003, 29 (10) : 1297 - 1333