Query Processing Techniques for Big Spatial-Keyword Data

被引:9
|
作者
Mahmood, Ahmed [1 ]
Aref, Walid G. [1 ]
机构
[1] Purdue Univ, W Lafayette, IN 47907 USA
基金
美国国家科学基金会;
关键词
Spatial-keyword; Indexing; Systems; Big Data; Query Processing; SEARCH; PUBLISH/SUBSCRIBE;
D O I
10.1145/3035918.3054773
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The research presented in this paper analyzes different algorithms for scheduling a set of potentially interdependent jobs in order to minimize the total runtime, or makespan, when data communication costs are considered. On distributed file systems, such as the HDFS, files are spread across a cluster of machines. Once a request, such as a query, having as input the data in these files is translated into a set of jobs, these jobs must be scheduled across machines in the cluster. Jobs consume input files stored in the distributed file system or in cluster nodes, and produce output which is potentially consumed by future jobs. If a job needs a particular file as input, the job must either be executed on the same machine, or it must incur a time penalty to copy the file, increasing latency for the job. At the same time, independent jobs are ideally scheduled at the same time on different machines, in order to take advantage of parallelism. Both minimizing communication costs and maximizing parallelism serve to minimize the total runtime of a set of jobs. Furthermore, the problem gets more complex when certain jobs must wait for previous jobs to provide input, as is frequently the case when a set of jobs represents the steps of a query on a distributed database.
引用
收藏
页码:1777 / 1782
页数:6
相关论文
共 50 条
  • [21] A Novel Indexing Method for Spatial-Keyword Range Queries
    Tampakis, Panagiotis
    Spyrellis, Dimitris
    Doulkeridis, Christos
    Pelekis, Nikos
    Kalyvas, Christos
    Vlachou, Akrivi
    [J]. PROCEEDINGS OF 17TH INTERNATIONAL SYMPOSIUM ON SPATIAL AND TEMPORAL DATABASES, SSTD 2021, 2021, : 54 - 63
  • [22] Big Data and Query Optimization Techniques
    Chugh, Aarti
    Sharma, Vivek Kumar
    Jain, Charu
    [J]. ADVANCES IN COMPUTING AND INTELLIGENT SYSTEMS, ICACM 2019, 2020, : 337 - 345
  • [23] Efficient Collective Spatial Keyword Query Processing on Road Networks
    Gao, Yunjun
    Zhao, Jingwen
    Zheng, Baihua
    Chen, Gang
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2016, 17 (02) : 469 - 480
  • [24] Joint Top-K Spatial Keyword Query Processing
    Wu, Dingming
    Yiu, Man Lung
    Cong, Gao
    Jensen, Christian S.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (10) : 1889 - 1903
  • [25] Efficient Spatial Keyword Query Processing in the Internet of Industrial Vehicles
    Li, Yanhong
    Luo, Changyin
    Zhu, Rongbo
    Chen, Yuanfang
    Zeng, Huacheng
    [J]. MOBILE NETWORKS & APPLICATIONS, 2018, 23 (04): : 864 - 878
  • [26] Diversification on big data in query processing
    Zhang, Meifan
    Wang, Hongzhi
    Li, Jianzhong
    Gao, Hong
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2020, 14 (04)
  • [27] Diversification on big data in query processing
    Meifan Zhang
    Hongzhi Wang
    Jianzhong Li
    Hong Gao
    [J]. Frontiers of Computer Science, 2020, 14
  • [28] Temporal Spatial-Keyword Search on Databases Using SQL
    Wang, Jingru
    Hou, Jiajia
    Huang, Feiran
    Lu, Wei
    Du, Xiaoyong
    [J]. WEB TECHNOLOGIES AND APPLICATIONS: APWEB 2016 WORKSHOPS, WDMA, GAP, AND SDMA, 2016, 9865 : 204 - 216
  • [29] Efficient Spatial Keyword Query Processing in the Internet of Industrial Vehicles
    Yanhong Li
    Changyin Luo
    Rongbo Zhu
    Yuanfang Chen
    Huacheng Zeng
    [J]. Mobile Networks and Applications, 2018, 23 : 864 - 878
  • [30] Temporal Spatial-Keyword Top-k Publish/Subscribe
    Chen, Lisi
    Cong, Gao
    Cao, Xin
    Tan, Kian-Lee
    [J]. 2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2015, : 255 - 266