Towards Parallel Spatial Query Processing for Big Spatial Data

被引:37
|
作者
Zhong, Yunqin [1 ,4 ]
Han, Jizhong [1 ]
Zhang, Tieying [1 ,4 ]
Li, Zhenhua [2 ]
Fang, Jinyun [1 ]
Chen, Guihai [3 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Peking Univ, Beijing, Peoples R China
[3] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[4] Grad Univ Chinese Acad Sci, Beijing, Peoples R China
关键词
spatial data management; distributed storage; spatial index; spatial query; spatial applications;
D O I
10.1109/IPDPSW.2012.245
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In recent years, spatial applications have become more and more important in both scientific research and industry. Spatial query processing is the fundamental functioning component to support spatial applications. However, the state-of-the-art techniques of spatial query processing are facing significant challenges as the data expand and user accesses increase. In this paper we propose and implement a novel scheme (named VegaGiStore) to provide efficient spatial query processing over big spatial data and numerous concurrent user queries. Firstly, a geography-aware approach is proposed to organize spatial data in terms of geographic proximity, and this approach can achieve high aggregate I/O throughput. Secondly, in order to improve data retrieval efficiency, we design a two-tier distributed spatial index for efficient pruning of the search space. Thirdly, we propose an "indexing + MapReduce" data processing architecture to improve the computation capability of spatial query. Performance evaluations of the real-deployed VegaGiStore system confirm its effectiveness.
引用
收藏
页码:2085 / 2094
页数:10
相关论文
共 50 条
  • [1] Query Processing Techniques for Big Spatial-Keyword Data
    Mahmood, Ahmed
    Aref, Walid G.
    [J]. SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, : 1777 - 1782
  • [2] Towards stream data parallel processing in spatial aggregating index
    Gorawski, Marcin
    Malczok, Rafal
    [J]. PARALLEL PROCESSING AND APPLIED MATHEMATICS, 2008, 4967 : 209 - 218
  • [3] Spatial Data Indexing and Query Processing in GeoCloud
    Shankar, Karthi
    Sevugan, Prabu
    [J]. JOURNAL OF TESTING AND EVALUATION, 2019, 47 (06) : 4039 - 4053
  • [4] YEfficient Spatial Big Data Storage and Query in HBase
    Wang, Ping
    Xu, Fanhua
    Ma, Meng
    Duan, Lihua
    [J]. 4TH IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD 2019) / 3RD INTERNATIONAL SYMPOSIUM ON REINFORCEMENT LEARNING (ISRL 2019), 2019, : 149 - 155
  • [5] A solution of spatial query processing and query optimization for spatial databases
    YUAN Jie 1
    2. Department of Intelligence Science
    3. Beijing Institute of Surveying and Mapping
    [J]. 重庆邮电大学学报(自然科学版), 2004, (05) : 165 - 172
  • [6] High-Performance Spatial Query Processing on Big Taxi Trip Data using GPGPUs
    Zhang, Jianting
    You, Simin
    Gruenwald, Le
    [J]. 2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 72 - 79
  • [7] An efficient data dissemination scheme for spatial query processing
    Park, Kwangjin
    Choo, Hyunseung
    Hwang, Chong-Sun
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2007, 22 (01): : 131 - 134
  • [8] An Efficient Data Dissemination Scheme for Spatial Query Processing
    Kwangjin Park
    Hyunseung Choo
    Chong-Sun Hwang
    [J]. Journal of Computer Science and Technology, 2007, 22 : 131 - 134
  • [9] Big Spatial Data Processing With Apache Spark
    Boyi Shangguan
    Peng Yue
    Wu, Zhaoyan
    Jiang, Liangcun
    [J]. 2017 6TH INTERNATIONAL CONFERENCE ON AGRO-GEOINFORMATICS, 2017, : 239 - 242
  • [10] Data partitioning for parallel spatial join processing
    Zhou, XF
    Abel, DJ
    Truffet, D
    [J]. ADVANCES IN SPATIAL DATABASES, 1997, 1262 : 178 - 196