Efficient Spark-Based Framework for Big Geospatial Data Query Processing and Analysis

Cited by: 0
Authors
Aljawarneh, Isam Mashhour [1]
Bellavista, Paolo [1]
Corradi, Antonio [1]
Montanari, Rebecca [1]
Foschini, Luca [1]
Zanotti, Andrea [1]
Affiliations
[1] Univ Bologna, Bologna, Italy
Keywords
querying spatial data; MapReduce; big data; Spark
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The volume of geospatial data is growing exponentially and at an accelerating pace, which has motivated the scientific community to examine novel parallel technologies for tuning the performance of spatial queries. Managing spatial data for optimized query performance is a particularly challenging task because of the growing complexity of the geometric computations involved in querying spatial data, to which traditional systems have failed to scale. However, large-scale parallel computing infrastructures built on cost-effective commodity clusters and cloud computing environments introduce new management challenges, such as avoiding bottlenecks caused by overloading scarce computing resources, which may result from an unbalanced distribution of parallel tasks. In this paper, we aim to fill those gaps by introducing a generic framework for optimizing the performance of big spatial data queries on top of Apache Spark. Our framework also supports advanced management functions, including a unique self-adaptable load-balancing service that self-tunes framework execution. Our experimental evaluation shows that the framework is scalable and efficient for querying massive real spatial datasets.
Pages: 851-856
Page count: 6