Efficient Spark-Based Framework for Big Geospatial Data Query Processing and Analysis

被引:0
|
作者
Aljawarneh, Isam Mashhour [1 ]
Bellavista, Paolo [1 ]
Corradi, Antonio [1 ]
Montanari, Rebecca [1 ]
Foschini, Luca [1 ]
Zanotti, Andrea [1 ]
机构
[1] Univ Bologna, Bologna, Italy
关键词
querying spatial data; MapReduce; big data; spark; MAPREDUCE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The exponential amount of geospatial data that has been accumulated in an accelerated pace has inevitably motivated the scientific community to examine novel parallel technologies for tuning the performance of spatial queries. Managing spatial data for an optimized query performance is particularly a challenging task. This is due to the growing complexity of geometric computations involved in querying spatial data, where traditional systems failed to beneficially expand. However, the use of large-scale and parallel-based computing infrastructures based on cost-effective commodity clusters and cloud computing environments introduces new management challenges to avoid bottlenecks such as overloading scarce computing resources, which may be caused by an unbalanced loading of parallel tasks. In this paper, we aim to fill those gaps by introducing a generic framework for optimizing the performance of big spatial data queries on top of Apache Spark. Our framework also supports advanced management functions including a unique self-adaptable load-balancing service to self-tune framework execution. Our experimental evaluation shows that our framework is scalable and efficient for querying massive amounts of real spatial datasets.
引用
收藏
页码:851 / 856
页数:6
相关论文
共 50 条
  • [1] A Spark-based parallel framework for geospatial raster data processing
    Gao, Fan
    Yue, Peng
    [J]. 2018 7TH INTERNATIONAL CONFERENCE ON AGRO-GEOINFORMATICS (AGRO-GEOINFORMATICS), 2018, : 53 - 56
  • [2] A Dynamic Spark-based Classification Framework for Imbalanced Big Data
    Nahla B. Abdel-Hamid
    Sally ElGhamrawy
    Ali El Desouky
    Hesham Arafat
    [J]. Journal of Grid Computing, 2018, 16 : 607 - 626
  • [3] A Dynamic Spark-based Classification Framework for Imbalanced Big Data
    Abdel-Hamid, Nahla B.
    ElGhamrawy, Sally
    El Desouky, Ali
    Arafat, Hesham
    [J]. JOURNAL OF GRID COMPUTING, 2018, 16 (04) : 607 - 626
  • [4] Enabling Standard Geospatial Capabilities in Spark for the Efficient Processing of Geospatial Big Data
    Engelinus, Jonathan
    Badard, Thierry
    Bernier, Eveline
    [J]. GEOGRAPHICAL INFORMATION SYSTEMS THEORY, APPLICATIONS AND MANAGEMENT, GISTAM 2018, 2019, 1061 : 133 - 148
  • [5] A Spark-Based Big Data Platform for Massive Remote Sensing Data Processing
    Sun, Zhongyi
    Chen, Fengke
    Chi, Mingmin
    Zhu, Yangyong
    [J]. DATA SCIENCE, 2015, 9208 : 120 - 126
  • [6] Efficient Index and Query Algorithm Based on Geospatial Big Data
    Zhao, Huihui
    Zhao, Fan
    Chen, Renhai
    Feng, Zhiyong
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (02): : 333 - 345
  • [7] A spark-based big data analysis framework for real-time sentiment prediction on streaming data
    Kilinc, Deniz
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2019, 49 (09): : 1352 - 1364
  • [8] BDPS: An Efficient Spark-Based Big Data Processing Scheme for Cloud Fog-IoT Orchestration
    Hossen, Rakib
    Whaiduzzaman, Md
    Uddin, Mohammed Nasir
    Islam, Md. Jahidul
    Faruqui, Nuruzzaman
    Barros, Alistair
    Sookhak, Mehdi
    Mahi, Md. Julkar Nayeen
    [J]. INFORMATION, 2021, 12 (12)
  • [9] Spark-based Large-scale Matrix Inversion for Big Data Processing
    Liang, Yang
    Liu, Jun
    Fang, Cheng
    Ansari, Nirwan
    [J]. 2016 IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2016,
  • [10] Spark-Based Large-Scale Matrix Inversion for Big Data Processing
    Liu, Jun
    Liang, Yang
    Ansari, Nirwan
    [J]. IEEE ACCESS, 2016, 4 : 2166 - 2176