A modular software architecture for processing of big geospatial data in the cloud

被引:33
|
作者
Kraemer, Michel [1 ,2 ]
Senner, Julia [1 ]
机构
[1] Fraunhofer Inst Comp Graph Res IGD, D-64283 Darmstadt, Germany
[2] Tech Univ Darmstadt, D-64283 Darmstadt, Germany
来源
COMPUTERS & GRAPHICS-UK | 2015年 / 49卷
关键词
Cloud computing; Big Data; Geoprocessing; Distributed systems; Software architectures; Domain-specific languages; SYSTEM;
D O I
10.1016/j.cag.2015.02.005
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper we propose a software architecture that allows for processing of large geospatial data sets in the cloud. Our system is modular and flexible and supports multiple algorithm design paradigms such as MapReduce, in-memory computing or agent-based programming. It contains a web-based user interface where domain experts (e.g. GIS analysts or urban planners) can define high-level processing workflows using a domain-specific language (DSL). The workflows are passed through a number of components including a parser, interpreter, and a service called job manager. These components use declarative and procedural knowledge encoded in rules to generate a processing chain specifying the execution of the workflows on a given cloud infrastructure according to the constraints defined by the user. The job manager evaluates this chain, spawns processing services in the cloud and monitors them. The services communicate with each other through a distributed file system that is scalable and fault-tolerant. Compared to previous work describing cloud infrastructures and architectures we focus on the processing of big heterogeneous geospatial data. In addition to that, we do not rely on only one specific programming model or a certain cloud infrastructure but support several ones. Combined with the possibility to control the processing through DSL-based workflows, this makes our architecture very flexible and configurable. We do not only see the cloud as a means to store and distribute large data sets but also as a way to harness the processing power of distributed computing environments for large-volume geospatial data sets. The proposed architecture design has been developed for the IQmulus research project funded by the European Commission. The paper concludes with the evaluation results from applying our solution to two example workflows from this project. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:69 / 81
页数:13
相关论文
共 50 条
  • [1] An Architecture for Cost Optimization in the Processing of Big Geospatial Data in Public Cloud Providers
    Bachiega Junior, Joao
    Sousa Reis, Marco Antonio
    Holanda, Maristela
    Araujo, Aleteia P. F.
    [J]. 2018 IEEE INTERNATIONAL CONGRESS ON BIG DATA (IEEE BIGDATA CONGRESS), 2018, : 190 - 197
  • [2] GEOSPATIAL BIG DATA PROCESSING IN HYBRID CLOUD ENVIRONMENTS
    Simonis, Ingo
    [J]. IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 419 - 421
  • [3] Fog Computing Architecture for Scalable Processing of Geospatial Big Data
    Barik, Rabindra K.
    Priyadarshini, Rojalina
    Lenka, Rakesh K.
    Dubey, Harishchandra
    Mankodiya, Kunal
    [J]. INTERNATIONAL JOURNAL OF APPLIED GEOSPATIAL RESEARCH, 2020, 11 (01) : 1 - 20
  • [4] Green Cloud Software Engineering for Big Data Processing
    Ganesan, Madhubala
    Kor, Ah-Lian
    Pattinson, Colin
    Rondeau, Eric
    [J]. SUSTAINABILITY, 2020, 12 (21) : 1 - 24
  • [5] Geospatial cloud computing and big data
    Yang, Chaowei Phil
    [J]. COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 2017, 61 : 119 - 119
  • [6] A proposal to minimize the cost of processing big geospatial data in public cloud providers
    Bachiega, Joao
    Holanda, Maristela
    Araujo, Aleteia P. F.
    [J]. TRANSACTIONS IN GIS, 2021, 25 (03) : 1599 - 1624
  • [7] Architecture of Geospatial Big-Data Batch Processing Model Based on Hadoop
    Kim, Sang-Su
    Yu, Sung-Hwan
    [J]. 2015 INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC), 2015, : 964 - 966
  • [8] Big Data in Cloud: A Data Architecture
    Oliveira e Sa, Jorge
    Martins, Cesar
    Simoes, Paulo
    [J]. NEW CONTRIBUTIONS IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 1, PT 1, 2015, 353 : 723 - 732
  • [9] Experimental Study of the Cloud Architecture Selection for Effective Big Data Processing
    Nikulchev, Evgeny
    Pluzhnik, Evgeniy
    Biryukov, Dmitry
    Lukyanchikov, Oleg
    Payain, Simon
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2015, 6 (06) : 22 - 26
  • [10] Parallel Processing Strategies for Big Geospatial Data
    Werner, Martin
    [J]. FRONTIERS IN BIG DATA, 2019, 2