END-TO-END PROCESS ORCHESTRATION OF EARTH OBSERVATION DATA WORKFLOWS WITH APACHE AIRFLOW ON HIGH PERFORMANCE COMPUTING

被引:3
|
作者
Tian, Liang [1 ]
Sedona, Rocco [1 ,2 ]
Mozaffari, Amirpasha [2 ]
Kreshpa, Enxhi [2 ]
Paris, Claudia [3 ]
Riedel, Morris [1 ,2 ]
Schultz, Martin G. [2 ]
Cavallaro, Gabriele [1 ,2 ]
机构
[1] Univ Iceland, Sch Engn & Nat Sci, IS-107 Reykjavik, Iceland
[2] Forschungszentrum Julich, Julich Supercomp Ctr, D-52428 Julich, Germany
[3] Univ Twente, NL-7514 AE Enschede, Netherlands
基金
欧盟地平线“2020”;
关键词
Workflows; Deep Learning (DL); High-Performance Computing (HPC); remote sensing data;
D O I
10.1109/IGARSS52108.2023.10283416
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Earth Observation (EO) data processing faces challenges due to large volumes, multiple sources, and diverse formats. To address this issue, this paper presents a scalable and parallelizable workflow using Apache Airflow, capable of integrating Machine Learning (ML) and Deep Learning (DL) models with Modular Supercomputing Architecture (MSA) systems. To test the workflow, we considered the production of large-scale Land-Cover (LC) maps as a case study. The workflow manager, Airflow, offers scalability, extensibility, and programmable task definition in Python. It allows us to execute different steps of the workflow in different High-Performance Computing (HPC) systems. The workflow is demonstrated on the Dynamical Exascale Entry Platform (DEEP) and J <spacing diaeresis>ulich Research on Exascale Cluster Architectures (JURECA) hosted at the J <spacing diaeresis>ulich Supercomputing Centre (JSC), a platform that incorporates heterogeneous JSC systems.
引用
收藏
页码:711 / 714
页数:4
相关论文
共 50 条
  • [1] End-to-end online performance data capture and analysis for scientific workflows
    Papadimitriou, George
    Wang, Cong
    Vahi, Karan
    da Silva, Rafael Ferreira
    Mandal, Anirban
    Liu, Zhengchun
    Mayani, Rajiv
    Rynge, Mats
    Kiran, Mariam
    Lynch, Vickie E.
    Kettimuthu, Rajkumar
    Deelman, Ewa
    Vetter, Jeffrey S.
    Foster, Ian
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 117 : 387 - 400
  • [2] End-to-end online performance data capture and analysis for scientific workflows
    Papadimitriou, George
    Wang, Cong
    Vahi, Karan
    da Silva, Rafael Ferreira
    Mandal, Anirban
    Liu, Zhengchun
    Mayani, Rajiv
    Rynge, Mats
    Kiran, Mariam
    Lynch, Vickie E.
    Kettimuthu, Rajkumar
    Deelman, Ewa
    Vetter, Jeffrey S.
    Foster, Ian
    Future Generation Computer Systems, 2021, 117 : 387 - 400
  • [3] End-to-End Scientific Data Management using Workflows
    Simmhan, Yogesh
    IEEE CONGRESS ON SERVICES 2008, PT I, PROCEEDINGS, 2008, : 472 - 473
  • [4] An optimized end-to-end process for the analysis of agile earth observation satellite missions
    Hahn, M.
    Mueller, T.
    Levenhagen, J.
    CEAS SPACE JOURNAL, 2014, 6 (3-4) : 145 - 154
  • [5] End-to-End Service Orchestration across SDN and Cloud Computing Domains
    Bonafiglia, Roberto
    Castellano, Gabriele
    Cerrato, Ivano
    Risso, Fulvio
    2017 IEEE CONFERENCE ON NETWORK SOFTWARIZATION (IEEE NETSOFT), 2017,
  • [6] A Framework for End-to-End Simulation of High-performance Computing Systems
    Denzel, Wolfgang E.
    Li, Jian
    Walker, Peter
    Jin, Yuho
    SIMULATION-TRANSACTIONS OF THE SOCIETY FOR MODELING AND SIMULATION INTERNATIONAL, 2010, 86 (5-6): : 331 - 350
  • [7] End-to-End Network Performance Monitoring for Dispersed Computing
    Quynh Nguyen
    Ghosh, Pradipta
    Krishnamachari, Bhaskar
    2018 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2018, : 707 - 711
  • [8] End-to-End Production Process Orchestration for Smart Printing Factories: An Application in Industry
    Traganos, Konstantinos
    Vanderfeesten, Irene
    Grefen, Paul
    Erasmus, Jonnro
    Gerrits, Ton
    Verhofstad, Wim
    2020 IEEE 24TH INTERNATIONAL ENTERPRISE DISTRIBUTED OBJECT COMPUTING CONFERENCE (EDOC 2020), 2020, : 155 - 164
  • [9] End-to-End Research Data Management Workflows A Case Study with Dendro and EUDAT
    Silva, Fabio
    Amorim, Ricardo Carvalho
    Castro, Joao Aguiar
    da Silva, Joao Rocha
    Ribeiro, Cristina
    METADATA AND SEMANTICS RESEARCH, MTSR 2016, 2016, 672 : 369 - 375
  • [10] Data-based description of process performance in end-to-end order processing
    Schuh, Gunther
    Guetzlaff, Andreas
    Schmitz, Seth
    van der Aalst, Wil M. P.
    CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2020, 69 (01) : 381 - 384