END-TO-END PROCESS ORCHESTRATION OF EARTH OBSERVATION DATA WORKFLOWS WITH APACHE AIRFLOW ON HIGH PERFORMANCE COMPUTING

被引:3
|
作者
Tian, Liang [1 ]
Sedona, Rocco [1 ,2 ]
Mozaffari, Amirpasha [2 ]
Kreshpa, Enxhi [2 ]
Paris, Claudia [3 ]
Riedel, Morris [1 ,2 ]
Schultz, Martin G. [2 ]
Cavallaro, Gabriele [1 ,2 ]
机构
[1] Univ Iceland, Sch Engn & Nat Sci, IS-107 Reykjavik, Iceland
[2] Forschungszentrum Julich, Julich Supercomp Ctr, D-52428 Julich, Germany
[3] Univ Twente, NL-7514 AE Enschede, Netherlands
基金
欧盟地平线“2020”;
关键词
Workflows; Deep Learning (DL); High-Performance Computing (HPC); remote sensing data;
D O I
10.1109/IGARSS52108.2023.10283416
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Earth Observation (EO) data processing faces challenges due to large volumes, multiple sources, and diverse formats. To address this issue, this paper presents a scalable and parallelizable workflow using Apache Airflow, capable of integrating Machine Learning (ML) and Deep Learning (DL) models with Modular Supercomputing Architecture (MSA) systems. To test the workflow, we considered the production of large-scale Land-Cover (LC) maps as a case study. The workflow manager, Airflow, offers scalability, extensibility, and programmable task definition in Python. It allows us to execute different steps of the workflow in different High-Performance Computing (HPC) systems. The workflow is demonstrated on the Dynamical Exascale Entry Platform (DEEP) and J <spacing diaeresis>ulich Research on Exascale Cluster Architectures (JURECA) hosted at the J <spacing diaeresis>ulich Supercomputing Centre (JSC), a platform that incorporates heterogeneous JSC systems.
引用
收藏
页码:711 / 714
页数:4
相关论文
共 50 条
  • [21] TrustCell: Towards the end-to-end trustworthiness in data-oriented scientific computing
    Pallickara, Sangmi Lee
    Plale, Beth
    2006 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS, PROCEEDINGS, 2006, : 33 - +
  • [22] Internet2 end-to-end performance tuning for distributed computing applications
    Liu, Jiang B.
    Mannemela, Pavan
    2005 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY - (ICIT), VOLS 1 AND 2, 2005, : 651 - 656
  • [23] SDN Orchestration for Dynamic End-to-End Control of Data Center Multi-Domain Optical Networking
    LIU Lei
    China Communications, 2015, 12 (08) : 10 - 21
  • [24] SDN Orchestration for Dynamic End-to-End Control of Data Center Multi-Domain Optical Networking
    Liu Lei
    CHINA COMMUNICATIONS, 2015, 12 (08) : 10 - 21
  • [25] An end-to-end statistical process with mobile network data for official statistics
    David Salgado
    Luis Sanguiao
    Bogdan Oancea
    Sandra Barragán
    Marian Necula
    EPJ Data Science, 10
  • [26] An end-to-end statistical process with mobile network data for official statistics
    Salgado, David
    Sanguiao, Luis
    Oancea, Bogdan
    Barragan, Sandra
    Necula, Marian
    EPJ DATA SCIENCE, 2021, 10 (01)
  • [27] End-to-End SDN/NFV Orchestration of Video Analytics Using Edge and Cloud Computing over Programmable Optical Networks
    Vilalta, Ricard
    Popescu, Ion
    Mayoral, Arturo
    Cao, Xiaoyuan
    Casellas, Ramon
    Yoshikane, Noboru
    Martinez, Ricardo
    Tsuritani, Takehiro
    Morita, Itsuro
    Munoz, Raul
    2017 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION (OFC), 2017,
  • [28] On a Multi-objective Evolutionary Algorithm for Optimizing End-to-end Performance of Scientific Workflows in Distributed Environments
    Gu, Yi
    Shenq, Shwu-Ling
    Wu, Qishi
    Dasgupta, Dipankar
    45TH ANNUAL SIMULATION SYMPOSIUM 2012 (ANSS 2012), 2012, 44 (02): : 69 - 77
  • [29] Automatic End-to-End Decomposition and Semantic Annotation of Laws Using High-Performance-Computing and Open Data as a Potential Driver for Digital Transformation
    Alexopoulos, Charalampos
    Pihir, Igor
    Furjan, Martina Tomicic
    CENTRAL EUROPEAN CONFERENCE ON INFORMATION AND INTELLIGENT SYSTEMS, CECIIS 2022, 2022, : 189 - 194
  • [30] Performance Characterization of Hardware/Software Communication Interfaces in End-to-End Power Management Solutions of High-Performance Computing Processors
    del Vecchio, Antonio
    Ottaviano, Alessandro
    Bambini, Giovanni
    Acquaviva, Andrea
    Bartolini, Andrea
    Energies, 2024, 17 (22)