Scalable Artificial Intelligence for Earth Observation Data Using Hopsworks

被引:0
|
作者
Hagos, Desta Haileselassie [1 ]
Kakantousis, Theofilos [2 ]
Sheikholeslami, Sina [1 ]
Wang, Tianze [1 ]
Vlassov, Vladimir [1 ]
Payberah, Amir Hossein [1 ]
Meister, Moritz [2 ]
Andersson, Robin [2 ]
Dowling, Jim [1 ,2 ]
机构
[1] KTH Royal Inst Technol, Div Software & Comp Syst, S-10044 Stockholm, Sweden
[2] Logical Clocks AB, S-11872 Stockholm, Sweden
关键词
Hopsworks; Copernicus; Earth Observation; machine learning; deep learning; artificial intelligence; model serving; big data; ablation studies; Maggy; ExtremeEarth;
D O I
10.3390/rs14081889
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This paper introduces the Hopsworks platform to the entire Earth Observation (EO) data community and the Copernicus programme. Hopsworks is a scalable data-intensive open-source Artificial Intelligence (AI) platform that was jointly developed by Logical Clocks and the KTH Royal Institute of Technology for building end-to-end Machine Learning (ML)/Deep Learning (DL) pipelines for EO data. It provides the full stack of services needed to manage the entire life cycle of data in ML. In particular, Hopsworks supports the development of horizontally scalable DL applications in notebooks and the operation of workflows to support those applications, including parallel data processing, model training, and model deployment at scale. To the best of our knowledge, this is the first work that demonstrates the services and features of the Hopsworks platform, which provide users with the means to build scalable end-to-end ML/DL pipelines for EO data, as well as support for the discovery and search for EO metadata. This paper serves as a demonstration and walkthrough of the stages of building a production-level model that includes data ingestion, data preparation, feature extraction, model training, model serving, and monitoring. To this end, we provide a practical example that demonstrates the aforementioned stages with real-world EO data and includes source code that implements the functionality of the platform. We also perform an experimental evaluation of two frameworks built on top of Hopsworks, namely Maggy and AutoAblation. We show that using Maggy for hyperparameter tuning results in roughly half the wall-clock time required to execute the same number of hyperparameter tuning trials using Spark while providing linear scalability as more workers are added. Furthermore, we demonstrate how AutoAblation facilitates the definition of ablation studies and enables the asynchronous parallel execution of ablation trials.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Towards a training data model for artificial intelligence in earth observation
    Yue, Peng
    Shangguan, Boyi
    Hu, Lei
    Jiang, Liangcun
    Zhang, Chenxiao
    Cao, Zhipeng
    Pan, Yinyin
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2022, 36 (11) : 2113 - 2137
  • [2] AiTLAS: Artificial Intelligence Toolbox for Earth Observation
    Dimitrovski, Ivica
    Kitanovski, Ivan
    Panov, Pance
    Kostovska, Ana
    Simidjievski, Nikola
    Kocev, Dragi
    REMOTE SENSING, 2023, 15 (09)
  • [3] EarthNets: Empowering artificial intelligence for Earth observation
    Xiong, Zhitong
    Zhang, Fahong
    Wang, Yi
    Shi, Yilei
    Zhu, Xiao Xiang
    IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2024,
  • [4] SCALABLE DATA PROCESSING PLATFORM FOR EARTH OBSERVATION DATA REPOSITORIES
    Astsatryan, Hrachya
    Lalayan, Arthur
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2023, 24 (01): : 35 - 44
  • [5] Scalable big earth observation data mining algorithms: a review
    Sisodiya, Neha
    Dube, Nitant
    Prakash, Om
    Thakkar, Priyank
    EARTH SCIENCE INFORMATICS, 2023, 16 (3) : 1993 - 2016
  • [6] Scalable big earth observation data mining algorithms: a review
    Neha Sisodiya
    Nitant Dube
    Om Prakash
    Priyank Thakkar
    Earth Science Informatics, 2023, 16 : 1993 - 2016
  • [7] Earth Observation and Artificial Intelligence Understanding emerging ethical issues and opportunities
    Kochupillai, Mrinalini
    Kahl, Matthias
    Schmitt, Michael
    Taubenboeck, Hannes
    Zhu, Xiao Xiang
    IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2022, 10 (04) : 90 - 124
  • [8] Unlocking the Use of Raw Multispectral Earth Observation Imagery for Onboard Artificial Intelligence
    Meoni, Gabriele
    Del Prete, Roberto
    Serva, Federico
    De Beusscher, Alix
    Colin, Olivier
    Longepe, Nicolas
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 12521 - 12537
  • [9] The Effect of Scalable Information on Artificial Intelligence
    Yang, Xiao-lei
    Mo, Jin-ping
    Qian, Wen-biao
    Yang, Qing-lin
    2018 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND NETWORK TECHNOLOGY (CCNT 2018), 2018, 291 : 108 - 112
  • [10] Toward Scalable Artificial Intelligence in Finance
    Sanz, Jorge L. C.
    Zhu, Yada
    2021 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (SCC 2021), 2021, : 460 - 469