Scalable Artificial Intelligence for Earth Observation Data Using Hopsworks

被引:0
|
作者
Hagos, Desta Haileselassie [1 ]
Kakantousis, Theofilos [2 ]
Sheikholeslami, Sina [1 ]
Wang, Tianze [1 ]
Vlassov, Vladimir [1 ]
Payberah, Amir Hossein [1 ]
Meister, Moritz [2 ]
Andersson, Robin [2 ]
Dowling, Jim [1 ,2 ]
机构
[1] KTH Royal Inst Technol, Div Software & Comp Syst, S-10044 Stockholm, Sweden
[2] Logical Clocks AB, S-11872 Stockholm, Sweden
关键词
Hopsworks; Copernicus; Earth Observation; machine learning; deep learning; artificial intelligence; model serving; big data; ablation studies; Maggy; ExtremeEarth;
D O I
10.3390/rs14081889
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This paper introduces the Hopsworks platform to the entire Earth Observation (EO) data community and the Copernicus programme. Hopsworks is a scalable data-intensive open-source Artificial Intelligence (AI) platform that was jointly developed by Logical Clocks and the KTH Royal Institute of Technology for building end-to-end Machine Learning (ML)/Deep Learning (DL) pipelines for EO data. It provides the full stack of services needed to manage the entire life cycle of data in ML. In particular, Hopsworks supports the development of horizontally scalable DL applications in notebooks and the operation of workflows to support those applications, including parallel data processing, model training, and model deployment at scale. To the best of our knowledge, this is the first work that demonstrates the services and features of the Hopsworks platform, which provide users with the means to build scalable end-to-end ML/DL pipelines for EO data, as well as support for the discovery and search for EO metadata. This paper serves as a demonstration and walkthrough of the stages of building a production-level model that includes data ingestion, data preparation, feature extraction, model training, model serving, and monitoring. To this end, we provide a practical example that demonstrates the aforementioned stages with real-world EO data and includes source code that implements the functionality of the platform. We also perform an experimental evaluation of two frameworks built on top of Hopsworks, namely Maggy and AutoAblation. We show that using Maggy for hyperparameter tuning results in roughly half the wall-clock time required to execute the same number of hyperparameter tuning trials using Spark while providing linear scalability as more workers are added. Furthermore, we demonstrate how AutoAblation facilitates the definition of ablation studies and enables the asynchronous parallel execution of ablation trials.
引用
收藏
页数:23
相关论文
共 50 条
  • [31] Accessing earth observation data using JPEG2000
    Carvalho, Helder
    Serrao, Carlos
    Serra, Antonio
    Dias, Miguel
    COMPUTATIONAL MODELLING OF OBJECTS REPRESENTED IN IMAGES: FUNDAMENTALS, METHODS AND APPLICATIONS, 2007, : 135 - 140
  • [32] Earth observation using radar data: an overview of applications and challenges
    Palmann, C.
    Mavromatis, S.
    Hernandez, M.
    Sequeira, J.
    Brisco, B.
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2008, 1 (02) : 171 - 195
  • [33] Using earth observation data in spatial planning: The Geoland project
    Evans, Neil
    EUROPEAN PLANNING STUDIES, 2007, 15 (07) : 985 - 989
  • [34] Bamboo Mapping Using Earth Observation Data: A Systematic Review
    Muna Tamang
    Subrata Nandy
    Ritika Srinet
    Ashesh Kumar Das
    Hitendra Padalia
    Journal of the Indian Society of Remote Sensing, 2022, 50 : 2055 - 2072
  • [35] EARTH OBSERVATION DATA POLICY
    GIBSON, R
    SPACE POLICY, 1993, 9 (04) : 272 - 272
  • [36] Archives for Earth observation data
    Harris, R
    Olby, N
    SPACE POLICY, 2000, 16 (03) : 223 - 227
  • [37] Bamboo Mapping Using Earth Observation Data: A Systematic Review
    Tamang, Muna
    Nandy, Subrata
    Srinet, Ritika
    Das, Ashesh Kumar
    Padalia, Hitendra
    JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2022, 50 (11) : 2055 - 2072
  • [38] Application of fuzzy artificial neural network to observation data analysis of earth dam monitoring
    Chen, Jiguang
    Lu, Xuechang
    Shuili Xuebao/Journal of Hydraulic Engineering, 2000, (01): : 19 - 22
  • [39] Transforming earth science and engineering: the power of artificial intelligence, data science and machine learning
    Sajeev, R.
    CURRENT SCIENCE, 2024, 127 (03): : 279 - 280
  • [40] Towards practical artificial intelligence in Earth sciences
    Sun, Ziheng
    ten Brink, Talya
    Carande, Wendy
    Koren, Gerbrand
    Cristea, Nicoleta
    Jorgenson, Corin
    Janga, Bhargavi
    Asamani, Gokul Prathin
    Achan, Sanjana
    Mahoney, Mike
    Huang, Qian
    Mehrabian, Armin
    Munasinghe, Thilanka
    Liu, Zhong
    Margolis, Aaron
    Webley, Peter
    Gong, Bing
    Rao, Yuhan
    Burgess, Annie
    Huang, Andrew
    Sandoval, Laura
    Pagan, Brianna R.
    Duzgun, Sebnem
    COMPUTATIONAL GEOSCIENCES, 2024, 28 (06) : 1305 - 1329