DROP Computing: Data Driven Pipeline Processing for the SKA

被引:0
|
作者
Wicenec, Andreas [1 ]
Pallot, Dave [1 ]
Tobar, Rodrigo [1 ]
Wu, Chen [1 ]
机构
[1] Univ Western Australia, ICRAR, Perth, WA, Australia
关键词
D O I
暂无
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
The correlator output of the SKA arrays will be of the order of 1 TB/s. That data rate will have to be processed by the Science Data Processor using dedicated HPC infrastructure in both Australia and South Africa. Radio astronomical processing in principle is thought to be highly data parallel, with little to no communication required between individual tasks. Together with the ever increasing number of cores (CPUs) and stream processors (GPUs) this led us to step back and think about the traditional pipeline and task driven approach on a more fundamental level. We have thus started to look into dataflow representations (Dennis & Misunas 1974) and data flow programming models (Davis 1978) as well as data flow languages (Johnston et al. 2004) and scheduling (Benoit et al. 2014). We have investigated a number of existing systems and prototyped some implementations using simplified, but real radio astronomy workflows. Despite the fact that many of these approaches are already focussing on data and dataflow as the most critical component, we still missed a rigorously data driven approach, where the data itself is essentially driving the whole process. In this talk we will present the new concept of DROP Computing (condensed data cloud), which is an integral part of the current SKA Data Layer architecture. In short a DROP is an abstract class, instances of which represent data (DataDrop), collections of DROPs (Container Drop), but also applications (ApplicationDrop, e.g. pipeline components). The rest are just details, which will be presented in the talk.
引用
收藏
页码:319 / 328
页数:10
相关论文
共 50 条
  • [31] Big Data Pipeline Scheduling and Adaptation on the Computing Continuum
    Kimovski, Dragi
    Bauer, Christian
    Mehran, Narges
    Prodan, Radu
    2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 1153 - 1158
  • [32] Avoiding Data Corruption in Drop Computing Mobile Networks
    Ciobanu, Radu-Ioan
    Tabusca, Vladut-Constantin
    Dobre, Ciprian
    Bajenaru, Lidia
    Mavromoustakis, Constandinos X.
    Mastorakis, George
    IEEE ACCESS, 2019, 7 : 31170 - 31185
  • [33] Special Topic: Data Processing System and Scientific Applications of SKA Regional Centre
    An, Tao
    SCIENTIA SINICA-PHYSICA MECHANICA & ASTRONOMICA, 2023, 53 (02)
  • [34] Serverless data pipeline approaches for IoT data in fog and cloud computing
    Poojara, Shivananda R.
    Dehury, Chinmaya Kumar
    Jakovits, Pelle
    Srirama, Satish Narayana
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 130 : 91 - 105
  • [35] A Model for Sustainable Data Encryption, Storage, and Processing in Edge Computing-driven Internet of Things
    Huang, Chenze
    Zhong, Ying
    International Journal of Network Security, 26 (03): : 425 - 434
  • [36] A Sustainable Data Encryption Storage and Processing Framework via Edge Computing-Driven IoT
    Li, Qi
    Huang, Jian
    Li, Sihan
    Huang, Chenze
    ENGINEERING LETTERS, 2024, 32 (07) : 1510 - 1520
  • [37] Pipeline processing and quality control for Echelle data
    Morgante, G
    Pasian, F
    Ballester, P
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS VII (ADASS), 1998, 145 : 337 - 340
  • [38] An Extensible Parsing Pipeline for Unstructured Data Processing
    Jain, Shubham
    de Buitleir, Amy
    Fallon, Enda
    2021 23RD INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT 2021): ON-LINE SECURITY IN PANDEMIC ERA, 2021, : 312 - 318
  • [39] An integrated processing pipeline for irregular volume data
    Yang, CK
    Chiueh, TC
    VOLUME GRAPHICS 2005, 2005, : 147 - +
  • [40] GLAST (FERMI) Data-Processing Pipeline
    Flath, Daniel L.
    Johnson, Tony S.
    Turri, Massimiliano
    Heidenreich, Karen A.
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XVIII, 2009, 411 : 193 - 196