DROP Computing: Data Driven Pipeline Processing for the SKA

被引:0
|
作者
Wicenec, Andreas [1 ]
Pallot, Dave [1 ]
Tobar, Rodrigo [1 ]
Wu, Chen [1 ]
机构
[1] Univ Western Australia, ICRAR, Perth, WA, Australia
关键词
D O I
暂无
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
The correlator output of the SKA arrays will be of the order of 1 TB/s. That data rate will have to be processed by the Science Data Processor using dedicated HPC infrastructure in both Australia and South Africa. Radio astronomical processing in principle is thought to be highly data parallel, with little to no communication required between individual tasks. Together with the ever increasing number of cores (CPUs) and stream processors (GPUs) this led us to step back and think about the traditional pipeline and task driven approach on a more fundamental level. We have thus started to look into dataflow representations (Dennis & Misunas 1974) and data flow programming models (Davis 1978) as well as data flow languages (Johnston et al. 2004) and scheduling (Benoit et al. 2014). We have investigated a number of existing systems and prototyped some implementations using simplified, but real radio astronomy workflows. Despite the fact that many of these approaches are already focussing on data and dataflow as the most critical component, we still missed a rigorously data driven approach, where the data itself is essentially driving the whole process. In this talk we will present the new concept of DROP Computing (condensed data cloud), which is an integral part of the current SKA Data Layer architecture. In short a DROP is an abstract class, instances of which represent data (DataDrop), collections of DROPs (Container Drop), but also applications (ApplicationDrop, e.g. pipeline components). The rest are just details, which will be presented in the talk.
引用
收藏
页码:319 / 328
页数:10
相关论文
共 50 条
  • [1] Bringing computation to the data: A MOEA-driven approach for optimising data processing in the context of the SKA and SRCNet
    Parra-Royon, Manuel
    Rodriguez-Gallardo, Alvaro
    Sanchez-Exposito, Susana
    Darriba-Pol, Laura
    Sanchez-Castaneda, Jesus
    Mendoza, MAngeles
    Garrido, Julian
    Moldon, Javier
    Verdes-Montenegro, Lourdes
    IEEE CONFERENCE ON EVOLVING AND ADAPTIVE INTELLIGENT SYSTEMS 2024, IEEE EAIS 2024, 2024, : 161 - 168
  • [2] Data driven signal processing: An approach for energy efficient computing
    Chandrakasan, A
    Gutnik, V
    Xanthopoulos, T
    1996 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN - DIGEST OF TECHNICAL PAPERS, 1996, : 347 - 352
  • [3] Accelerated Genomics Data Processing using Memory-Driven Computing
    Becker, Matthias
    Chabbit, Milind
    Warnat-Herresthal, Stefanie
    Worlikar, Umesh
    Agrawal, Shobhit
    Bhat, Jaydeep
    Schulte-Schrepping, Jonas
    Bassler, Kevin
    Guenther, Patrick
    Schultze, Hartmut
    Ulas, Thomas
    Singhal, Sharad
    Schultze, Joachim L.
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 1850 - 1855
  • [4] The MAGIC data processing pipeline
    Firpo Curcoll, R.
    Delfino, M.
    Neissner, C.
    Reichardt, I.
    Rico, J.
    Tallada, P.
    Tonello, N.
    INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP 2010): EVENT PROCESSING, 2011, 331
  • [5] GALFACTS Data Processing Pipeline
    Guram, S. S.
    Andrecut, M.
    George, S. J.
    Taylor, A. R.
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XX, 2011, 442 : 317 - 320
  • [6] The ISOPHOT pipeline data processing
    Richards, PJ
    Klaas, U
    Laureijs, RJ
    Abraham, P
    Schulz, B
    Morris, H
    Wilke, K
    Heinrichsen, I
    PROCEEDINGS OF THE CONFERENCE ON THE CALIBRATION LEGACY OF THE ISO MISSION, 2003, 481 : 279 - 284
  • [7] An Overview of the SKA Science Analysis Pipeline
    Hollitt, C.
    Johnston-Hollitt, M.
    Dehghan, S.
    Frean, M.
    Butler-Yeoman, T.
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XXV, 2017, 512 : 367 - 370
  • [8] Imaging SKA-scale data in three different computing environments
    Dodson, R.
    Vinsen, K.
    Wu, C.
    Popping, A.
    Meyer, M.
    Wicenec, A.
    Quinn, P.
    van Gorkom, J.
    Momjian, E.
    ASTRONOMY AND COMPUTING, 2016, 14 : 8 - 22
  • [9] On-premises Serverless Computing for Event-Driven Data Processing Applications
    Perez, Alfonso
    Risco, Sebastian
    Naranjo, Diana Maria
    Caballer, Miguel
    Molto, German
    2019 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2019), 2019, : 414 - 421
  • [10] Data-Driven Computing
    Kirchdoerfer, Trenton
    Ortiz, Michael
    ADVANCES IN COMPUTATIONAL PLASTICITY: A BOOK IN HONOUR OF D. ROGER J. OWEN, 2018, 46 : 165 - 183