Parallel processing in data analysis of the JUNO experiment

被引:0
|
作者
Yang, Yixiang [1 ]
机构
[1] Chinese Acad Sci, Inst High Energy Phys, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1088/1742-6596/2438/1/012057
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The JUNO experiment is being built mainly to determine the neutrino mass hierarchy by detecting neutrinos generated in the Yangjiang and Taishan nuclear plants in southern China. The detector will record 5.6 TB raw data every day for offline analysis, but each day it can only collect about 60 neutrino events scattered among huge background events. Selection of extremely sparse neutrino events brings a big challenge to offline data analysis. A typical neutrino physics event normally spans across a number of consecutive readout events, flagged by a fast positron signal followed by a slow neutron signal within a varying-size time window. To facilitate this analysis, a two-step data processing scheme has been proposed. In the first step (called data preparation), the event index data is produced and skimmed, which only contains information of minimum physics quantities of events as well as their addresses in the original reconstructed data file. In the second step (called time correlation analysis), event index data is further selected with stricter criteria. And then, for each selected event, the time correlation analysis is performed by reading all associated events within a pre-defined time window from the original data file according to the selected event's address and timestamp. This contribution will start to introduce the design of the above data processing scheme and then focus on the multi-threaded implementation of time correlation analysis based on the Intel Threading Building Block (TBB) in the SNiPER framework. Afterwards, this contribution will describe the implementation of distributed analysis using MPI in which the time correlation analysis task is divided into sub-tasks running on multiple computing nodes. At last, this contribution will present the detailed performance measurements made on a multiple-node test bed. By using both skimming and indexing techniques, the total amount of data finally used for neutrino signal time correlation analysis is significantly reduced, and the processing time could be reduced by two orders of magnitude.
引用
下载
收藏
页数:6
相关论文
共 50 条
  • [21] Status and prospects of the JUNO experiment
    Ranucci, Gioacchino
    XXVII INTERNATIONAL CONFERENCE ON NEUTRINO PHYSICS AND ASTROPHYSICS (NEUTRINO2016), 2017, 888
  • [22] On utilizing experiment data repository for performance analysis of parallel applications
    Truong, HL
    Fahringer, T
    EURO-PAR 2003 PARALLEL PROCESSING, PROCEEDINGS, 2003, 2790 : 27 - 37
  • [23] Off-line data processing and analysis for the GERDA experiment
    Agostini, M.
    Pandola, L.
    Zavarise, P.
    14TH INTERNATIONAL WORKSHOP ON ADVANCED COMPUTING AND ANALYSIS TECHNIQUES IN PHYSICS RESEARCH (ACAT 2011), 2012, 368
  • [24] Application of Regression Analysis in Data Processing of Physical Experiment of College
    Peng, Jianxin
    Lu, Yigang
    KNOWLEDGE DISCOVERY AND DATA MINING, 2012, 135 : 525 - 531
  • [25] The neutrino mass ordering and the JUNO experiment
    Antonelli, V.
    NUOVO CIMENTO C-COLLOQUIA AND COMMUNICATIONS IN PHYSICS, 2018, 41 (1-2):
  • [26] Efficiency analysis of seismic data processing with the application of parallel algorithm
    Zhu, Shu-Yun
    Zhu, Xu-Guang
    Xie, Dong-Lian
    Zhang, Li-Mei
    Shiyou Diqiu Wuli Kantan/Oil Geophysical Prospecting, 2011, 46 (03): : 493 - 499
  • [27] Guest Editorial: The Parallel Storage, Processing and Analysis for Big Data
    Li, Maozhen
    Tang, Zhuo
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2017, 45 (04) : 731 - 733
  • [28] AN EXPERIMENT IN DATA PROCESSING MANAGEMENT
    STUART, WJ
    DATAMATION, 1968, 14 (06): : 64 - &
  • [29] Guest Editorial: The Parallel Storage, Processing and Analysis for Big Data
    Maozhen Li
    Zhuo Tang
    International Journal of Parallel Programming, 2017, 45 : 731 - 733
  • [30] ANALYSIS, SYNTHESIS AND PARALLEL PROCESSING OF LARGE DATA AND KNOWLEDGE BASES
    POLYACHENKO, BE
    ANDON, FI
    GUNKO, OL
    DATA ANALYSIS, LEARNING SYMBOLIC AND NUMERIC KNOWLEDGE, 1989, : 519 - 530