GPU-acceleration on a low-latency binary-coalescence gravitational wave search pipeline

被引:14
|
作者
Guo, Xiangyu [1 ]
Chu, Qi [2 ]
Chung, Shin Kee [2 ]
Du, Zhihui [1 ]
Wen, Linqing [2 ]
Gu, Yanqi [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
[2] Univ Western Australia, Dept Phys & Astrophys, OzGrav, Crawley, WA 6009, Australia
基金
中国国家自然科学基金; 澳大利亚研究理事会;
关键词
Gravitational wave; GPU; Data processing pipeline; Low latency;
D O I
10.1016/j.cpc.2018.05.002
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Low-latency detections of gravitational waves (GWs) from compact stellar binary coalescences are crucial to enable prompt follow-up observations to astrophysical transients by conventional telescopes, as demonstrated by the first joint GW and electromagnetic observations on July 17, 2017. Searching over the GW parameter space with the requirement of low-latency presents a computational challenge. This will become more severe considering denser sampling of the source space due to improving GW detector sensitivities. In our previous work, a low-latency matched filtering search method was developed, called Summed Parallel Infinite Impulse Response (SPIIR) filtering, which is suitable for parallelization, and an over 50x speedup of this method was achieved using Fermi-generation CPUs. In this paper, a multi-rate scheme for filtering, which reduces the computation time by a factor of several, is presented. The recent features in NVIDIA CPUs, namely the read-only data cache, warp-shuffle, and cross-warp atomic techniques, were exploited to improve the performance of filtering over previous GPU acceleration by a factor of 1.5 similar to 11x on a Maxwell-generation GPU, whereas on a Pascal-generation GPU, the hardwn be gained by employing these new techniques. This leads to an over 100are upgrade can bring along 7 similar to 11x speedup, and a further 1.5 similar to 2.5x speedup cax speedup for the multi-rate scheme using the Maxwell GPU, and an over 260x speedup using the Pascal GPU. This GPU-accelerated multi-rate scheme was incorporated into a low-latency search pipeline - the SPIIR pipeline - and an overall near-limit CPU usage reduction is expected. This IIR technique in general and its GPU acceleration technique here hold the potential to find applications in other signal processing fields, such as in image and radio astronomy data processing. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:62 / 71
页数:10
相关论文
共 23 条
  • [21] Template banks to search for low-mass binary black holes in advanced gravitational-wave detectors
    Brown, Duncan A.
    Kumar, Prayush
    Nitz, Alexander H.
    PHYSICAL REVIEW D, 2013, 87 (08):
  • [22] Very low latency search pipeline for low mass compact binary coalescences in the LIGO S6 and Virgo VSR2 data
    Buskulic, D.
    CLASSICAL AND QUANTUM GRAVITY, 2010, 27 (19)
  • [23] Search for gravitational waves from low mass compact binary coalescence in LIGO's sixth science run and Virgo's science runs 2 and 3
    Abadie, J.
    Abbott, B. P.
    Abbott, R.
    Abbott, T. D.
    Abernathy, M.
    Accadia, T.
    Acernese, F.
    Adams, C.
    Adhikari, R.
    Affeldt, C.
    Agathos, M.
    Ajith, P.
    Allen, B.
    Allen, G. S.
    Ceron, E. Amador
    Amariutei, D.
    Amin, R. S.
    Anderson, S. B.
    Anderson, W. G.
    Arai, K.
    Arain, M. A.
    Araya, M. C.
    Aston, S. M.
    Astone, P.
    Atkinson, D.
    Aufmuth, P.
    Aulbert, C.
    Aylott, B. E.
    Babak, S.
    Baker, P.
    Ballardin, G.
    Ballmer, S.
    Barker, D.
    Barone, F.
    Barr, B.
    Barriga, P.
    Barsotti, L.
    Barsuglia, M.
    Barton, M. A.
    Bartos, I.
    Bassiri, R.
    Bastarrika, M.
    Basti, A.
    Batch, J.
    Bauchrowitz, J.
    Bauer, Th. S.
    Bebronne, M.
    Behnke, B.
    Beker, M. G.
    Bell, A. S.
    PHYSICAL REVIEW D, 2012, 85 (08):