Finite volume methods are widely used numerical strategies for solving partial differential equations. This paper aims to obtain a quantitative understanding of the achievable performance of the cell-centered finite volume method on 3D unstructured tetrahedral meshes, using traditional multicore CPUs as well as modern GPUs. By using an optimized implementation and a synthetic connectivity matrix that exhibits a perfect structure of equal-sized blocks lying on the main diagonal, we can closely relate the achievable computing performance to the size of these diagonal blocks. Moreover, we derive a theoretical model for identifying characteristic levels of the attainable performance as a function of hardware parameters, based on which a realistic upper limit of the performance can be predicted accurately. For real-world tetrahedral meshes, the key to high performance lies in a reordering of the tetrahedra, such that the resulting connectivity matrix resembles a block diagonal form in which the optimal block size depends on the hardware. Numerical experiments confirm that the achieved performance is close to the practically attainable maximum and reaches 75% of the theoretical upper limit, independent of the actual tetrahedral mesh considered. From this, we develop a general model capable of identifying the bottleneck performance of a system's memory hierarchy in irregular applications. © 2014 Elsevier Inc. All rights reserved.
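To make the block-diagonal idea concrete, the following is a minimal sketch, not the implementation or the reordering scheme used in the paper. It assumes a synthetic connectivity in which every cell has four neighbors inside its own block (a ring-like pattern chosen only for illustration), scrambles the cell numbering, and then applies a plain breadth-first renumbering as a stand-in for a real mesh reordering. All function names (`synthetic_blocks`, `in_block_fraction`, `permute`, `bfs_order`) are hypothetical.

```python
from collections import deque
import random


def synthetic_blocks(num_blocks, block_size):
    """Synthetic connectivity: each cell has 4 neighbors, all in its own block.

    Assumption for illustration: neighbors are the cells at local offsets
    -2, -1, +1, +2 (wrapping inside the block), so block_size must be >= 5.
    """
    neighbors = []
    for b in range(num_blocks):
        start = b * block_size
        for k in range(block_size):
            neighbors.append(
                [start + (k + d) % block_size for d in (-2, -1, 1, 2)]
            )
    return neighbors


def in_block_fraction(neighbors, block_size):
    """Fraction of neighbor references that fall inside the diagonal block
    (of width block_size) containing the referencing cell."""
    total = inside = 0
    for cell, nbrs in enumerate(neighbors):
        for n in nbrs:
            total += 1
            if cell // block_size == n // block_size:
                inside += 1
    return inside / total if total else 1.0


def permute(neighbors, new_index):
    """Renumber cells so that old cell i becomes cell new_index[i]."""
    out = [None] * len(neighbors)
    for old, nbrs in enumerate(neighbors):
        out[new_index[old]] = [new_index[n] for n in nbrs]
    return out


def bfs_order(neighbors):
    """Breadth-first renumbering; returns new_index with new_index[old] = new.

    Connected cells receive consecutive numbers, which pulls neighbor
    references toward the main diagonal of the connectivity matrix.
    """
    new_index = [-1] * len(neighbors)
    next_id = 0
    for root in range(len(neighbors)):
        if new_index[root] != -1:
            continue
        new_index[root] = next_id
        next_id += 1
        queue = deque([root])
        while queue:
            cell = queue.popleft()
            for n in neighbors[cell]:
                if new_index[n] == -1:
                    new_index[n] = next_id
                    next_id += 1
                    queue.append(n)
    return new_index


def demo(num_blocks=8, block_size=16, seed=0):
    """Return the in-block fraction for the ideal, scrambled, and
    BFS-reordered numberings of the same synthetic connectivity."""
    ideal = synthetic_blocks(num_blocks, block_size)
    scramble = list(range(len(ideal)))
    random.Random(seed).shuffle(scramble)
    scrambled = permute(ideal, scramble)
    reordered = permute(scrambled, bfs_order(scrambled))
    return tuple(in_block_fraction(m, block_size)
                 for m in (ideal, scrambled, reordered))
```

Calling `demo()` returns the in-block fraction before scrambling, after scrambling, and after the breadth-first renumbering. In this idealized setting the renumbering recovers the block structure exactly, because each block is a connected component of equal size; on a real tetrahedral mesh the blocks are only approximate, and the best block size would depend on the hardware, as the abstract notes.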