Subudhi, S and Khillare, A and Munikrishna, N and Balakrishnan, N (2024) GPU accelerated Staggered Update Procedure (SUP). In: Computers and Fluids, 283.
Abstract
The advancement in the programmable capability of graphics hardware has opened new opportunities in the domain of high performance computing (HPC). The computational fluid dynamics (CFD) community, being a significant user of HPC, has started exploiting the inherent data parallelism in numerical solvers to make efficient use of these many-core, high-throughput, accelerator-based processors. In the present work, we examine the process of accelerating our CPU-based Staggered Update Procedure (SUP) solver, i.e., a higher-order accurate cell-centred finite volume solver, by off-loading the computationally most expensive region of the code, which pertains to the explicit residual computation. We have adopted OpenACC, a directive-based programming model, to expose parallelism in the code. The framework evolved for GPU porting in the context of SUP is also of value to those intending to port their CFD solvers based on classical finite volume methodology. The performance analysis is conducted using scalar convection–diffusion equations in both two and three dimensions. The findings demonstrate a speedup factor of 9 (in 2D) and 28 (in 3D) when considering the explicit residual alone, achieved with a single NVIDIA Tesla V100 GPU card. In addition, we could establish superior algorithmic scalability by way of recovering near-perfect serial performance on the heterogeneous CPU+GPU architecture. Further, overall code acceleration can be achieved by porting other parts of the solver to the GPU. © 2024 Elsevier Ltd
Item Type: | Journal Article |
---|---|
Publication: | Computers and Fluids |
Publisher: | Elsevier Ltd |
Additional Information: | The copyright for this article belongs to the Elsevier Ltd. |
Keywords: | Digital storage; Finite volume method; Graphics processing unit; Finite-volume method; Graphic processing unit; Graphics processing; High order finite volume method; High-order; Higher-order; Meshless; Meshless solver; OpenACC; Processing units; Speedup; Staggered update procedure; Computer graphics equipment |
Department/Centre: | Division of Mechanical Sciences > Aerospace Engineering (Formerly Aeronautical Engineering) |
Date Deposited: | 10 Sep 2024 09:48 |
Last Modified: | 10 Sep 2024 09:48 |
URI: | http://eprints.iisc.ac.in/id/eprint/86068 |