Mesh-particle interpolations on graphics processing units and multicore central processing units


METADATA ONLY
Loading...

Date

2011-06-13

Publication Type

Conference Paper

ETH Bibliography

yes

Citations

Altmetric
METADATA ONLY

Data

Rights / License

Abstract

Particle–mesh interpolations are fundamental operations for particle-in-cell codes, as implemented in vortex methods, plasma dynamics and electrostatics simulations. In these simulations, the mesh is used to solve the field equations and the gradients of the fields are used in order to advance the particles. The time integration of particle trajectories is performed through an extensive resampling of the flow field at the particle locations. The computational performance of this resampling turns out to be limited by the memory bandwidth of the underlying computer architecture. We investigate how mesh–particle interpolation can be efficiently performed on graphics processing units (GPUs) and multicore central processing units (CPUs), and we present two implementation techniques. The single-precision results for the multicore CPU implementation show an acceleration of 45–70×, depending on system size, and an acceleration of 85–155× for the GPU implementation over an efficient single-threaded C++ implementation. In double precision, we observe a performance improvement of 30–40× for the multicore CPU implementation and 20–45× for the GPU implementation. With respect to the 16-threaded standard C++ implementation, the present CPU technique leads to a performance increase of roughly 2.8–3.7× in single precision and 1.7–2.4× in double precision, whereas the GPU technique leads to an improvement of 9× in single precision and 2.2–2.8× in double precision.

Permanent link

Publication status

published

Editor

Book title

Volume

369 (1944)

Pages / Article No.

2164 - 2175

Publisher

Royal Society

Event

Conference on the Discrete Simulation of Fluid Dynamics (DSFD 2010)

Edition / version

Methods

Software

Geographic location

Date collected

Date created

Subject

Central processing units; Graphics processing units; High-performance computing; Mesh-particle

Organisational unit

03499 - Koumoutsakos, Petros (ehemalig) / Koumoutsakos, Petros (former) check_circle

Notes

Funding

Related publications and datasets