A Performance Study of Quantum ESPRESSO's PWscf Code on Multi-Core and GPU Systems
Author/Presenters
Event Type
Workshop

Accelerators
Benchmarks
Compiler Analysis and Optimization
Deep Learning
Effective Application of HPC
Energy
Exascale
GPU
I/O
Parallel Application Frameworks
Parallel Programming Languages, Libraries, Models and Notations
Performance
Simulation
Storage
TimeMonday, November 13th10:30am - 11am
Location704-706
DescriptionWe describe the porting of PWscf (Plane-Wave Self Consistent Field), a key component of the Quantum ESPRESSO open-source suite of codes for materials modeling, to GPU systems using CUDA Fortran. Kernel loop directives (CUF kernels) have been extensively used in order to have a single source code for both CPU and GPU implementations. The results of the GPU version have been carefully validated and the performance of the code on several GPU systems (both x86 and POWER8 based) has been compared with traditional Intel multi-core (CPU only) systems. This current GPU version can reduce the time-to-solution by an average factor of 2-3 running two different input cases widely used as benchmarks on small and large high performance computing systems.