Flexible Batched Sparse Matrix-Vector Product on GPUs
Event Type
Workshop

Algorithms
Exascale
Resiliency
SIGHPC Workshop
TimeMonday, November 13th11:10am - 11:30am
Location607
DescriptionWe propose a variety of batched routines for concurrently processing a large collection of small-size, independent sparse matrix vector products (SpMV) on graphics processing units (GPUs). These batched SpMV kernels are designed to be flexible in order to handle a batch of matrices which differ in size, nonzero count, and nonzero distribution. Furthermore, they support three most commonly used sparse storage formats: CSR, COO and ELL. Our experimental results on a state-of-the-art GPU reveal performance improvements of up to 25× compared to non-batched SpMV routines.