Tom Deakin is Lecturer in Advanced Computer Systems at the University of Bristol, researching the performance portability of massively parallel high-performance simulation codes. He has given tutorials and lecture series on parallel programming models including OpenMP, SYCL, and OpenCL.
Timothy G. Mattson is a senior principal engineer at Intel, where he has worked since 1993 on the first TFLOP computer; the creation of MPI, OpenMP, and OpenCL; HW/SW co-design of many-core processors; data management systems; and the GraphBLAS API for expressing graph algorithms as sparse linear algebra.
8: Asynchronous Offload to Multiple GPUs
Published: 2023
2023. "Asynchronous Offload to Multiple GPUs", Programming Your GPU with OpenMP: Performance Portability for GPUs, Tom Deakin, Timothy G. Mattson