Groute: Asynchronous multi-GPU programming model with applications to large-scale graph processing
Metadata only
Date
2020-08Type
- Journal Article
Abstract
© 2020 ACM. Nodes with multiple GPUs are becoming the platform of choice for high-performance computing. However, most applications are written using bulk-synchronous programming models, which may not be optimal for irregular algorithms that benefit from low-latency, asynchronous communication. This article proposes constructs for asynchronous multi-GPU programming and describes their implementation in a thin runtime environment called Groute. Groute also implements common collective operations and distributed work-lists, enabling the development of irregular applications without substantial programming effort. We demonstrate that this approach achieves state-of-the-art performance and exhibits strong scaling for a suite of irregular applications on eight-GPU and heterogeneous systems, yielding over 7× speedup for some algorithms. Show more
Publication status
publishedExternal links
Journal / series
ACM Transactions on Parallel ComputingVolume
Pages / Article No.
Publisher
Association for Computing MachinerySubject
Multi-GPU; asynchronous programming; irregular algorithmsMore
Show all metadata