dc.contributor.author
Eichenberger, Max
dc.contributor.supervisor
Hoefler, Torsten
dc.contributor.supervisor
Ben-Nun, Tal
dc.date.accessioned
2021-06-23T07:30:10Z
dc.date.available
2021-06-22T22:45:19Z
dc.date.available
2021-06-23T07:30:10Z
dc.date.issued
2021
dc.identifier.uri
http://hdl.handle.net/20.500.11850/490790
dc.identifier.doi
10.3929/ethz-b-000490790
dc.description.abstract
Graphics Processing Units (GPUs) are designed to process large amounts of memory quickly, but using their full capacity requires a deeper understanding of the underlying architecture. This thesis presents a simple yet flexible Copy API for efficiently moving N-dimensional data fragments between memory spaces on a GPU. We introduce different strategies for distributing fine-grained parallelism over a user-given workload. These strategies are then benchmarked to show the range of performance they can achieve. Finally, we demonstrate the Copy API in different algebraic applications, highlighting the advantages of having access to simple and flexible data movement functions.
en_US
dc.format
application/pdf
en_US
dc.language.iso
en
en_US
dc.publisher
ETH Zurich
en_US
dc.rights.uri
http://rightsstatements.org/page/InC-NC/1.0/
dc.subject
GPUs
en_US
dc.subject
Parallel computing
en_US
dc.subject
Data movement
en_US
dc.subject
CUDA
en_US
dc.title
Flexible Data Movement API for GPUs
en_US
dc.type
Bachelor Thesis
dc.rights.license
In Copyright - Non-Commercial Use Permitted
ethz.size
54 p.
en_US
ethz.publication.place
Zurich
en_US
ethz.publication.status
published
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02666 - Institut für Hochleistungsrechnersysteme / Inst. f. High Performance Computing Syst::03950 - Hoefler, Torsten / Hoefler, Torsten
en_US
ethz.leitzahl.certified
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02666 - Institut für Hochleistungsrechnersysteme / Inst. f. High Performance Computing Syst::03950 - Hoefler, Torsten / Hoefler, Torsten
en_US
ethz.date.deposited
2021-06-22T22:45:44Z
ethz.source
FORM
ethz.eth
yes
en_US
ethz.availability
Open access
en_US
ethz.rosetta.installDate
2021-06-23T07:30:21Z
ethz.rosetta.lastUpdated
2022-03-29T10:02:13Z
ethz.rosetta.versionExported
true
ethz.COinS
ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=Flexible%20Data%20Movement%20API%20for%20GPUs&rft.date=2021&rft.au=Eichenberger,%20Max&rft.genre=unknown&rft.btitle=Flexible%20Data%20Movement%20API%20for%20GPUs