dc.contributor.author
Cesarini, Daniele
dc.contributor.author
Bartolini, Andrea
dc.contributor.author
Bonfà, Pietro
dc.contributor.author
Cavazzoni, Carlo
dc.contributor.author
Benini, Luca
dc.date.accessioned
2021-04-21T05:58:25Z
dc.date.available
2021-04-20T03:22:18Z
dc.date.available
2021-04-21T05:58:25Z
dc.date.issued
2021-05-01
dc.identifier.issn
0018-9340
dc.identifier.issn
1557-9956
dc.identifier.other
10.1109/TC.2020.2995269
en_US
dc.identifier.uri
http://hdl.handle.net/20.500.11850/479431
dc.description.abstract
Power and energy consumption are becoming key challenges in the supercomputers’ exascale race. HPC systems’ processors waste active power during communication and synchronization among the MPI processes in large-scale HPC applications. However, due to the time scale at which communication happens, transitioning into low-power states while waiting for the completion of each communication may introduce unacceptable overhead. In this article, we present COUNTDOWN, a run-time library for identifying and automatically reducing the power consumption of the CPUs during communication and synchronization. COUNTDOWN saves energy without penalizing the time-to-completion by lowering the CPUs’ power consumption only during idle times for which the power state transition overhead is negligible. This is done transparently to the user, requiring neither labor-intensive and error-prone application code modifications nor recompilation of the application. We test our methodology on a production Tier-1 system. For the NAS benchmarks, COUNTDOWN saves between 6 and 50 percent energy, with a time-to-solution penalty lower than 5 percent. For a complete production application, Quantum ESPRESSO, in a 3.5K-core run, COUNTDOWN saves 22.36 percent energy, with a performance penalty below 3 percent. Energy saving increases to 37 percent, with a performance penalty of 6.38 percent, if the application is executed without communication tuning.
en_US
dc.language.iso
en
en_US
dc.publisher
IEEE
en_US
dc.subject
HPC
en_US
dc.subject
MPI
en_US
dc.subject
profiling
en_US
dc.subject
power management
en_US
dc.subject
energy/power saving
en_US
dc.subject
idleness
en_US
dc.subject
DVFS
en_US
dc.subject
DDCM
en_US
dc.subject
C-states
en_US
dc.subject
P-states
en_US
dc.subject
T-states
en_US
dc.title
COUNTDOWN: A Run-Time Library for Performance-Neutral Energy Saving in MPI Applications
en_US
dc.type
Journal Article
dc.date.published
2020-05-18
ethz.journal.title
IEEE Transactions on Computers
ethz.journal.volume
70
en_US
ethz.journal.issue
5
en_US
ethz.journal.abbreviated
IEEE trans. comput.
ethz.pages.start
682
en_US
ethz.pages.end
695
en_US
ethz.identifier.wos
ethz.identifier.scopus
ethz.publication.place
New York, NY
en_US
ethz.publication.status
published
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02140 - Dep. Inf.technologie und Elektrotechnik / Dep. of Inform.Technol. Electrical Eng.::02636 - Institut für Integrierte Systeme / Integrated Systems Laboratory::03996 - Benini, Luca / Benini, Luca
ethz.leitzahl.certified
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02140 - Dep. Inf.technologie und Elektrotechnik / Dep. of Inform.Technol. Electrical Eng.::02636 - Institut für Integrierte Systeme / Integrated Systems Laboratory::03996 - Benini, Luca / Benini, Luca
ethz.date.deposited
2021-04-20T03:22:31Z
ethz.source
SCOPUS
ethz.eth
yes
en_US
ethz.availability
Metadata only
en_US
ethz.rosetta.installDate
2021-04-21T05:58:35Z
ethz.rosetta.lastUpdated
2021-04-21T05:58:35Z
ethz.rosetta.exportRequired
true
ethz.rosetta.versionExported
true
ethz.COinS
ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=COUNTDOWN:%20A%20Run-Time%20Library%20for%20Performance-Neutral%20Energy%20Saving%20in%20MPI%20Applications&rft.jtitle=IEEE%20Transactions%20on%20Computers&rft.date=2021-05-01&rft.volume=70&rft.issue=5&rft.spage=682&rft.epage=695&rft.issn=0018-9340&1557-9956&rft.au=Cesarini,%20Daniele&Bartolini,%20Andrea&Bonf%C3%A0,%20Pietro&Cavazzoni,%20Carlo&Benini,%20Luca&rft.genre=article&rft_id=info:doi/10.1109/TC.2020.2995269&
