COUNTDOWN - A Run-time Library for Application-agnostic Energy Saving in MPI Communication Primitives
Bonfà, PieroShow all
- Conference Paper
Rights / licenseIn Copyright - Non-Commercial Use Permitted
Energy and power consumption are prominent issues in today’s supercomputers and are foreseen as a limiting factor of future installations. In scientific computing, a significant amount of power is spent in the communication and synchronization-related idle times among distributed processes participating to the same application. However, due to the time scale at which communication happens, taking advantage of low-power states to reduce power in idle times in the computing resources, may introduce significant overheads. In this paper we present COUNTDOWN, a methodology and a tool for identifying and automatically reducing the frequency of the computing elements in order to save energy during communication and synchronization primitives. COUNTDOWN is able to filter out phases which would detriment the time to solution of the application transparently to the user, without touching the application code nor requiring recompilation of the application. We tested our methodology in a production Tier-0 system, a production application - Quantum ESPRESSO (QE) - with production datasets which can scale up to 3.5K cores. Experimental results show that our methodology saves 22.36% of energy consumption with a performance penalty of 2.88% in real production MPI-based application. Show more
Cardoso, João M.P.
Book titleANDARE '18 Proceedings of the 2nd Workshop on AutotuniNg and aDaptivity AppRoaches for Energy efficient HPC Systems
Pages / Article No.
SubjectHPC; MPI; profiling; power management; idleness; DVFS; DDCM; C-states; P-states; T-states; hardware performance counters; timer; energy saving; power saving
Organisational unit03996 - Benini, Luca / Benini, Luca
NotesConference lecture held on November 4, 2018
MoreShow all metadata