Abstract
Empirical performance modeling is a proven instrument to analyze the scaling behavior of HPC applications. Using a set of smaller-scale experiments, it can provide important insights into application behavior at larger scales. Extra-P is an empirical modeling tool that applies linear regression to automatically generate human-readable performance models. Similar to other regression-based modeling techniques, the accuracy of the models created by Extra-P decreases as the amount of noise in the underlying data increases. This is why the performance variability observed in many contemporary systems can become a serious challenge. In this paper, we introduce a novel adaptive modeling approach that makes Extra-P more noise resilient, exploiting the ability of deep neural networks to discover the effects of numerical parameters, such as the number of processes or the problem size, on performance when dealing with noisy measurements. Using synthetic analysis and data from three different case studies, we demonstrate that our solution improves the model accuracy at high noise levels by up to 25% while increasing their predictive power by about 15%. Show more
Publication status
publishedExternal links
Book title
2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS)Pages / Article No.
Publisher
IEEEEvent
Subject
Performance analysis; Performance modeling; Deep learning; Artificial neural networks; High performance computing; Parallel processingOrganisational unit
03950 - Hoefler, Torsten / Hoefler, Torsten
More
Show all metadata