Metadata only
Date
2024-04
Type
- Conference Paper
ETH Bibliography
yes
Abstract
The widespread adoption of ML has led to high demand for GPU hardware and, consequently, severe GPU shortages in the public cloud. Allocating enough GPUs to train or fine-tune today's large ML models within a single cloud region is often difficult. Users can access more GPUs if they are willing to run an ML training job on devices spread across different geographical regions. However, such GPU nodes are connected by lower network bandwidth, and cloud providers charge extra for data transfers across geographical regions. In this work, we explore when and how it makes sense to leverage GPUs across zones and regions for distributed ML training. We analyze the throughput and cost impact of cross-region training based on the computation and communication patterns of different model parallelism strategies, develop a profile-based analytical model for estimating training throughput and cost, and provide guidelines for allocating geo-distributed resources efficiently. We find that although ML training throughput and cost with pure data parallelism degrade significantly when nodes span geographic regions, cross-region training with pipeline parallelism is practical.
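The abstract's core intuition can be illustrated with a back-of-the-envelope model. The following sketch is not the paper's profile-based analytical model; it is a minimal, hedged comparison of per-step cross-region traffic under data parallelism versus pipeline parallelism, with all parameter sizes, node counts, and egress prices chosen as illustrative assumptions.

```python
# Illustrative sketch (not the paper's model): compare per-step cross-region
# traffic for data parallelism (DP) vs pipeline parallelism (PP).
# All constants below are assumptions for illustration only.

def dp_bytes_per_step(param_bytes: float, nodes: int) -> float:
    """Ring all-reduce of gradients moves ~2*(n-1)/n * P bytes per node per step."""
    return 2 * (nodes - 1) / nodes * param_bytes

def pp_bytes_per_step(act_bytes: float, microbatches: int) -> float:
    """A pipeline-stage boundary moves activations forward and activation
    gradients backward once per micro-batch."""
    return 2 * microbatches * act_bytes

def egress_cost_usd(bytes_moved: float, usd_per_gb: float) -> float:
    """Cross-region egress cost at a flat per-GB rate (assumed pricing)."""
    return bytes_moved / 1e9 * usd_per_gb

# Assumed workload: 7B parameters in fp16 (~14 GB of gradients),
# 32 MB of boundary activations per micro-batch, 16 micro-batches, 8 nodes.
dp = dp_bytes_per_step(14e9, nodes=8)      # ~24.5 GB crosses regions per step
pp = pp_bytes_per_step(32e6, microbatches=16)  # ~1.0 GB per step

print(dp / pp)  # DP moves roughly 24x more cross-region data here
print(egress_cost_usd(dp, usd_per_gb=0.09))  # per-step egress cost under DP
```

Under these assumptions, data parallelism sends the full gradient volume across the slow, billed inter-region links every step, while pipeline parallelism only exchanges stage-boundary activations, which is consistent with the abstract's finding that cross-region pipeline parallelism remains practical.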
Publication status
published
Book title
EuroMLSys '24: Proceedings of the 4th Workshop on Machine Learning and Systems
Publisher
Association for Computing Machinery
Subject
Machine Learning; Cloud computing
Funding
204620 - MLin: Machine Learning Input Data Processing as a Service (SNF)