On Sample Selection for Continual Learning: a Video Streaming Case Study


Loading...

Date

2024-04

Publication Type

Journal Article

ETH Bibliography

yes

Citations

Altmetric

Data

Abstract

Machine learning (ML) is a powerful tool to model the complexity of communication networks. As networks evolve, we cannot only train once and deploy. Retraining models, known as continual learning, is necessary. Yet, to date, there is no established methodology to answer the key questions: With which samples to retrain? When should we retrain? We address these questions with the sample selection system Memento, which maintains a training set with the "most useful" samples to maximize sample space coverage. Memento particularly benefits rare patterns—the notoriously long "tail" in networking—and allows assessing rationally when retraining may help, i.e., when the coverage changes. We deployed Memento on Puffer, the live-TV streaming project, and achieved a 14% reduction of stall time, 3.5× the improvement of random sample selection. Memento is model-agnostic and can be applied beyond video streaming.

Publication status

published

Editor

Book title

Volume

54 (2)

Pages / Article No.

10 - 35

Publisher

Association for Computing Machinery

Event

Edition / version

Methods

Software

Geographic location

Date collected

Date created

Subject

video streaming; Machine Learning; Continual learning

Organisational unit

09477 - Vanbever, Laurent / Vanbever, Laurent check_circle

Notes

Funding

Related publications and datasets