Towards Network Model Generalization using Strategic Data Collection


Loading...

Date

2025

Publication Type

Other Conference Item

ETH Bibliography

yes

Citations

Scopus:
Altmetric

Data

Abstract

Essential networking applications, such as video streaming, require accurate network models to estimate current and future network states (e.g., is the network congested?). Due to the complexity of today’s networks and the subsequent difficulty of this modeling task, Machine Learning (ML)-based approaches have emerged as an alternative to first-principle modeling methods. However, proposed ML algorithms suffer from a generalization crisis: they often fail to perform in deployments outside of their training environment. Moreover, simple solutions such as naively training on more data do not guarantee improved generalization performance. We propose an interpretable approach to improving model gen- eralization by focusing on the quality of a dataset over sample quantity already during data collection. Notably, our approach’s interpretability allows us to reason on which environments to pri- oritize at the data acquisition stage. To this end, we investigate the impact of dataset metrics such as Round Trip Time (RTT) and throughput on both in-distribution (ID) and out-of-distribution (OOD) model performance. Our results suggest that strategically performing data collection in environments with broader state- space coverage in areas of higher RTT and lower throughput is key to achieving improved model generalization and OOD performance.

Publication status

published

Editor

Book title

ACM SIGCOMM Posters and Demos '25: Proceedings of the ACM SIGCOMM 2025 Posters and Demos

Journal / series

Volume

Pages / Article No.

31 - 33

Publisher

Association for Computing Machinery

Event

ACM SIGCOMM 2025

Edition / version

Methods

Software

Geographic location

Date collected

Date created

Subject

Network dynamics; Machine learning; Artificial intelligence

Organisational unit

09477 - Vanbever, Laurent / Vanbever, Laurent check_circle

Notes

Funding

Related publications and datasets