FAst In-Network GraY Failure Detection for ISPs
METADATA ONLY
Loading...
Author / Producer
Date
2022-08
Publication Type
Conference Paper
ETH Bibliography
yes
Citations
Altmetric
METADATA ONLY
Data
Rights / License
Abstract
Avoiding packet loss is crucial for ISPs. Unfortunately, malfunctioning hardware at ISPs can cause long-lasting packet drops, also known as gray failures, which are undetectable by existing monitoring tools. In this paper, we describe the design and implementation of FANcY, an ISP-Targeted system that detects and localizes gray failures quickly and accurately. FANcY complements previous monitoring approaches, which are mainly tailored for low-delay networks such as data center networks and do not work at ISP scale. We experimentally confirm FANcY's capability to accurately detect gray failures in seconds, as long as only tiny fractions of traffic experience losses. We also implement FANcY in an Intel Tofino switch, demonstrating how it enables fine-grained fast rerouting.
Permanent link
Publication status
published
External links
Editor
Book title
SIGCOMM '22: Proceedings of the ACM SIGCOMM 2022 Conference
Journal / series
Volume
Pages / Article No.
677 - 692
Publisher
Association for Computing Machinery
Event
36th ACM SiGCOMM Conference (SIGCOMM 2022)
Edition / version
Methods
Software
Geographic location
Date collected
Date created
Subject
Failure detection; Measurements; Network Hardware; Programmable data planes
Organisational unit
09477 - Vanbever, Laurent / Vanbever, Laurent
Notes
Conference lecture on August 25, 2022