FAst In-Network GraY Failure Detection for ISPs


METADATA ONLY
Loading...

Date

2022-08

Publication Type

Conference Paper

ETH Bibliography

yes

Citations

Altmetric
METADATA ONLY

Data

Rights / License

Abstract

Avoiding packet loss is crucial for ISPs. Unfortunately, malfunctioning hardware at ISPs can cause long-lasting packet drops, also known as gray failures, which are undetectable by existing monitoring tools. In this paper, we describe the design and implementation of FANcY, an ISP-Targeted system that detects and localizes gray failures quickly and accurately. FANcY complements previous monitoring approaches, which are mainly tailored for low-delay networks such as data center networks and do not work at ISP scale. We experimentally confirm FANcY's capability to accurately detect gray failures in seconds, as long as only tiny fractions of traffic experience losses. We also implement FANcY in an Intel Tofino switch, demonstrating how it enables fine-grained fast rerouting.

Permanent link

Publication status

published

Editor

Book title

SIGCOMM '22: Proceedings of the ACM SIGCOMM 2022 Conference

Journal / series

Volume

Pages / Article No.

677 - 692

Publisher

Association for Computing Machinery

Event

36th ACM SiGCOMM Conference (SIGCOMM 2022)

Edition / version

Methods

Software

Geographic location

Date collected

Date created

Subject

Failure detection; Measurements; Network Hardware; Programmable data planes

Organisational unit

09477 - Vanbever, Laurent / Vanbever, Laurent check_circle

Notes

Conference lecture on August 25, 2022

Funding

Related publications and datasets