Interpretable and explainable machine learning: A methods-centric overview with concrete examples
Open access
Date
2023-02-28Type
- Review Article
Abstract
Interpretability and explainability are crucial for machine learning (ML) and statistical applications in medicine, economics, law, and natural sciences and form an essential principle for ML model design and development. Although interpretability and explainability have escaped a precise and universal definition, many models and techniques motivated by these properties have been developed over the last 30 years, with the focus currently shifting toward deep learning. We will consider concrete examples of state-of-the-art, including specially tailored rule-based, sparse, and additive classification models, interpretable representation learning, and methods for explaining black-box models post hoc. The discussion will emphasize the need for and relevance of interpretability and explainability, the divide between them, and the inductive biases behind the presented “zoo” of interpretable models and explanation methods. Show more
Permanent link
https://doi.org/10.3929/ethz-b-000601093Publication status
publishedExternal links
Journal / series
WIREs Data Mining And Knowledge DiscoveryVolume
Pages / Article No.
Publisher
WileySubject
Explainable AI; Machine Learning; InterpretabilityOrganisational unit
09670 - Vogt, Julia / Vogt, Julia
Related publications and datasets
Is new version of: https://doi.org/10.3929/ethz-b-000454597
More
Show all metadata