WoodFisher: Efficient Second-Order Approximation for Neural Network Compression
- Conference Paper
Second-order information, in the form of Hessian- or Inverse-Hessian-vector products, is a fundamental tool for solving optimization problems. Recently, there has been significant interest in utilizing this information in the context of deep neural networks; however, relatively little is known about the quality of existing approximations in this context. Our work considers this question, examines the accuracy of existing approaches, and proposes a method called WoodFisher to compute a faithful and efficient estimate of the inverse Hessian.</p> <p>Our main application is to neural network compression, where we build on the classic Optimal Brain Damage/Surgeon framework. We demonstrate that WoodFisher significantly outperforms popular state-of-the-art methods for one-shot pruning. Further, even when iterative, gradual pruning is allowed, our method results in a gain in test accuracy over the state-of-the-art approaches for popular image classification datasets such as ImageNet ILSVRC. Further, we show how our method can be extended to take into account first-order information, and illustrate its ability to automatically set layer-wise pruning thresholds, or perform compression in the limited-data regime. Mehr anzeigen
BuchtitelAdvances in Neural Information Processing Systems 33
Seiten / Artikelnummer
Organisationseinheit09462 - Hofmann, Thomas / Hofmann, Thomas
AnmerkungenDue to the Coronavirus (COVID-19) the conference was conducted virtually.