Interpreting Deep Neural Networks Through Variable Importance

Given a trained model, which features are the most important?

Featured image

Video

Variable importance is one possible approach to achieve global interpretability, where the goal is to rank each input feature based on its contributions to predictive accuracy. This is in contrast to local interpretability, which aims to simply provide an explanation behind a specific prediction or group of predictions.

Here, This paper describes an approach to achieve global interpretability for deep neural networks using “RelATive cEntrality” (RATE), a recently-proposed variable importance criterion for Bayesian models (Gaussian process regression).

Furthermore, they present a unified framework under which RATE can be applied to deep neural networks and propose GroupRATE for ranking groups of variables.