Emerton Data — X-SHAP: towards multiplicative explainability of Machine Learning

At a glance

Interpreting the outputs of a machine learning model can be as important as the predictions themselves, e.g. in insurance pricing, credit rejection or acceptance, recommendations to decision makers, or medical diagnosis. Users need to understand the factors underlying a prediction. Model interpretability also makes it possible to better audit the robustness and fairness of predictive models. Simple models such as linear regressions or GLMs are quite accurate and easily interpretable.

By contrast, more complex models, such as machine learning ensemble models or deep learning models, are highly accurate but difficult to interpret. Choosing between a more accurate model and a simple, interpretable one is not easy.

In many cases, the simple interpretable model is still preferred. To address the accuracy-interpretability trade-off, a large number of interpretability methods have been proposed. Notably, all of these methods compute additive contributions; none of them can assess multiplicative contributions. In this paper, we introduce X-SHAP, a model-agnostic interpretability method that provides multiplicative contributions for individual predictions. Our main contributions are summarized as follows:

We extend the additive analytical solution to the model-agnostic multiplicative interpretability problem.
We introduce X-SHAP, an algorithm that provides approximate multiplicative contributions at the individual level.
We propose the X-SHAP toolbox, a new set of techniques used to understand global and segmented model structure by aggregating multiple local contributions.
We empirically verify desirable properties and compare the X-SHAP approach both to the additive algorithm Kernel SHAP and to well-known metrics on various supervised problems.
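
To make the additive versus multiplicative distinction concrete, the two kinds of decomposition can be written side by side. The notation is an assumption introduced here for illustration, not taken from the text above: f(x) is the prediction for an instance x with M features, \phi_0 is a baseline value and \phi_i is the contribution of feature i.

```latex
% Additive attribution (the form used by Kernel SHAP):
% contributions are added to the baseline.
f(x) = \phi_0 + \sum_{i=1}^{M} \phi_i

% Multiplicative attribution (assumed form):
% contributions scale the baseline.
f(x) = \phi_0 \prod_{i=1}^{M} \phi_i
```

Read this way, a multiplicative contribution of 1 leaves the baseline unchanged, a value above 1 increases it and a value below 1 decreases it.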

Impact

X-SHAP offers a robust and model-agnostic methodology to assess multiplicative contributions. This unique method strengthens the set of techniques and tools contributing to making machine learning more transparent, auditable and accessible.

This method is expected to prove useful when the modeled phenomenon has an underlying multiplicative structure, for instance in areas where modelers are used to applying log-GLMs (e.g. actuaries modeling claims, epidemic spread modeling, disease risk factor estimation, energy consumption forecasting). It is provided as a tool that can help these experts adopt machine learning models with an appropriate interpretability framework that fits their habits.
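
The multiplicative reading that log-GLM users are accustomed to can be illustrated with a toy log-link model. The sketch below is not the X-SHAP algorithm: it uses the exact decomposition available for a log-linear model, and the coefficients, feature names and reference profile are all hypothetical values chosen for illustration.

```python
import math

# Hypothetical log-link model: prediction = exp(intercept + sum(coef_i * x_i)).
# Coefficients, feature names and the reference profile are invented.
INTERCEPT = math.log(100.0)
COEFS = {"age_band": 0.2, "vehicle_power": 0.1}
REFERENCE = {"age_band": 1.0, "vehicle_power": 5.0}


def predict(x):
    """Prediction of the toy log-link model for one profile."""
    linear = INTERCEPT + sum(COEFS[name] * x[name] for name in COEFS)
    return math.exp(linear)


def multiplicative_contributions(x):
    """Factor by which each feature scales the reference prediction."""
    return {
        name: math.exp(COEFS[name] * (x[name] - REFERENCE[name]))
        for name in COEFS
    }


profile = {"age_band": 3.0, "vehicle_power": 8.0}
baseline = predict(REFERENCE)
factors = multiplicative_contributions(profile)

# The baseline times the product of the factors recovers the prediction.
reconstructed = baseline * math.prod(factors.values())
assert math.isclose(reconstructed, predict(profile))

for name, factor in factors.items():
    print(f"{name}: x{factor:.3f}")
```

For a log-linear model these factors are available in closed form; a model-agnostic method is needed when the model is an ensemble or a neural network for which no such closed form exists.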