Skip to content

This is a repository for reproducibility purposes. In this research, a large number of datasets were used to create different ML models, which were then explained by XAI measures. Seeking to identify situations where XAI measures agreed or disagreed with each other.

Notifications You must be signed in to change notification settings

josesousaribeiro/XAI-Benchmark

Repository files navigation

XAI Benchmark

This repository has research analysis data for Explainable Artificial Intelligence - XAI.

The research group responsible for this study is composed of:

José Ribeiro - site: https://sites.google.com/view/jose-sousa-ribeiro

Raíssa Silva - site: https://sites.google.com/site/silvarailors

Lucas Cardoso - site: http://lattes.cnpq.br/9591352011725008

Ronnie Alves (Leader) - site: https://sites.google.com/site/alvesrco/

Abstract

Strategies based on Explainable Artificial Intelligence - XAI have emerged in computing to promote a better understanding of predictions made by black box models. Most XAI measures used today explain these types of models, generating attribute rankings aimed at explaining the model, that is, the analysis of Attribute Importance of Model. There is no consensus on which XAI measure generates an overall explainability rank. For this reason, several proposals for tools have emerged (Ciu, Dalex, Eli5, Lofo, Shap and Skater). An experimental benchmark of explainable AI techniques capable of producing global explainability ranks based on tabular data related to different problems and ensemble models are presented herein. Seeking to answer questions such as "Are the explanations generated by the different measures the same, similar or different?" and How does data complexity play along model explainability?" The results from the construction of 82 computational models and 592 ranks shed some light on the other side of the problem of explainability: dataset complexity!

Proposal

This research raises discussions for the current moment in the XAI area through two main questions: --- Considering the current tools aimed at explaining black box machine learning models, it can be inferred that they generate global rankings of same, similar, or different explainabilities? --- Following the same idea as in the previous question, are the generations of equal, similar, or different explainabilities related to specific properties of a dataset?

Seeking to answer the two hypotheses presented above, this research emerges as a comparative analysis of different XAI metrics, capable of producing model-agnostic global explainability ranks based on tabular data related to different problems.

Benchmark

In the benchmark proposed by this research, the explanatory ranks of 6 different XAI metrics (Ciu, Dalex, Eli5, Lofo, Shap and Skater) are compared using Spearman coefficients, resulting in conclusive results.

The methodology is presented in the diagram below:

alt text

All the information needed for a better understanding and reproducibility of the procedures performed in the research can be accessed directly on the notebook (Google Colab) available at root, as well as in the files folders.

Cite this work

@inproceedings{ribeiro2021does,
  title={Does Dataset Complexity Matters for Model Explainers?},
  author={Ribeiro, Jos{\'e} and Silva, Ra{\'\i}ssa and Cardoso, Lucas and Alves, Ronnie},
  booktitle={2021 IEEE International Conference on Big Data (Big Data)},
  pages={5257--5265},
  year={2021},
  organization={IEEE}
}

About

This is a repository for reproducibility purposes. In this research, a large number of datasets were used to create different ML models, which were then explained by XAI measures. Seeking to identify situations where XAI measures agreed or disagreed with each other.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published