Abstract
Multilevel models using logistic regression (MLogRM) and random forest models (RFM) are increasingly deployed in industry for the purpose of binary classification. The European Commission’s proposed Artificial Intelligence Act (AIA) necessitates, under certain conditions, that application of such models is fair, transparent, and ethical, which consequently implies technical assessment of these models. This paper proposes and demonstrates an audit framework for technical assessment of RFMs and MLogRMs by focussing on model-, discrimination-, and transparency & explainability-related aspects. To measure these aspects 20 KPIs are proposed, which are paired to a traffic light risk assessment method. An open-source dataset is used to train a RFM and a MLogRM model and these KPIs are computed and compared with the traffic lights. The performance of popular explainability methods such as kernel- and tree-SHAP are assessed. The framework is expected to assist regulatory bodies in performing conformity assessments of binary classifiers and also benefits providers and users deploying such AI-systems to comply with the AIA.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 15th International Conference on Agents and Artificial Intelligence - (Volume 2) |
| Editors | Ana Paula Rocha , Luc Steels, Jaap van den Herik |
| Place of Publication | Lisbon |
| Pages | 312-324 |
| Volume | 2 |
| DOIs | |
| Publication status | Published - 2023 |
| Event | 15th International Conference on Agents and Artificial Intelligence (ICAART 2023) - Lisbon, Portugal Duration: 22 Feb 2023 → 24 Feb 2023 |
Conference
| Conference | 15th International Conference on Agents and Artificial Intelligence (ICAART 2023) |
|---|---|
| Country/Territory | Portugal |
| City | Lisbon |
| Period | 22/02/23 → 24/02/23 |
Fingerprint
Dive into the research topics of 'An Audit Framework for Technical Assessment of Binary Classifiers'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver