A confusion matrix for evaluating feature attribution methods
Cita com:
hdl:2117/399128
Document typeConference lecture
Defense date2023
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Rights accessOpen Access
All rights reserved. This work is protected by the corresponding intellectual and industrial
property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public
communication or transformation of this work are prohibited without permission of the copyright holder
Abstract
The increasing use of deep learning models in critical areas of computer vision and the consequent need for insights into model behaviour have led to the development of numerous feature attribution methods. However, these attributions must be both meaningful and plausible to end-users, which is not always the case. Recent research has emphasized the importance of faithfulness in attributions, as plausibility without faithfulness can result in misleading explanations and incorrect decisions. In this work., we propose a novel approach to evaluate the faithfulness of feature attribution methods by constructing an ‘Attribution Confusion Matrix’, which allows us to leverage a wide range of existing metrics from the traditional confusion matrix. This approach effectively introduces multiple evaluation measures for faithfulness in feature attribution methods in a unified and consistent framework. We demonstrate the effectiveness of our approach on various datasets, attribution methods, and models, emphasizing the importance of faithfulness in generating plausible and reliable explanations while also illustrating the distinct behaviour of different feature attribution methods.
Description
© 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes,creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
CitationArias, A. [et al.]. A confusion matrix for evaluating feature attribution methods. A: IEEE Conference on Computer Vision and Pattern Recognition. "2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops: Vancouver, Canada, 18-22 June 2023: proceedings". Institute of Electrical and Electronics Engineers (IEEE), 2023, p. 3709-3714. ISBN 979-8-3503-0249-3. DOI 10.1109/CVPRW59228.2023.00380.
ISBN979-8-3503-0249-3
Publisher versionhttps://ieeexplore.ieee.org/document/10208308
Files | Description | Size | Format | View |
---|---|---|---|---|
Arias-Duart_A_C ... thods_CVPRW_2023_paper.pdf | 470,9Kb | View/Open |