Applying and verifying an explainability method based on policy graphs in the context of reinforcement learning
dc.contributor.author | Climent Muñoz, Antoni |
dc.contributor.author | Gnatyshak, Dmitry |
dc.contributor.author | Álvarez Napagao, Sergio |
dc.contributor.other | Universitat Politècnica de Catalunya. Doctorat en Intel·ligència Artificial |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Ciències de la Computació |
dc.contributor.other | Barcelona Supercomputing Center |
dc.date.accessioned | 2021-11-16T13:30:48Z |
dc.date.available | 2021-11-16T13:30:48Z |
dc.date.issued | 2021 |
dc.identifier.citation | Climent, A.; Gnatyshak, D.; Álvarez-Napagao, S. Applying and verifying an explainability method based on policy graphs in the context of reinforcement learning. In: International Conference of the Catalan Association for Artificial Intelligence. "Artificial Intelligence Research and Development: proceedings of the 23rd International Conference of the Catalan Association for Artificial Intelligence". IOS Press, 2021, p. 455-464. ISBN 978-1-64368-211-2. DOI 10.3233/FAIA210166. |
dc.identifier.isbn | 978-1-64368-211-2 |
dc.identifier.uri | http://hdl.handle.net/2117/356542 |
dc.description.abstract | Advances in explainability techniques are highly relevant to Reinforcement Learning (RL), and their applications can benefit the development of intelligent agents that humans can understand and that are able to cooperate with them. Some approaches for Deep RL already exist in the literature, but a common problem is that it can be difficult to determine whether the explanations generated for an agent really reflect the behaviour of the trained agent. In this work we apply an explainability approach based on the creation of a Policy Graph (PG) that represents the agent’s behaviour. Our main contribution is a way to measure the similarity between the explanations and the agent’s behaviour, by building another agent that follows a policy based on the explainability method and comparing the behaviour of both agents. |
dc.format.extent | 10 p. |
dc.language.iso | eng |
dc.publisher | IOS Press |
dc.rights | Attribution-NonCommercial 4.0 International |
dc.rights.uri | https://creativecommons.org/licenses/by-nc/4.0/ |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Agents intel·ligents |
dc.subject.lcsh | Intelligent agents (Computer software) |
dc.subject.lcsh | Reinforcement learning |
dc.subject.other | Explainable AI |
dc.subject.other | Policy graphs |
dc.title | Applying and verifying an explainability method based on policy graphs in the context of reinforcement learning |
dc.type | Conference report |
dc.subject.lemac | Agents intel·ligents (Programari) |
dc.subject.lemac | Aprenentatge per reforç |
dc.contributor.group | Universitat Politècnica de Catalunya. KEMLG - Grup d'Enginyeria del Coneixement i Aprenentatge Automàtic |
dc.identifier.doi | 10.3233/FAIA210166 |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | https://ebooks.iospress.nl/doi/10.3233/FAIA210166 |
dc.rights.access | Open Access |
local.identifier.drac | 32211581 |
dc.description.version | Postprint (published version) |
local.citation.author | Climent, A.; Gnatyshak, D.; Álvarez-Napagao, S. |
local.citation.contributor | International Conference of the Catalan Association for Artificial Intelligence |
local.citation.publicationName | Artificial Intelligence Research and Development: proceedings of the 23rd International Conference of the Catalan Association for Artificial Intelligence |
local.citation.startingPage | 455 |
local.citation.endingPage | 464 |
Except where otherwise noted, content on this work is licensed under a Creative Commons license: Attribution-NonCommercial 4.0 International