Show simple item record

dc.contributor.author: Agostini, Alejandro Gabriel
dc.contributor.author: Celaya Llover, Enric
dc.contributor.other: Institut de Robòtica i Informàtica Industrial
dc.date.accessioned: 2015-06-29T18:57:10Z
dc.date.available: 2015-06-29T18:57:10Z
dc.date.created: 2014
dc.date.issued: 2014
dc.identifier.citation: Agostini, A.; Celaya, E. "Competitive function approximation for reinforcement learning". 2014.
dc.identifier.uri: http://hdl.handle.net/2117/28454
dc.description.abstract: The application of reinforcement learning to problems with continuous domains requires representing the value function by means of function approximation. We identify two aspects of reinforcement learning that make the function approximation process hard: non-stationarity of the target function and biased sampling. Non-stationarity results from the bootstrapping nature of dynamic programming, where the value function is estimated using its current approximation. Biased sampling occurs when some regions of the state space are visited too often, causing reiterated updates with similar values that fade out the occasional updates of infrequently sampled regions. We propose a competitive approach to function approximation in which many different local approximators are available at a given input, and the one with the expectedly best approximation is selected by means of a relevance function. The local nature of the approximators allows their fast adaptation to non-stationary changes and mitigates the biased sampling problem. The coexistence of multiple approximators updated and tried in parallel permits obtaining a good estimate much faster than would be possible with a single approximator. Experiments on different benchmark problems show that the competitive strategy provides faster and more stable learning than non-competitive approaches.
dc.format.extent: 32 p.
dc.language.iso: eng
dc.relation.ispartofseries: IRI-TR-14-05
dc.rights: Attribution-NonCommercial-NoDerivs 3.0 Spain
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subject: Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial
dc.subject.other: learning (artificial intelligence)
dc.subject.other: reinforcement learning
dc.subject.other: competitive strategy
dc.subject.other: Gaussian mixture model
dc.title: Competitive function approximation for reinforcement learning
dc.type: External research report
dc.contributor.group: Universitat Politècnica de Catalunya. VIS - Visió Artificial i Sistemes Intel·ligents
dc.subject.inspec: Classificació INSPEC::Cybernetics::Artificial intelligence
dc.rights.access: Open Access
local.identifier.drac: 15302509
dc.description.version: Preprint
local.citation.author: Agostini, A.; Celaya, E.
local.citation.publicationName: Competitive function approximation for reinforcement learning
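The competitive scheme described in the abstract can be sketched in code. This is only an illustrative toy, not the report's method: the class and function names (`LocalApproximator`, `competitive_predict`), the Gaussian form of the relevance function, and the 1-D supervised target are all assumptions made for this example. It shows the core idea of several local models coexisting, with a relevance function selecting the expectedly best one at each input, and updates staying local to the selected model.

```python
import numpy as np

class LocalApproximator:
    """A local linear model with a Gaussian relevance function.
    (Illustrative; names and relevance form are assumed, not from the report.)"""
    def __init__(self, center, width, lr=0.1):
        self.center = float(center)
        self.width = float(width)
        self.w = 0.0   # local slope
        self.b = 0.0   # local offset
        self.lr = lr

    def relevance(self, x):
        # Gaussian relevance: highest near this approximator's center.
        return np.exp(-0.5 * ((x - self.center) / self.width) ** 2)

    def predict(self, x):
        return self.w * (x - self.center) + self.b

    def update(self, x, target):
        # Gradient (LMS) step on the squared error, for this local model only.
        err = target - self.predict(x)
        self.b += self.lr * err
        self.w += self.lr * err * (x - self.center)

def competitive_predict(approximators, x):
    """Select the approximator with highest relevance at x and use its prediction."""
    best = max(approximators, key=lambda a: a.relevance(x))
    return best.predict(x), best

if __name__ == "__main__":
    # Toy usage: approximate sin(x) on [0, 2*pi] with 8 competing local models.
    rng = np.random.default_rng(0)
    approx = [LocalApproximator(c, 1.0) for c in np.linspace(0.0, 2 * np.pi, 8)]
    for _ in range(8000):
        x = rng.uniform(0.0, 2 * np.pi)
        _, best = competitive_predict(approx, x)
        best.update(x, np.sin(x))
```

Because each local model only ever fits samples from its own winning region, a change in the target elsewhere does not disturb it, which is the intuition behind the robustness to non-stationarity and biased sampling claimed in the abstract.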


Files in this item


This item appears in the following collection(s)
