A competitive strategy for function approximation in Q-learning

Agostini, Alejandro Gabriel; Celaya Llover, Enric

1248-A-Competitive-Strategy-for-Function-Approximation-in-Q-Learning.pdf (335,3Kb)

Document type: Text in conference proceedings

Publication date: 2011

Access conditions: Open access

All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to existing legal exemptions, its reproduction, distribution, public communication or transformation is prohibited without the authorization of the rights holder.

Abstract

In this work we propose an approach for generalization in continuous domain Reinforcement Learning that, instead of using a single function approximator, tries many different function approximators in parallel, each one defined in a different region of the domain. Associated with each approximator is a relevance function that locally quantifies the quality of its approximation, so that, at each input point, the approximator with highest relevance can be selected. The relevance function is defined using parametric estimations of the variance of the q-values and the density of samples in the input space, which are used to quantify the accuracy and the confidence in the approximation, respectively. These parametric estimations are obtained from a probability density distribution represented as a Gaussian Mixture Model embedded in the input-output space of each approximator. In our experiments, the proposed approach required a lesser number of experiences for learning and produced more stable convergence profiles than when using a single function approximator.
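The competitive selection step described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: each approximator is reduced here to a single one-dimensional Gaussian over the input with a constant q-value estimate, and the relevance formula (sample density divided by q-value variance) is an assumption chosen only to combine the two quantities the abstract names. The names `LocalApproximator` and `select_approximator` are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class LocalApproximator:
    """Toy stand-in for one approximator (assumed structure, not from the paper)."""
    center: float      # mean of the input samples this approximator has seen
    width: float       # standard deviation of its input density
    n_samples: int     # number of samples it has been trained on
    q_value: float     # its (constant) q-value estimate
    q_variance: float  # estimated variance of the q-values around that estimate

    def density(self, x: float) -> float:
        # Sample density at x: a Gaussian scaled by the sample count.
        z = (x - self.center) / self.width
        norm = self.width * math.sqrt(2.0 * math.pi)
        return self.n_samples * math.exp(-0.5 * z * z) / norm

    def relevance(self, x: float) -> float:
        # ASSUMED formula: high density (confidence) and low variance
        # (accuracy) both increase relevance. The paper's definition may differ.
        return self.density(x) / (self.q_variance + 1e-9)


def select_approximator(approximators, x: float) -> LocalApproximator:
    """Return the approximator with the highest relevance at input x."""
    return max(approximators, key=lambda a: a.relevance(x))


# Two narrow, accurate approximators and one broad, noisy one.
left = LocalApproximator(center=0.0, width=1.0, n_samples=50, q_value=1.0, q_variance=0.1)
right = LocalApproximator(center=5.0, width=1.0, n_samples=50, q_value=-2.0, q_variance=0.1)
broad = LocalApproximator(center=2.5, width=5.0, n_samples=50, q_value=0.0, q_variance=2.0)
pool = [left, right, broad]

for x in (0.5, 4.0, 12.0):
    winner = select_approximator(pool, x)
    print(f"x={x}: q estimate {winner.q_value}")
```

In this toy setup the narrow approximators win near their own centers, while the broad one takes over far from both, where the narrow ones have seen essentially no samples.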

Citation: Agostini, A.G.; Celaya Llover, E. A competitive strategy for function approximation in Q-learning. In: International Joint Conference on Artificial Intelligence. "Proceedings of the 2011 International Joint Conference on Artificial Intelligence". 2011, p. 1146-1151.

URI: http://hdl.handle.net/2117/14123

Publisher version: http://ijcai.org/papers11/Papers/IJCAI11-196.pdf
