Show simple item record

dc.contributor.author	Agostini, Alejandro Gabriel
dc.contributor.author	Celaya Llover, Enric
dc.contributor.other	Institut de Robòtica i Informàtica Industrial
dc.date.accessioned	2010-11-22T19:18:29Z
dc.date.available	2010-11-22T19:18:29Z
dc.date.created	2010
dc.date.issued	2010
dc.identifier.citation	Agostini, A.G.; Celaya, E. Reinforcement learning for robot control using probability density estimations. A: International Conference on Informatics in Control, Automation and Robotics. "7th International Conference on Informatics in Control, Automation and Robotics". Funchal: INSTICC Press. Institute for Systems and Technologies of Information, Control and Communication, 2010, p. 160-168.
dc.identifier.uri	http://hdl.handle.net/2117/10368
dc.description.abstract	The successful application of Reinforcement Learning (RL) techniques to robot control is limited by the fact that, in most robotic tasks, the state and action spaces are continuous, multidimensional, and, in essence, too large for conventional RL algorithms to work. The well-known curse of dimensionality makes it infeasible to use a tabular representation of the value function, which is the classical approach that provides convergence guarantees. When a function approximation technique is used to generalize among similar states, the convergence of the algorithm is compromised, since updates unavoidably affect an extended region of the domain; that is, some situations are modified in a way that has not actually been experienced, and the update may degrade the approximation. We propose an RL algorithm that uses a probability density estimation in the joint space of states, actions and Q-values as a means of function approximation. This allows us to devise an updating approach that, taking into account the local sampling density, avoids an excessive modification of the approximation far from the observed sample.
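The locality-preserving update described in the abstract can be illustrated with a toy sketch. Everything below (the `KernelQ` class, its parameters, the 1-D state space) is an invented simplification for illustration, not the algorithm of the paper: a simple Gaussian kernel stands in for the joint density estimate, weighting both queries and updates so that an update barely modifies the approximation far from the observed sample.

```python
import math


class KernelQ:
    """Toy Q-function approximator over a 1-D continuous state space.

    Stores (state, q) samples per discrete action and answers queries by
    Gaussian-kernel regression over the stored states.  An update nudges
    nearby stored Q-values toward the new target with a strength that
    decays with distance, so regions far from the observed sample are
    left essentially untouched.
    """

    def __init__(self, actions, bandwidth=0.2, lr=0.5):
        self.h = bandwidth                     # kernel width: how far an update reaches
        self.lr = lr                           # learning rate for local corrections
        self.samples = {a: [] for a in actions}

    def _weights(self, a, s):
        # Gaussian kernel weight of each stored sample relative to state s.
        return [math.exp(-((s - si) / self.h) ** 2)
                for si, _ in self.samples[a]]

    def q(self, s, a):
        # Kernel-weighted average of stored Q-values (Nadaraya-Watson).
        pts = self.samples[a]
        if not pts:
            return 0.0
        w = self._weights(a, s)
        total = sum(w)
        if total < 1e-12:                      # query far from every sample
            return 0.0
        return sum(wi * qi for wi, (_, qi) in zip(w, pts)) / total

    def update(self, s, a, target):
        # Locality-aware update: distant samples get a negligible correction.
        pts = self.samples[a]
        for i, ((si, qi), wi) in enumerate(zip(pts, self._weights(a, s))):
            pts[i] = (si, qi + self.lr * wi * (target - qi))
        pts.append((s, target))                # remember the observed sample itself
```

Updating at state 5.0 then barely changes the estimate learned at state 0.0, which is the behavior the abstract argues a naive global function approximator cannot guarantee.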
dc.format.extent	9 p.
dc.language.iso	eng
dc.publisher	INSTICC Press. Institute for Systems and Technologies of Information, Control and Communication
dc.rights	Attribution-NonCommercial-NoDerivs 3.0 Spain
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subject	Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Aprenentatge automàtic
dc.subject.lcsh	Machine learning
dc.subject.other	generalisation (artificial intelligence); intelligent robots; learning (artificial intelligence)
dc.title	Reinforcement learning for robot control using probability density estimations
dc.type	Conference report
dc.subject.lemac	Aprenentatge automàtic
dc.contributor.group	Universitat Politècnica de Catalunya. VIS - Visió Artificial i Sistemes Intel·ligents
dc.description.peerreviewed	Peer Reviewed
dc.subject.inspec	Classificació INSPEC::Cybernetics::Artificial intelligence::Learning (artificial intelligence)
dc.relation.publisherversion	http://www.icinco.org/Abstracts/2010/ICINCO_2010_Abstracts.htm
dc.rights.access	Restricted access - publisher's policy
local.identifier.drac	4127980
dc.description.version	Postprint (published version)
local.citation.author	Agostini, A.G.; Celaya, E.
local.citation.contributor	International Conference on Informatics in Control, Automation and Robotics
local.citation.pubplace	Funchal
local.citation.publicationName	7th International Conference on Informatics in Control, Automation and Robotics
local.citation.startingPage	160
local.citation.endingPage	168


Files in this item


This item appears in the following collection(s)
