Learning to solve complex tasks by reinforcement: a new algorithm

Martín Muñoz, Mario; Cortés García, Claudio Ulises

dc.contributor.author	Martín Muñoz, Mario
dc.contributor.author	Cortés García, Claudio Ulises
dc.contributor.other	Universitat Politècnica de Catalunya. Departament de Ciències de la Computació
dc.date.accessioned	2016-01-27T18:01:47Z
dc.date.available	2016-01-27T18:01:47Z
dc.date.issued	1995-04
dc.identifier.citation	Martin, M., Cortes, C. "Learning to solve complex tasks by reinforcement: a new algorithm". 1995.
dc.identifier.uri	http://hdl.handle.net/2117/82160
dc.description.abstract	In this paper, a new approach for learning to solve complex problems by reinforcement is proposed. In order to solve complex tasks the system is guided by a teacher who previously proposes intermediate general tasks to learn. The learnt behaviors to solve these tasks are added to the system's set of actions increasing its skills until it is able to easily solve the desired complex task. This approach uses a new reinforcement learning mechanism, robust to ambiguous information and able to learn general behaviors. These mechanisms are studied, described and finally tested with a set of experiments in a complex environment.
dc.format.extent	14 p.
dc.language.iso	eng
dc.relation.ispartofseries	LSI-95-14-R
dc.subject	Àrees temàtiques de la UPC::Informàtica
dc.subject.other	Machine learning
dc.subject.other	Reinforcement learning
dc.subject.other	Robotics
dc.subject.other	Reactive systems
dc.title	Learning to solve complex tasks by reinforcement: a new algorithm
dc.type	External research report
dc.contributor.group	Universitat Politècnica de Catalunya. KEMLG - Grup d'Enginyeria del Coneixement i Aprenentatge Automàtic
dc.rights.access	Open Access
local.identifier.drac	646804
dc.description.version	Postprint (published version)
local.citation.author	Martin, M.; Cortes, C.

Fitxers d'aquest items

Nom:: R95-14.ps
Mida:: 228,5Kb
Format:: Postscript

Visualitza/Obre

Aquest ítem apareix a les col·leccions següents

Reports de recerca [96]
Reports de recerca [1.107]

Mostra el registre d'ítem simple

UPCommons. Portal del coneixement obert de la UPC

Learning to solve complex tasks by reinforcement: a new algorithm

Fitxers d'aquest items

Aquest ítem apareix a les col·leccions següents

Explora