Learning to solve complex tasks by reinforcement: a new algorithm
dc.contributor.author | Martín Muñoz, Mario |
dc.contributor.author | Cortés García, Claudio Ulises |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Ciències de la Computació |
dc.date.accessioned | 2016-01-27T18:01:47Z |
dc.date.available | 2016-01-27T18:01:47Z |
dc.date.issued | 1995-04 |
dc.identifier.citation | Martin, M., Cortes, C. "Learning to solve complex tasks by reinforcement: a new algorithm". 1995. |
dc.identifier.uri | http://hdl.handle.net/2117/82160 |
dc.description.abstract | In this paper, a new approach for learning to solve complex problems by reinforcement is proposed. To solve complex tasks, the system is guided by a teacher who first proposes intermediate general tasks to learn. The behaviors learnt for these tasks are added to the system's set of actions, increasing its skills until it can easily solve the desired complex task. This approach uses a new reinforcement learning mechanism that is robust to ambiguous information and able to learn general behaviors. These mechanisms are studied, described, and finally tested with a set of experiments in a complex environment. |
dc.format.extent | 14 p. |
dc.language.iso | eng |
dc.relation.ispartofseries | LSI-95-14-R |
dc.subject | UPC subject areas::Computer science |
dc.subject.other | Machine learning |
dc.subject.other | Reinforcement learning |
dc.subject.other | Robotics |
dc.subject.other | Reactive systems |
dc.title | Learning to solve complex tasks by reinforcement: a new algorithm |
dc.type | External research report |
dc.contributor.group | Universitat Politècnica de Catalunya. KEMLG - Grup d'Enginyeria del Coneixement i Aprenentatge Automàtic |
dc.rights.access | Open Access |
local.identifier.drac | 646804 |
dc.description.version | Postprint (published version) |
local.citation.author | Martin, M.; Cortes, C. |
This item appears in the following collections
- Research reports [96]
- Research reports [1,107]