Active learning of manipulation sequences

Martínez Martínez, David; Alenyà Ribas, Guillem; Jiménez Schlegl, Pablo; Torras, Carme; Rossmann, Jürgen; Wantia, Nils; Aksoy, Eren Erdal; Haller, Simon; Piater, Justus

doi:10.1109/ICRA.2014.6907693

dc.contributor.author	Martínez Martínez, David
dc.contributor.author	Alenyà Ribas, Guillem
dc.contributor.author	Jiménez Schlegl, Pablo
dc.contributor.author	Torras, Carme
dc.contributor.author	Rossmann, Jürgen
dc.contributor.author	Wantia, Nils
dc.contributor.author	Aksoy, Eren Erdal
dc.contributor.author	Haller, Simon
dc.contributor.author	Piater, Justus
dc.contributor.other	Institut de Robòtica i Informàtica Industrial
dc.date.accessioned	2015-09-03T14:21:07Z
dc.date.available	2015-09-03T14:21:07Z
dc.date.issued	2014
dc.identifier.citation	Martínez, D., Alenyà, G., Jiménez, P., Torras, C., Rossmann, J., Wantia, N., Aksoy, E. E., Haller, S., Piater, J. Active learning of manipulation sequences. In: IEEE International Conference on Robotics and Automation. "Proceedings of the ICRA - 2014 - IEEE International Conference on Robotics and Automation". Hong Kong: 2014, p. 5671-5678.
dc.identifier.uri	http://hdl.handle.net/2117/76605
dc.description.abstract	We describe a system allowing a robot to learn goal-directed manipulation sequences such as steps of an assembly task. Learning is based on a free mix of exploration and instruction by an external teacher, and may be active in the sense that the system tests actions to maximize learning progress and asks the teacher if needed. The main component is a symbolic planning engine that operates on learned rules, defined by actions and their pre- and postconditions. Learned by model-based reinforcement learning, rules are immediately available for planning. Thus, there are no distinct learning and application phases. We show how dynamic plans, replanned after every action if necessary, can be used for automatic execution of manipulation sequences, for monitoring of observed manipulation sequences, or a mix of the two, all while extending and refining the rule base on the fly. Quantitative results indicate fast convergence using few training examples, and highly effective teacher intervention at early stages of learning.
dc.format.extent	8 p.
dc.language.iso	eng
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subject	UPC subject areas::Computer science::Artificial intelligence
dc.subject.other	learning (artificial intelligence)
dc.subject.other	planning (artificial intelligence)
dc.subject.other	uncertainty handling
dc.title	Active learning of manipulation sequences
dc.type	Conference report
dc.contributor.group	Universitat Politècnica de Catalunya. ROBiri - Grup de Robòtica de l'IRI
dc.identifier.doi	10.1109/ICRA.2014.6907693
dc.description.peerreviewed	Peer Reviewed
dc.subject.inspec	INSPEC classification::Cybernetics::Artificial intelligence
dc.relation.publisherversion	http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6907693
dc.rights.access	Open Access
local.identifier.drac	15271713
dc.description.version	Postprint (author’s final draft)
dc.relation.projectid	info:eu-repo/grantAgreement/EC/FP7/269959/EU/Intelligent observation and execution of Actions and manipulations/INTELLACT
local.citation.author	Martínez, D.; Alenyà, G.; Jiménez, P.; Torras, C.; Rossmann, J.; Wantia, N.; Aksoy, E. E.; Haller, S.; Piater, J.
local.citation.contributor	IEEE International Conference on Robotics and Automation
local.citation.pubplace	Hong Kong
local.citation.publicationName	Proceedings of the ICRA - 2014 - IEEE International Conference on Robotics and Automation
local.citation.startingPage	5671
local.citation.endingPage	5678
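
The abstract describes planning over learned rules (actions with pre- and postconditions), replanning after every action, and asking a teacher when needed. The following is a minimal illustrative sketch of that loop, not the authors' implementation: the names `Rule`, `plan`, `run`, `execute_sim`, and `teacher`, the toy peg-insertion world, and the breadth-first planner are all assumptions made for this example, and the naive "one observed transition becomes one rule" learner stands in for the model-based reinforcement learning named in the abstract.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """Hypothetical symbolic rule: an action with pre- and postconditions."""
    action: str
    pre: frozenset      # facts that must hold before the action
    add: frozenset      # facts the action makes true
    delete: frozenset   # facts the action makes false

    def applicable(self, state):
        return self.pre <= state

    def apply(self, state):
        return (state - self.delete) | self.add


def plan(state, goal, rules):
    """Breadth-first search over symbolic states; returns actions or None."""
    start = frozenset(state)
    queue, seen = deque([(start, [])]), {start}
    while queue:
        current, actions = queue.popleft()
        if goal <= current:
            return actions
        for rule in rules:
            if rule.applicable(current):
                nxt = rule.apply(current)
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append((nxt, actions + [rule.action]))
    return None


def run(state, goal, rules, execute, ask_teacher, max_steps=20):
    """Replan after every action; ask the teacher when no plan exists."""
    state, goal, trace = frozenset(state), frozenset(goal), []
    for _ in range(max_steps):
        if goal <= state:
            break
        steps = plan(state, goal, rules)
        action = steps[0] if steps else ask_teacher(state)
        new_state = frozenset(execute(state, action))
        # Learn only if no existing rule already predicts this transition.
        predicted = any(r.action == action and r.applicable(state)
                        and r.apply(state) == new_state for r in rules)
        if not predicted:
            rules.append(Rule(action, state, new_state - state, state - new_state))
        trace.append(action)
        state = new_state
    return state, trace


def execute_sim(state, action):
    """Toy simulator (assumption): pick a peg, then insert it into a hole."""
    s = set(state)
    if action == "pick" and "on_table" in s:
        s = (s - {"on_table"}) | {"in_hand"}
    elif action == "insert" and "in_hand" in s:
        s = (s - {"in_hand"}) | {"in_hole"}
    return s


def teacher(state):
    """Toy teacher (assumption): names the next useful action."""
    return "pick" if "on_table" in state else "insert"


if __name__ == "__main__":
    rules = []
    print(run({"on_table"}, {"in_hole"}, rules, execute_sim, teacher))
    print(run({"on_table"}, {"in_hole"}, rules, execute_sim, teacher))
```

In this toy setting the first call starts with an empty rule base and relies on the teacher for both actions. The second call reuses the two learned rules and plans without any teacher query, which mirrors the abstract's point that learned rules are immediately available for planning.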

Files in this item

Name: 1495-Active-learning-of-manipu ...
Size: 1.746 MB
Format: PDF

This item appears in the following collections

Conference papers/presentations [576]
Conference papers/presentations [252]
