Mostra el registre d'ítem simple

dc.contributor.authorMartínez Martínez, David
dc.contributor.authorAlenyà Ribas, Guillem
dc.contributor.authorTorras, Carme
dc.contributor.otherInstitut de Robòtica i Informàtica Industrial
dc.date.accessioned2016-03-30T17:25:59Z
dc.date.issued2015
dc.identifier.citationMartínez, D., Alenyà, G., Torras, C. V-MIN: efficient reinforcement learning through demonstrations and relaxed reward demands. A: AAAI Conference on Artificial Intelligence. "Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence". Austin: 2015, p. 2857-2863.
dc.identifier.urihttp://hdl.handle.net/2117/84917
dc.description.abstractReinforcement learning (RL) is a common paradigm for learning tasks in robotics. However, a lot of exploration is usually required, making RL too slow for high-level tasks. We present V-MIN, an algorithm that integrates teacher demonstrations with RL to learn complex tasks faster. The algorithm combines active demonstration requests and autonomous exploration to find policies yielding rewards higher than a given threshold Vmin. This threshold sets the degree of quality with which the robot is expected to complete the task, thus allowing the user to either opt for very good policies that require many learning experiences, or to be more permissive with sub-optimal policies that are easier to learn. The threshold can also be increased online to force the system to improve its policies until the desired behavior is obtained. Furthermore, the algorithm generalizes previously learned knowledge, adapting well to changes. The performance of V-MIN has been validated through experimentation, including domains from the international planning competition. Our approach achieves the desired behavior where previous algorithms failed.
dc.format.extent7 p.
dc.language.isoeng
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subjectÀrees temàtiques de la UPC::Informàtica::Robòtica
dc.subject.otherlearning (artificial intelligence)
dc.subject.otheruncertainty handling
dc.subject.otherreinforcement learning
dc.subject.otheractive learning
dc.subject.othermodel-based reinforcement learning
dc.titleV-MIN: efficient reinforcement learning through demonstrations and relaxed reward demands
dc.typeConference report
dc.contributor.groupUniversitat Politècnica de Catalunya. ROBiri - Grup de Robòtica de l'IRI
dc.description.peerreviewedPeer Reviewed
dc.subject.inspecClassificació INSPEC::Cybernetics::Artificial intelligence
dc.relation.publisherversionhttp://www.aaai.org/ocs/index.php/AAAI/AAAI15/paper/view/9634/9952
dc.rights.accessRestricted access - publisher's policy
local.identifier.drac17087237
dc.description.versionPostprint (author's final draft)
dc.date.lift10000-01-01
local.citation.authorMartínez, D.; Alenyà, G.; Torras, C.
local.citation.contributorAAAI Conference on Artificial Intelligence
local.citation.pubplaceAustin
local.citation.publicationNameProceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence
local.citation.startingPage2857
local.citation.endingPage2863


Fitxers d'aquest items

Imatge en miniatura

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple