Mostra el registre d'ítem simple

dc.contributorAngulo Bahón, Cecilio
dc.contributor.authorPérez Sala, Xavier
dc.date.accessioned2011-03-10T15:09:35Z
dc.date.available2011-03-10T15:09:35Z
dc.date.issued2010-09-03
dc.identifier.urihttp://hdl.handle.net/2099.1/11320
dc.description.abstractWe propose a robust system for automatic Robot Navigation in uncontrolled en- vironments. The system is composed by three main modules: the Arti cial Vision module, the Reinforcement Learning module, and the behavior control module. The aim of the system is to allow a robot to automatically nd a path that arrives to a pre xed goal. Turn and straight movements in uncontrolled environments are automatically estimated and controlled using the proposed modules. The Arti cial Vision module is responsible of obtaining a quanti ed representa- tion of the robot vision. This is done by the automatic detection and description of image interest points using state-of-the-art strategies. Once an image is described with a set of local feature vectors, the view is codi ed as a vector of visual words frequencies computed from a previous scene representation, which robustly discrim- inate among the di erent possible views of the robot in the environment. Local features changes in time are also used to estimate robot movement and consequently control robot behavior be means of the analysis of the computed vanishing points. The Reinforcement Learning (RL) module receives a vector quanti ed by the Arti cial Vision module plus robot sensor estimations. RL strategy computes the required state and reward. The state corresponds to the normalized received quan- ti ed vector together with the robot proximity sensor quanti cations. The reward value is computed using the distance between the robot and the goal. Given the high dimensionality of the problem we deal with, conventional RF strategies make the search problem unfeasible. Because of this reason, we propose the use of an al- gorithm from the articulation control eld, named Natural Actor-Critic, which can deal with high dimensionality problems. We tested the proposed methodology in uncontrolled environments using the Sony Aibo robot. The results shown that the robot looked for the goal, producing behavior changes based on experience, but without nding the optimal route. 3
dc.language.isoeng
dc.publisherUniversitat Politècnica de Catalunya
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Spain
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subjectÀrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Aprenentatge automàtic
dc.subjectÀrees temàtiques de la UPC::Informàtica::Robòtica
dc.subject.lcshReinforcement learning
dc.subject.lcshRobotics
dc.subject.otherArtificial Vision module
dc.subject.otherBehavior control module
dc.subject.otherReinforcement Learning module
dc.subject.otherRobot Navigation
dc.titleVision-based Navigation and Reinforcement Learning Path Finding for Social Robots
dc.typeMaster thesis
dc.subject.lemacAprenentatge per reforç
dc.subject.lemacRobòtica
dc.rights.accessOpen Access
dc.audience.educationlevelMàster
dc.audience.mediatorFacultat d'Informàtica de Barcelona
dc.audience.degreeMÀSTER UNIVERSITARI EN INTEL·LIGÈNCIA ARTIFICIAL (Pla 2009)


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple