Vision-based Navigation and Reinforcement Learning Path Finding for Social Robots

Pérez Sala, Xavier

dc.contributor	Angulo Bahón, Cecilio
dc.contributor.author	Pérez Sala, Xavier
dc.date.accessioned	2011-03-10T15:09:35Z
dc.date.available	2011-03-10T15:09:35Z
dc.date.issued	2010-09-03
dc.identifier.uri	http://hdl.handle.net/2099.1/11320
dc.description.abstract	We propose a robust system for automatic Robot Navigation in uncontrolled en- vironments. The system is composed by three main modules: the Arti cial Vision module, the Reinforcement Learning module, and the behavior control module. The aim of the system is to allow a robot to automatically nd a path that arrives to a pre xed goal. Turn and straight movements in uncontrolled environments are automatically estimated and controlled using the proposed modules. The Arti cial Vision module is responsible of obtaining a quanti ed representa- tion of the robot vision. This is done by the automatic detection and description of image interest points using state-of-the-art strategies. Once an image is described with a set of local feature vectors, the view is codi ed as a vector of visual words frequencies computed from a previous scene representation, which robustly discrim- inate among the di erent possible views of the robot in the environment. Local features changes in time are also used to estimate robot movement and consequently control robot behavior be means of the analysis of the computed vanishing points. The Reinforcement Learning (RL) module receives a vector quanti ed by the Arti cial Vision module plus robot sensor estimations. RL strategy computes the required state and reward. The state corresponds to the normalized received quan- ti ed vector together with the robot proximity sensor quanti cations. The reward value is computed using the distance between the robot and the goal. Given the high dimensionality of the problem we deal with, conventional RF strategies make the search problem unfeasible. Because of this reason, we propose the use of an al- gorithm from the articulation control eld, named Natural Actor-Critic, which can deal with high dimensionality problems. We tested the proposed methodology in uncontrolled environments using the Sony Aibo robot. The results shown that the robot looked for the goal, producing behavior changes based on experience, but without nding the optimal route. 3
dc.language.iso	eng
dc.publisher	Universitat Politècnica de Catalunya
dc.rights	Attribution-NonCommercial-NoDerivs 3.0 Spain
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subject	Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Aprenentatge automàtic
dc.subject	Àrees temàtiques de la UPC::Informàtica::Robòtica
dc.subject.lcsh	Reinforcement learning
dc.subject.lcsh	Robotics
dc.subject.other	Artificial Vision module
dc.subject.other	Behavior control module
dc.subject.other	Reinforcement Learning module
dc.subject.other	Robot Navigation
dc.title	Vision-based Navigation and Reinforcement Learning Path Finding for Social Robots
dc.type	Master thesis
dc.subject.lemac	Aprenentatge per reforç
dc.subject.lemac	Robòtica
dc.rights.access	Open Access
dc.audience.educationlevel	Màster
dc.audience.mediator	Facultat d'Informàtica de Barcelona
dc.audience.degree	MÀSTER UNIVERSITARI EN INTEL·LIGÈNCIA ARTIFICIAL (Pla 2009)

Fitxers d'aquest items

Nom:: Master thesis_ Xavier Pererz ...
Mida:: 13,15Mb
Format:: PDF

Visualitza/Obre

Aquest ítem apareix a les col·leccions següents

Master in Artificial Intelligence - MAI (Pla 2006) [73]

Mostra el registre d'ítem simple

UPCommons. Portal del coneixement obert de la UPC

Vision-based Navigation and Reinforcement Learning Path Finding for Social Robots

Fitxers d'aquest items

Aquest ítem apareix a les col·leccions següents

Explora